A long list of tech companies are rushing to give themselves the right to use people’s data to train AI::More companies are quietly giving themselves permission to use consumer data to train generative AI models and tools.

  • BertramDitore@lemmy.world
    link
    fedilink
    English
    arrow-up
    38
    ·
    1 year ago

    Fuck all this noise. I “give myself” the right to never say a nice thing about these duplicitous asshats. I wish I could update a document to make it legal to steal shit from millions of people.

    Ethics don’t matter to these pieces of crap, despite all the pandering. Don’t ever trust what they say. Just watch what they do, and more importantly, what they don’t do.

  • Ethalia@feddit.ch
    link
    fedilink
    English
    arrow-up
    14
    arrow-down
    1
    ·
    1 year ago

    Give or take how much time do you guys think until Meta or any other big corp starts scraping and selling data from the Fediverse?

  • AutoTL;DR@lemmings.worldB
    link
    fedilink
    English
    arrow-up
    9
    ·
    1 year ago

    This is the best summary I could come up with:


    Over the last couple of months, companies as varied as Twitter, or X, Microsoft, Instacart, Meta, and Zoom have rushed to update their terms of service and/or privacy policies to allow the collection of information and content from people and customers as data to train generative artificial intelligence models.

    Tweets, web searches and apparently even grocery shopping are now an opportunity for companies to build more predictive tools like Bard and ChatGPT, which is owned by OpenAI and receives considerable backing from Microsoft.

    Users were only prompted to review updated Terms in September, in an email from the company announcing its partnership with OpenAI as “a new third-party sub-processor.”

    However, Instacart also added language that left it a window to do just that with its own customers’ data, saying its license now allows it to “…otherwise enhance our machine learning algorithms, for the purposes of operating, providing, and improving the services.”

    “We’re incorporating generative-AI experiences into our products to assist with customers’ grocery shopping questions and help them make food-related decisions,” the spokesperson said.

    At the end of August, it created a simple form where users could “request” to opt out of their data being used to train AI models.


    The original article contains 1,151 words, the summary contains 200 words. Saved 83%. I’m a bot and I’m open source!

  • Drop_All_Users@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    arrow-down
    8
    ·
    1 year ago

    I don’t see the issue with this, don’t give your data to companies if you don’t them to use it. No one is forcing us to use these services, if you don’t want twitter to train their AI off your tweets, then don’t tweet.

    • Dizzy Devil Ducky@lemm.ee
      link
      fedilink
      English
      arrow-up
      10
      ·
      1 year ago

      The problem is if you signed up for an account for a social media service years ago and they suddenly decide (without telling you or getting your consent) to start training off your data, there is nothing you can do if you don’t know it’s happening.

      If the admins of the largest Lemmy instance didn’t tell us that they were gonna use our posts and comments as AI training data and everybody was none the wiser, how would we react? We wouldn’t until someone finds out and spills the beans.

      • Drop_All_Users@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        1 year ago

        I’m actually fine with it, in the case of Lemmy this is all public data, whether or not Lemmy admins are training AI on it, there is nothing to stop me from training my own AI models with this data.

        I think the larger issue is I don’t consider it “your data” once you put it on one of these sites. As soon as you take your own thought and put it on facebook/instagram/reddit/whatever, it’s now theirs, it lives in their databases, and frankly for a social media company it’s probably their most valuable asset.

        No one is forcing anyone to use social media, if you want your thoughts and actions to be your own I would recommend not putting them on the internet.