Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
AI Social Networks Google

Reddit Wants 'Deeper Integration' with Google in Exchange for Licensed AI Training Data (msn.com) 21

Reddit's content became AI training data last year when Google signed a $60 million-per-year licensing agreement. But now Reddit is "in early talks" about a new deal seeking "deeper integration with Google's AI products," reports Bloomberg (citing executives familiar with the discussions).

And Reddit also wants "a deal structure that could allow for dynamic pricing, where the social platform can be paid more" — with both Google and OpenAI — to "adequately reflect how valuable their data has been to these platforms..." Such licensing agreements are becoming more common as AI companies seek legal ways to train their models. OpenAI has also struck a series of partnership agreements with major media publishers such as Axel Springer SE, Time and Conde Nast to use their content in ChatGPT...

Reddit remains among the most cited sources across AI platforms, according to analytics company Profound AI. However, Reddit executives have noticed that traffic coming from Google has limited value, as users seeking answers to a specific question often don't convert into becoming active Redditors, the people said. Now, Reddit is engaging with product teams at Google in hopes of finding ways to send more of its users deeper into its ecosystem of community forums, according to the executives. In return, Reddit is looking for ways to provide more high-quality data to its AI partners. Discussions between Reddit and Google have been productive, the people said. "We're midflight in our data licensing deals and still learning, but what we have seen is that Reddit data is highly cited and valued," Reddit Chief Operating Officer Jen Wong said on July 31 during a call with investors. "We'll continue to evaluate as we go."

Reddit Wants 'Deeper Integration' with Google in Exchange for Licensed AI Training Data

Comments Filter:
  • by Anonymous Coward on Monday September 22, 2025 @04:15AM (#65675370)

    "We're midflight in our data licensing deals and still learning, but what we have seen is that Reddit data is highly cited and valued"

    In case anyone has any misconceptions, its "Reddit data", not "Reddit's users contributions" or "community" or any other words people use to convince themselves they are part of something important, other than making the shareholders of Reddit wealthier. Which is fine if that's what your kick is.

    • by sonamchauhan ( 587356 ) <sonamc&gmail,com> on Monday September 22, 2025 @05:17AM (#65675404) Journal

      "We're midflight in our data licensing deals and still learning, but what we have seen is that Reddit data is highly cited and valued"

      In case anyone has any misconceptions, its "Reddit data", not "Reddit's users contributions" or "community" or any other words people use to convince themselves they are part of something important,

      Reddit gains an irrevocable, sub-licensable, royalty-free license for user-created content posted on its site. So while the term "Reddit data" is somewhat inaccurate, it is a passable approximation of the truth. The more correct term would be "Data that Reddit has irrevocable, sub-licensable, royalty-free license to". But where's the fun in saying that? :-)

      "Reddit data" does obscure the fact that the Reddit user retains full copyright ownership of their comments. So if a user wished to, they could independently license their content to GOOG, AMZN, META, MSFT et al. Theoretically, all the other users in a Reddit thread could make same decision, cutting Reddit out of the licensing loop entirely.

      But the logistics of organising this are onerous. Even if a few crucial users in a thread refuse to license content, there will be 'gaps' and intelligibility and utility for AI training will suffer.

      But if replies briefly quote the specific content they are responding to (as I am doing here), the context becomes much more clear. In that case, individual comments become much more intelligible and useful. Legally speaking, there should be no problem here because brief quotations fall under fair use.

      Concievably, GOOG and other browsers manufacturers could offer to store your user-generated content in a browser repository ("keep a record what you wrote", like Windows Recall) or in the cloud, with the option of licensing the content to GOOG. They could also generate AI-summaries of the context you were responding to.

      • by diffract ( 7165501 ) on Monday September 22, 2025 @06:11AM (#65675440)
        I haven't used Reddit a lot but from the few years I've been on it, the admins usually do something that pisses off the users. Users retaliate by jumping ship to an alternative site put together in a weekend. A few hours or days pass by, people forgot whatever Reddit management has done and go back to using the site normally.
      • by mysidia ( 191772 )

        Reddit gains an irrevocable, sub-licensable, royalty-free license for user-created content posted on its site.

        You mean Reddit claims to have an irrevocable royalty-free license. Based on an assumption some clause they stuck in the fine print of a user agreement would be upholdable against every user of their website - including for completely unforseen users by users who posted them 5, 10 years ago, etc --- Including some users who are unaware of the terms at the time of uploading items to Reddit's serv

  • No. (Score:2, Troll)

    by eclectro ( 227083 )

    Google does not want its A.I. to "inherit" Reddit's users' lop-sided dysfunctionality!

  • Never contribute (Score:5, Insightful)

    by AlexSledge ( 10102306 ) on Monday September 22, 2025 @05:28AM (#65675412)

    Itâ(TM)s a hard and fast rule to never contribute to the discussions Google sends me to for this very reason. Especially Reddit.

    Time to take your balls and go home people. Take but never give back.

  • And Reddit also wants "a deal structure that could allow for dynamic pricing, where the social platform can be paid more" — with both Google and OpenAI — to "adequately reflect how valuable their data has been to these platforms..."

    Well that's some delusional bullshit to say the least.

    Funny thing about training; Nothing has a need to train forever. If you’re human and endlessly training for a job, then you’re probably not actually learning. You’re struggling to learn and not absorbing enough to be able to do the job after training is adequate.

    Now ask yourself. Why does anyone think AI will need to train in licensed perpetuity, in order to become Good E-Nuff AI that will take 50% of human jobs rather easily?

    AI hard

  • by Shakes Fist ( 10502847 ) on Monday September 22, 2025 @06:38AM (#65675452)
    Google bot is going to learn from Reddit bots? How does that help anything?
  • Stop feeding AI and use a tool like Redact [redact.dev] to delete your old Reddit comments. It will also help keep your online footprint small, help maintain your anonymity and reduce the risk of you being doxxed.

    • >Stop feeding AI

      He says while feeding the AI's with his /. comments. (/irony)

    • by Krneki ( 1192201 )

      Reddit is blocking your ability to delete your comments, it will only remove them from your view, nothing more.

      Because way too many started to delete anything useful and right now, most refuse to post anything on Social Media, so only crap is generated, mostly from the failed part of society who has nothing better to do then to write crap on the net.

  • by TheWho79 ( 10289219 ) on Monday September 22, 2025 @07:33AM (#65675518)
    Put my first bbs online in 1982 on a Commodore 64.
    I ran bbs's for over a decade.
    Then worked for a national computer manufactures', running their 24 line support bbs in 1990 for 5 years.
    Put my first forum (bbs) online in 1998.
    Built it into a million page views a day system (that is still running today).
    What I am saying is I know online communities and the quality of those communities.
    Somewhere along there, I missed a day in school or something? How the f* is reddit garbage worth this to Google? Google paying reddit is so absurd to me, that I simply can't fathom it. This absolutely makes no sense to me why Google would want this. The only value of reddit I see, is to spam it with backlinks.

    Explain it to me like I am 5 please.

    • Conversational channels might be useful in online 'authenticity', niche channels are one of the few places on the internet to find good crowdsourced information unless you find some good old fashioned forums.
  • Ok redit folk (Score:5, Insightful)

    by zkiwi34 ( 974563 ) on Monday September 22, 2025 @08:17AM (#65675552)
    I wonder if you knew that Redit could and would sell off all your data. Once more the users become the product.
  • Alter the deal (Score:5, Insightful)

    by NotEmmanuelGoldstein ( 6423622 ) on Monday September 22, 2025 @09:46AM (#65675652)
    When Google started providing on-page excerpts of the destination web-site, it was the end of Google respecting anybody. Those web-sites complained that Google was stealing/blocking traffic. Australia made a law forcing Google to pay for content. Google made a deal to thwart similar legislation being enacted in other countries.

    How did Reddit think, the same wouldn't happen to them?

    At best, Reddit was selling the answers (of its subscribers) to Google. Now, Reddit realizes, they've priced their answers incorrectly and want a new deal.

    To keep selling their answers, Reddit needed a way to the encourage skilled subscribers to answer more questions. Instead, they took control of moderation and kicked-out anyone who complained. At the time, many people declared that Reddit needed subscribers more than subscribers needed Reddit.

    I expect that to have consequences, too.

"The voters have spoken, the bastards..." -- unknown

Working...