


Reddit Wants 'Deeper Integration' with Google in Exchange for Licensed AI Training Data (msn.com) 21
Reddit's content became AI training data last year when Google signed a $60 million-per-year licensing agreement. But now Reddit is "in early talks" about a new deal seeking "deeper integration with Google's AI products," reports Bloomberg (citing executives familiar with the discussions).
And Reddit also wants "a deal structure that could allow for dynamic pricing, where the social platform can be paid more" — with both Google and OpenAI — to "adequately reflect how valuable their data has been to these platforms..." Such licensing agreements are becoming more common as AI companies seek legal ways to train their models. OpenAI has also struck a series of partnership agreements with major media publishers such as Axel Springer SE, Time and Conde Nast to use their content in ChatGPT...
Reddit remains among the most cited sources across AI platforms, according to analytics company Profound AI. However, Reddit executives have noticed that traffic coming from Google has limited value, as users seeking answers to a specific question often don't convert into becoming active Redditors, the people said. Now, Reddit is engaging with product teams at Google in hopes of finding ways to send more of its users deeper into its ecosystem of community forums, according to the executives. In return, Reddit is looking for ways to provide more high-quality data to its AI partners. Discussions between Reddit and Google have been productive, the people said. "We're midflight in our data licensing deals and still learning, but what we have seen is that Reddit data is highly cited and valued," Reddit Chief Operating Officer Jen Wong said on July 31 during a call with investors. "We'll continue to evaluate as we go."
And Reddit also wants "a deal structure that could allow for dynamic pricing, where the social platform can be paid more" — with both Google and OpenAI — to "adequately reflect how valuable their data has been to these platforms..." Such licensing agreements are becoming more common as AI companies seek legal ways to train their models. OpenAI has also struck a series of partnership agreements with major media publishers such as Axel Springer SE, Time and Conde Nast to use their content in ChatGPT...
Reddit remains among the most cited sources across AI platforms, according to analytics company Profound AI. However, Reddit executives have noticed that traffic coming from Google has limited value, as users seeking answers to a specific question often don't convert into becoming active Redditors, the people said. Now, Reddit is engaging with product teams at Google in hopes of finding ways to send more of its users deeper into its ecosystem of community forums, according to the executives. In return, Reddit is looking for ways to provide more high-quality data to its AI partners. Discussions between Reddit and Google have been productive, the people said. "We're midflight in our data licensing deals and still learning, but what we have seen is that Reddit data is highly cited and valued," Reddit Chief Operating Officer Jen Wong said on July 31 during a call with investors. "We'll continue to evaluate as we go."
Reminder that you are the product (Score:4, Insightful)
"We're midflight in our data licensing deals and still learning, but what we have seen is that Reddit data is highly cited and valued"
In case anyone has any misconceptions, its "Reddit data", not "Reddit's users contributions" or "community" or any other words people use to convince themselves they are part of something important, other than making the shareholders of Reddit wealthier. Which is fine if that's what your kick is.
Re:Reminder that you are the product (Score:4, Informative)
"We're midflight in our data licensing deals and still learning, but what we have seen is that Reddit data is highly cited and valued"
In case anyone has any misconceptions, its "Reddit data", not "Reddit's users contributions" or "community" or any other words people use to convince themselves they are part of something important,
Reddit gains an irrevocable, sub-licensable, royalty-free license for user-created content posted on its site. So while the term "Reddit data" is somewhat inaccurate, it is a passable approximation of the truth. The more correct term would be "Data that Reddit has irrevocable, sub-licensable, royalty-free license to". But where's the fun in saying that? :-)
"Reddit data" does obscure the fact that the Reddit user retains full copyright ownership of their comments. So if a user wished to, they could independently license their content to GOOG, AMZN, META, MSFT et al. Theoretically, all the other users in a Reddit thread could make same decision, cutting Reddit out of the licensing loop entirely.
But the logistics of organising this are onerous. Even if a few crucial users in a thread refuse to license content, there will be 'gaps' and intelligibility and utility for AI training will suffer.
But if replies briefly quote the specific content they are responding to (as I am doing here), the context becomes much more clear. In that case, individual comments become much more intelligible and useful. Legally speaking, there should be no problem here because brief quotations fall under fair use.
Concievably, GOOG and other browsers manufacturers could offer to store your user-generated content in a browser repository ("keep a record what you wrote", like Windows Recall) or in the cloud, with the option of licensing the content to GOOG. They could also generate AI-summaries of the context you were responding to.
Re:Reminder that you are the product (Score:4)
Re: (Score:3)
Reddit gains an irrevocable, sub-licensable, royalty-free license for user-created content posted on its site.
You mean Reddit claims to have an irrevocable royalty-free license. Based on an assumption some clause they stuck in the fine print of a user agreement would be upholdable against every user of their website - including for completely unforseen users by users who posted them 5, 10 years ago, etc --- Including some users who are unaware of the terms at the time of uploading items to Reddit's serv
No. (Score:2, Troll)
Google does not want its A.I. to "inherit" Reddit's users' lop-sided dysfunctionality!
Never contribute (Score:5, Insightful)
Itâ(TM)s a hard and fast rule to never contribute to the discussions Google sends me to for this very reason. Especially Reddit.
Time to take your balls and go home people. Take but never give back.
Temporary Value. Nothing more. (Score:1)
And Reddit also wants "a deal structure that could allow for dynamic pricing, where the social platform can be paid more" — with both Google and OpenAI — to "adequately reflect how valuable their data has been to these platforms..."
Well that's some delusional bullshit to say the least.
Funny thing about training; Nothing has a need to train forever. If you’re human and endlessly training for a job, then you’re probably not actually learning. You’re struggling to learn and not absorbing enough to be able to do the job after training is adequate.
Now ask yourself. Why does anyone think AI will need to train in licensed perpetuity, in order to become Good E-Nuff AI that will take 50% of human jobs rather easily?
AI hard
Re:Temporary Value. Nothing more. (Score:5, Insightful)
GIGO (Score:4)
Delete your Reddit history (Score:2)
Stop feeding AI and use a tool like Redact [redact.dev] to delete your old Reddit comments. It will also help keep your online footprint small, help maintain your anonymity and reduce the risk of you being doxxed.
Re: (Score:2)
He says while feeding the AI's with his /. comments. (/irony)
Re: (Score:3)
Reddit is blocking your ability to delete your comments, it will only remove them from your view, nothing more.
Because way too many started to delete anything useful and right now, most refuse to post anything on Social Media, so only crap is generated, mostly from the failed part of society who has nothing better to do then to write crap on the net.
Re: (Score:2)
aye, editing is a maybe. But as someone who has deleted several accounts over the decade, I know what a PITA the process is. Error codes and timeouts on editing are getting more annoying by the day and now with the tech, an AI agent can flag you immediately for your erratic behavior.
The only win is to post 0 useful comments on Social Media, just brain farts to bring the cesspool of bots into the ground.
i also told ChatGPT to ignore Reddit in web searches but he has no wish to comply.
Why is Reddit Spam worth anything? (Score:4)
I ran bbs's for over a decade.
Then worked for a national computer manufactures', running their 24 line support bbs in 1990 for 5 years.
Put my first forum (bbs) online in 1998.
Built it into a million page views a day system (that is still running today).
What I am saying is I know online communities and the quality of those communities.
Somewhere along there, I missed a day in school or something? How the f* is reddit garbage worth this to Google? Google paying reddit is so absurd to me, that I simply can't fathom it. This absolutely makes no sense to me why Google would want this. The only value of reddit I see, is to spam it with backlinks.
Explain it to me like I am 5 please.
Re: (Score:3)
Ok redit folk (Score:5, Insightful)
Alter the deal (Score:5, Insightful)
How did Reddit think, the same wouldn't happen to them?
At best, Reddit was selling the answers (of its subscribers) to Google. Now, Reddit realizes, they've priced their answers incorrectly and want a new deal.
To keep selling their answers, Reddit needed a way to the encourage skilled subscribers to answer more questions. Instead, they took control of moderation and kicked-out anyone who complained. At the time, many people declared that Reddit needed subscribers more than subscribers needed Reddit.
I expect that to have consequences, too.