OpenAI Will No Longer Use Customer Data To Train Its Models by Default (techcrunch.com) 15
OpenAI is changing the terms of its API developer policy, aiming to address developer -- and user -- criticism. From a report: Starting today, OpenAI says that it won't use any data submitted through its API for "service improvements," including AI model training, unless a customer or organization opts in. In addition, the company is implementing a 30-day data retention policy for API users with options for stricter retention "depending on user needs," and simplifying its terms and data ownership to make it clear that users own the input and output of the models. Greg Brockman, the president and chairman of OpenAI, asserts that some of these changes aren't changes necessarily -- it's always been the case that OpenAI API users own input and output data, whether text, images or otherwise. But the emerging legal challenges around generative AI and customer feedback prompted a rewriting of the terms of service, he says.
So? (Score:2)
Re:So? (Score:5, Funny)
Now is no time to piss off Skynet
Re: (Score:2)
What are the rules for scraping data off the Internet? I don't see why AI would have any special rules.
The rules are that if they can get your date they will used it and if you don't like that you can can try to sue them but their legion of lawyers and their inexhaustible cash supply will ensue that you will be bankrupt long before they even notice that a small portion of their chump change is missing.
Re: (Score:3)
The rules are, scrape all data until asked if that's ethical. Then? Scrape all data but provide the end-user a checkbox that says "don't scrape my data."
Too bad (Score:3)
Lawyers ruin everything again. What else is new.
Re: (Score:2)
When I read this, I thought it was less about lawyers and more about "poisoning the well".
Taybot lasted less than a day when end users could directly influence its training.
So competitors paying people to enter in garbage that breaks the model is in the realm of possibility.
Re: (Score:2)
Re: (Score:2)
Re: (Score:3)
edible posts are the future
Re: (Score:2)
This is unfortunate (Score:3)
Re:This is unfortunate (Score:4, Informative)
Ownership? (Score:2)
...simplifying its terms and data ownership to make it clear that users own the input and output of the models.
I read that courts have been ruling that LLM & AI output isn't & can't be copyright.
New pool of data? (Score:2)
Could it be they have a new pool of data now? I mean, it's easy to give up what you no longer need.