OpenAI Will No Longer Use Customer Data To Train Its Models by Default (techcrunch.com) 15

Posted by msmash on Wednesday March 01, 2023 @02:44PM from the how-about-that dept.

OpenAI is changing the terms of its API developer policy, aiming to address developer -- and user -- criticism. From a report: Starting today, OpenAI says that it won't use any data submitted through its API for "service improvements," including AI model training, unless a customer or organization opts in. In addition, the company is implementing a 30-day data retention policy for API users with options for stricter retention "depending on user needs," and simplifying its terms and data ownership to make it clear that users own the input and output of the models. Greg Brockman, the president and chairman of OpenAI, asserts that some of these changes aren't changes necessarily -- it's always been the case that OpenAI API users own input and output data, whether text, images or otherwise. But the emerging legal challenges around generative AI and customer feedback prompted a rewriting of the terms of service, he says.

OpenAI Will No Longer Use Customer Data To Train Its Models by Default

This discussion has been archived. No new comments can be posted.

Load All Comments

Search 15 Comments Log In/Create an Account

Comments Filter:

So? (Score:2)

by oldgraybeard ( 2939809 ) writes:

What are the rules for scraping data off the Internet? I don't see why AI would have any special rules.
- Re:So? (Score:5, Funny)
  
  by MightyMartian ( 840721 ) writes: on Wednesday March 01, 2023 @03:02PM (#63333529) Journal
  
  Now is no time to piss off Skynet
  
- Re: (Score:2)
  
  by Savage-Rabbit ( 308260 ) writes:
  
  What are the rules for scraping data off the Internet? I don't see why AI would have any special rules.
  The rules are that if they can get your date they will used it and if you don't like that you can can try to sue them but their legion of lawyers and their inexhaustible cash supply will ensue that you will be bankrupt long before they even notice that a small portion of their chump change is missing.
- Re: (Score:3)
  
  by nightflameauto ( 6607976 ) writes:
  
  The rules are, scrape all data until asked if that's ethical. Then? Scrape all data but provide the end-user a checkbox that says "don't scrape my data."
Too bad (Score:3)

by TwistedGreen ( 80055 ) writes: on Wednesday March 01, 2023 @03:17PM (#63333565)

Lawyers ruin everything again. What else is new.

- Re: (Score:2)
  
  by forgotten_my_nick ( 802929 ) writes:
  
  When I read this, I thought it was less about lawyers and more about "poisoning the well".
  Taybot lasted less than a day when end users could directly influence its training.
  So competitors paying people to enter in garbage that breaks the model is in the realm of possibility.
Re: (Score:2)

by account_deleted ( 4530225 ) writes:

Comment removed based on user account deletion
- Re: (Score:2)
  
  by account_deleted ( 4530225 ) writes:
  
  Comment removed based on user account deletion
  - Re: (Score:3)
    
    by zlives ( 2009072 ) writes:
    
    edible posts are the future
    - - Re: (Score:2)
        
        by account_deleted ( 4530225 ) writes:
        
        Comment removed based on user account deletion
This is unfortunate (Score:3)

by JoshuaZ ( 1134087 ) writes: on Wednesday March 01, 2023 @03:44PM (#63333643) Homepage

There was no legal issue with using customer data to help train it, and it essentially gave them a growing pool where the more people who used it the better training data they had. This is a hamstring of the system of unclear advantage. They might be worried that someone would be hesitant to use the software if it makes it more likely that the information from that might leak into a later AI, but if that was the concern that could have been alleviated by an opt-out rather than opt-in system.

- Re:This is unfortunate (Score:4, Informative)
  
  by Fly Swatter ( 30498 ) writes: on Wednesday March 01, 2023 @05:14PM (#63333859) Homepage
  
  It is quite simply that they don't want to be sued into oblivion for the input data or output data that their tool uses or generates. This pushes the legal copyright infringement responsibilities and other such legal problems onto their clients.
  
Re: (Score:2)

by account_deleted ( 4530225 ) writes:

Comment removed based on user account deletion
New pool of data? (Score:2)

by freedom_surfer ( 203272 ) writes:

Could it be they have a new pool of data now? I mean, it's easy to give up what you no longer need.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

OpenAI Will No Longer Use Customer Data To Train Its Models by Default (techcrunch.com) 15

OpenAI Will No Longer Use Customer Data To Train Its Models by Default More Login

OpenAI Will No Longer Use Customer Data To Train Its Models by Default

So? (Score:2)

Re:So? (Score:5, Funny)

Re: (Score:2)

Re: (Score:3)

Too bad (Score:3)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3)

Re: (Score:2)

This is unfortunate (Score:3)

Re:This is unfortunate (Score:4, Informative)

Re: (Score:2)

New pool of data? (Score:2)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot