Sarah Silverman Sues Meta, OpenAI for Copyright Infringement (reuters.com)

Posted by msmash on Monday July 10, 2023 @10:45AM from the how-about-that dept.

Comedian Sarah Silverman and two authors have filed copyright infringement lawsuits against Meta and OpenAI for allegedly using their content without permission to train artificial intelligence language models. From a report: The proposed class action lawsuits filed by Silverman, Richard Kadrey and Christopher Golden in San Francisco federal court Friday allege Facebook parent company Meta and ChatGPT maker OpenAI used copyrighted material to train chat bots. The lawsuits underscore the legal risks developers of chat bots face when using troves of copyrighted material to create apps that deliver realistic responses to user prompts. Silverman, Kadrey and Golden allege Meta and OpenAI used their books without authorization to develop their so-called large language models, which their makers pitch as powerful tools for automating tasks by replicating human conversation. In their lawsuit against Meta, the plaintiffs allege that leaked information about the company's artificial intelligence business shows their work was used without permission.

Copyright Infringement. Can I also sue? (Score:5, Insightful)

by m00sh ( 2538182 ) writes: on Monday July 10, 2023 @10:48AM (#63673769)

Can I also sue?
I'm sure slashdot also got scraped. Since slashdot says I own my posts, I'm sure my works are in the dataset as well.

- Re:Copyright Infringement. Can I also sue? (Score:5, Funny)
  
  by Narcocide ( 102829 ) writes: on Monday July 10, 2023 @10:55AM (#63673801) Homepage
  
  Definitely, but to make it worth the money you're gonna have to prove something of value was stolen.
  
  - Re: (Score:3)
    
    by AleRunner ( 4556245 ) writes:
    
    Definitely, but to make it worth the money you're gonna have to prove something of value was stolen.
    I can see you haven't been paying any attention to the various RIAA stories on here. Copyright is very carefully designed to avoid any need to prove value. Ask the guys at pirate bay.
    - Re: (Score:2)
      
      by Bahbus ( 1180627 ) writes:
      
      The RIAA loves to weaponize copyright law, but it's not as powerful as people like to think it is. These authors and their lawyers clearly don't even understand the basics of copyright law.
      - Re: (Score:3)
        
        by The New Guy 2.0 ( 3497907 ) writes:
        
        Sarah Silverman is not a newbie here. She was around during the RIAA/MPAA copyright debates.
        You know, as foul-mouthed as she historically has been, she's actually Sesame Street trained in performing. There was a time Comedy Central was stand-up based instead of movie based and she was there. She's also a part of Crank Yankers, which was definitely a Sesame Workshop project by its use of puppets.
        Seems like we need to bring back a few of these copyright-aware performers, such as the Analog Hole group because
        
        Re: (Score:2)
        
        by sjames ( 1099 ) writes:
        
        But what if the AI doesn't steal your joke, it just parses it, makes minute changes to a matrix of numbers (or perhaps not even that) and moves on. It's really no different than a person hearing your joke.
        Unless the AI spits the exact joke out again, there isn't even the vaguest hint of a copyright violation.
        
        Re: (Score:2)
        
        by The New Guy 2.0 ( 3497907 ) writes:
        
        AI can't steal, humans program it to steal.
      - Re: (Score:2)
        
        by codebase7 ( 9682010 ) writes:
        
        The RIAA
        These authors and their lawyers clearly don't even understand the basics of copyright law.
        They don't need to. They wrote it and will have it altered if necessary.
        
        Further, the OpenAI groups and others like it have really made a basic blunder here with their laissez faire scraping. I'd imagine they will have a very difficult time in court. With increasing scrutiny from the bench as more and more groups sue them over copyright infringement. (To say nothing about the online services losing profits due to the scraping as well....)
        
        Re: (Score:3)
        
        by Bahbus ( 1180627 ) writes:
        
        Further, the OpenAI groups and others like it have really made a basic blunder here with their laissez faire scraping.
        This has nothing to do with copyright law. Whether anyone agrees with HOW they got their data or not, none of it was obtained by violating copyrights. They are suing on the theory that GPT could reproduce copyrighted material because it was trained on it. That isn't how LLMs work, at all. Nor do we file lawsuits on theoreticals that have never happened (unless you are extremely stupid).
        Now I will admit that perhaps the lawyers do know better but they do not care because they get paid regardless of whether t
    - Re: (Score:3)
      
      by Freischutz ( 4776131 ) writes:
      
      Definitely, but to make it worth the money you're gonna have to prove something of value was stolen.
      I can see you haven't been paying any attention to the various RIAA stories on here. Copyright is very carefully designed to avoid any need to prove value. Ask the guys at pirate bay.
      If people use PirateBay to download free media, pay for an internet connection needed to do that and then pay for a device with which to spend a portion of their lifespan that they will never get back to consuming that media, that media has value. Same for data that OpenAI scrapes from the net, same for the news summaries Google scrapes off of news sites and then pockets advertising bucks while showing that to their users knowing full well that most of them will not bother to click through to the content cr
      - Re: (Score:3)
        
        by suutar ( 1860506 ) writes:
        
        While you make a good argument for showing that the work has value, that doesn't change the fact that copyright law is structured to not need to prove that in court. Statutory damages are used more often, because the statutory values are generally higher than the highest reasonable value they could put on a given work.
        
        Re: (Score:2)
        
        by Freischutz ( 4776131 ) writes:
        
        If people use PirateBay to download free media, pay for an internet connection needed to do that and then pay for a device with which to spend a portion of their lifespan that they will never get back to consuming that media, that media has value. Same for data that OpenAI scrapes from the net, same for the news summaries Google scrapes off of news sites and then pockets advertising bucks while showing that to their users knowing full well that most of them will not bother to click through to the content creator. Nobody would bother to do any of this if this content didn't have value.
        While you make a good argument for showing that the work has value, that doesn't change the fact that copyright law is structured to not need to prove that in court. Statutory damages are used more often, because the statutory values are generally higher than the highest reasonable value they could put on a given work.
        The fact that you jumped through all those hoops and spent all that money to get the content and then spent a valuable and non-reclaimable portion of your lifespan consuming it isn't proof enough that you valued it? There are many things wrong with copyright law that need fixing but don't tell me that pirated/scraped content has no value to the people pirating/scraping it and if it has value to them it's not beyond the realms of reason to expect them to compensate original content creator in some way. Nobod
        
        Re: (Score:2)
        
        by suutar ( 1860506 ) writes:
        
        I didn't say they didn't value it, nor did I disagree with your statement. I'm just pointing out that they usually don't bother to prove that in court, and the law is structured so they don't have to.
        
        ppl want to get payed (Score:2)
        
        by Thud457 ( 234763 ) writes:
        
        "laughs sardonically in Ted Nelson"
        
        Re: (Score:2)
        
        by hawk ( 1151 ) writes:
        
        Not my area of law these days, but in the US, statutory damages are only available for post-registration damages.
        So most published stuff, yes, but most forum posts would be limited to actual damages.
        hmm, now that I think of it, my dissertation was registered . . .
        hawk, esq.
      - Re: (Score:2)
        
        by Ungrounded Lightning ( 62228 ) writes:
        
        If people use PirateBay to download free media, pay for an internet connection needed to do that and then pay for a device with which to spend a portion of their lifespan that they will never get back to consuming that media, that media has value.
        The incremental cost of the bit of internet connection used to download a song or video is tiny, or zero if you're on a flat-rate line or under your cap. Even if you take the entire cost of the connection, including installation, the computer used to access it, an
        
        Especially since ... (Score:2)
        
        by Ungrounded Lightning ( 62228 ) writes:
        
        But it has value. Look what was spent to download and use it.
        [But their cost per post was tiny. US statutory damages would be at least $750 per post.] Take the statutory damages and run.
        Not to mention that the court wouldn't use your "what they spend to make an unauthorized copy" measure of actual value - and the ones the courts use are almost as hard to prove - which is why statutory minimum damages are part of the law.
- Re: (Score:2)
  
  by Entrope ( 68843 ) writes:
  
  Have you registered your copyrights with the US Copyright Office? If not, you cannot sue over them in federal courts. (You only have to register them before the lawsuit, not before the alleged infringement. https://www.natlawreview.com/a... [natlawreview.com])
  - Re: (Score:2)
    
    by m00sh ( 2538182 ) writes:
    
    You don't have to register to copyright.
    It is automatically copyrighted upon creation.
    - Re: (Score:2)
      
      by StormReaver ( 59959 ) writes:
      
      You don't have to register to copyright.
      That's true. But it's also true that you have to register your copyrights in order to sue in Federal court.
- Re: (Score:2)
  
  by znrt ( 2424692 ) writes:
  
  Can I also sue?
  please do! the more the merrier.
  this is getting better and better, nothing to date has shown in a more hilarious way what an aberration the clusterfuck of copyright laws really is. and all it took was a chatbot! comedy gold. oh wait, somebody already said that? sue me!
- Re:Copyright Infringement. Can I also sue? (Score:4, Insightful)
  
  by thegarbz ( 1787294 ) writes: on Monday July 10, 2023 @01:51PM (#63674601)
  
  Of course you can, but like Sarah Silverman you will lose because no element of copyright covers the idea of limiting what a person can do with information they obtain. You can copyright availability and reproduction, but you can't copyright the idea that someone may learn something. If ChatGPT was spitting out Silverman's prose verbatim you can argue copyright, if on the other hand they paraphrased or simply replicated a style it is covered under being transformative.
  
  - Re: (Score:2)
    
    by znrt ( 2424692 ) writes:
    
    Of course you can, but like Sarah Silverman you will lose because no element of copyright covers the idea of limiting what a person can do with information they obtain.
    i wouldn't be so quick in assuming courts can reasonably resolve this dilemma in a rational way in a context where fundamental concepts have been distorted to the extreme for spurious reasons. now wait for not just this, but the incoming tsunami of complaints and even class action suits ...
    this is going to bite the whole industry in the ass unless they manage to nail openAI for it and shut it down, and good luck with that.
    gorgeous!
    - Re: (Score:2)
      
      by m00sh ( 2538182 ) writes:
      
      Better register pirate gpt domain then.
- - Re: I think they excluded /. (Score:4, Funny)
    
    by FudRucker ( 866063 ) writes: on Monday July 10, 2023 @11:55AM (#63674015)
    
    Sarah Silverman needs to pose for a photoshoot covered in hot grits and greased up yoda dolls
    
  - Re: (Score:2)
    
    by godrik ( 1287354 ) writes:
    
    Do you have direct links to that? For AI training purposes, of course!
  - Re: (Score:2)
    
    by Samantha Wright ( 1324923 ) writes:
    
    Are you sure? Slashdot itself isn't even 'full' of that. Now, apk's HOSTS file crusade, on the other hand...
    - Re: (Score:2)
      
      by rsilvergun ( 571051 ) writes:
      
      I've done an exhaustive peer-reviewed study commissioned by the head of the mit's top science department. Also it was just a dumb joke referencing a stupid meme from a long time ago. Anyway you can find the study in last month's journal of science. It's why the thing was the size of a phone book.
      
      And always remember the Hot grits and greased up Yoda dolls are in all our hearts. Or elsewhere in the case of the yoda dolls
If the copyrighted work is actually in the LLM (Score:2, Insightful)

by rsilvergun ( 571051 ) writes:

Without being changed, then they're going to have a problem. But if it's encoded in some fashion, then that's the definition of a derivative work. I don't know enough about the actual technology behind LLMs to say one way or another.

That said there's so much money at stake with this technology and given a judge's tendency to side with the bigger property owner I think the owners of the LLMs are going to win.
- Re:If the copyrighted work is actually in the LLM (Score:5, Insightful)
  
  by godrik ( 1287354 ) writes: on Monday July 10, 2023 @11:41AM (#63673979)
  
  Well, if you look at the size of the model compared to the size of the input data, then you realize that ChatGPT is much more a fuzzy compressor than anything else. It's about 40TB of raw data to build a model of 1TB. That's essentially the ratio of zip compression.
  Playing with it, I was able to easily generate as-is copies of code that is available online. So yeah, these lawsuits don't seem frivolous. Whether they'll win or not is a different question, but the suit is reasonable.
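
The size comparison in the comment above can be checked with a little arithmetic. The following is a minimal sketch, not a measurement of any real model: the 40 TB and 1 TB figures are the commenter's own estimates, the sample text and the names `compression_ratio`, `claimed_ratio` and `sample_ratio` are made up for illustration, and Python's standard `zlib` module stands in for "zip". The ratio a lossless compressor achieves depends heavily on the input, so the second number is only meaningful for whatever text is fed in.

```python
import zlib

# Sizes quoted in the comment above; the commenter's estimates, not verified figures.
RAW_TB = 40.0
MODEL_TB = 1.0


def compression_ratio(original_size: float, compressed_size: float) -> float:
    """Return original/compressed as a plain number (40.0 means 40:1)."""
    if compressed_size <= 0:
        raise ValueError("compressed_size must be positive")
    return original_size / compressed_size


claimed_ratio = compression_ratio(RAW_TB, MODEL_TB)

# Hypothetical sample, used only for illustration; substitute any text you like.
sample = (
    "To promote the progress of science and useful arts, by securing for "
    "limited times to authors and inventors the exclusive right to their "
    "respective writings and discoveries. Copyright attaches automatically "
    "when a work is created, but registration is needed before suing in "
    "US federal court."
).encode("utf-8")

deflated = zlib.compress(sample, 9)  # DEFLATE, the algorithm most zip tools use
sample_ratio = compression_ratio(len(sample), len(deflated))

# Lossless compression restores every byte; a "fuzzy" compressor makes no such promise.
round_trip_ok = zlib.decompress(deflated) == sample

print(f"ratio implied by the comment: {claimed_ratio:.0f}:1")
print(f"zlib ratio on the sample:     {sample_ratio:.2f}:1")
print(f"lossless round trip:          {round_trip_ok}")
```

Running it on a larger body of ordinary prose gives a fairer comparison than this short sample. The round-trip check is the part relevant to the legal argument in this thread: zlib reproduces its input exactly, whereas the comment only claims that some inputs could be regenerated as-is.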
  
Better not read her book then. (Score:2)

by Lendrick ( 314723 ) writes:

I don't want to be sued for copyright infringement because I could summarize it.
The AI companies were lazy and greedy... (Score:3)

by williamyf ( 227051 ) writes: on Monday July 10, 2023 @11:02AM (#63673837)

... and did not want to collate the training material according to licenses.
Instead of using material in the public domain, suitable creative commons licenses, or under licenses (in the case of SW) like BSD, MIT, MPL, DWTFYWT (libcaca) that are more conducive, they got greedy, and used all material available, regardless of copyright...
Nor did they want to pay license holders (say, NYT, WaPo, etc.) to get access to their collection of material.
Well, you harvest what you sow. And you have very deep pockets to pay the army of lawyers that will defend you against the lawsuits.
Enjoy!
PS: Of course, they can (and will) destroy all instances of the current AI crops, and from chatGPT 5 (and all the others like Llama) onwards, they can do right by the licenses... On vera.

- Re:The AI companies were lazy and greedy... (Score:5, Insightful)
  
  by hjf ( 703092 ) writes: on Monday July 10, 2023 @11:17AM (#63673903) Homepage
  
  I, too, cannot wait for the future where I'll be sued by a book publisher for using the knowledge learned from a book they own copyright to.
  
  - Re:The AI companies were lazy and greedy... (Score:4, Insightful)
    
    by brunes69 ( 86786 ) writes: <slashdot@@@keirstead...org> on Monday July 10, 2023 @12:00PM (#63674033)
    
    That is exactly what the OpenAI lawyers will argue
    Meanwhile the lawyers on the other side will argue that the model constitutes a "derivative work"
    And it will probably end up at the supreme court because this is all new territory with no clear answer.
    
    - Re: (Score:2)
      
      by StormReaver ( 59959 ) writes:
      
      Meanwhile the lawyers on the other side will argue that the model constitutes a "derivative work"[.]
      I suspect that Sarah's lawyers will argue that the OpenAI model could not have been trained on her works without it copying her works, and that is the crux of the copyright violation.
      And it will probably end up at the supreme court because this is all new territory with no clear answer.
      I disagree. This has been covered by copyright law time and time again, and I think the results have been pretty consistent across jurisdictions. There is nothing new or novel in this case. OpenAI copied and used copyrighted works to make and improve its commercial product for commercial gain, which is a prima facie copyright v
      - Re: (Score:2)
        
        by null etc. ( 524767 ) writes:
        
        OpenAI copied and used copyrighted works to make and improve its commercial product for commercial gain, which is a prima facie copyright violation.
        Google also copies and uses copyrighted works when its web browser shows copyrighted material to users. Google copies and uses copyrighted works when its search engine makes local copies of copyrighted works to inform the datasets that power Google's revenue-generating search engine. Google copied and used copyrighted books when it made book excerpts searchabl
  - Re:The AI companies were lazy and greedy... (Score:5, Informative)
    
    by ArchieBunker ( 132337 ) writes: on Monday July 10, 2023 @12:16PM (#63674115)
    
    You could try reading the points of the lawsuit. https://storage.courtlistener.... [courtlistener.com]
    Indeed, when ChatGPT is prompted, ChatGPT generates summaries of Plaintiffs’
    copyrighted works—something only possible if ChatGPT was trained on Plaintiffs’ copyrighted works.
    
    - Re: (Score:2)
      
      by chmod a+x mojo ( 965286 ) writes:
      
      What is your point?
      It's actually vastly far more likely, just from how these models are trained and how they operate, that the books in question were never even used, and random reviews from the internet were scraped. Then the model learns the general content of the reviews and spits out something similar to all of the reviews, but not the same as any of them. Just as if you read 5-10 reviews of the book and were asked to generalize what it was about...
    - Re: (Score:2)
      
      by sabt-pestnu ( 967671 ) writes:
      
      Artists are trained on prior copyrighted works. Musicians are trained on prior copyrighted works. Authors are trained on prior copyrighted works.
      ChatGPT is trained on prior copyrighted works. And yet ChatGPT (and not artists nor musicians) is infringing?
      ChatGPT generates summaries of Plaintiffs' copyrighted works -- something only possible if ChatGPT was trained on Plaintiffs' copyrighted works.
      A reviewer can generate a summary of Plaintiff's copyrighted works, only if the reviewer was trained on Plaintiff's copyrighted works.
      From the suit:
      57. Because the OpenAI Language Models cannot function without the expressive information extracted from Pl
  - Re: (Score:2)
    
    by Xylantiel ( 177496 ) writes:
    
    But that happens all the time already for derivative works. A numerically optimized (aka "trained") semi-arbitrary algorithm (aka "AI") is a derivative work of its training data. You are not. You may or may not produce a derivative work when using information out of a book, it depends on the output just like it always has. The difference here is that an optimized algorithm is a work unto itself because it is a representation of its training data. (You see how most of the problem exists because machine
    - Re: (Score:2)
      
      by hjf ( 703092 ) writes:
      
      What's clear to me is that you use the word "training" and "parameters" without knowing what they mean.
- Re:The AI companies were lazy and greedy... (Score:4, Insightful)
  
  by Holi ( 250190 ) writes: on Monday July 10, 2023 @11:18AM (#63673907)
  
  Considering how copyright has been so thoroughly abused by congress and corporations to the point it makes a mockery of its stated goal "To promote the progress of science and useful arts, by securing for limited times to authors and inventors the exclusive right to their respective writings and discoveries", I have little concern about these millionaires and their tears.
  I don't see how you can read that clause and consider the author's life plus 70 as securing a "limited time" to authors. It seems to fly in the face of the wording of the constitution. Let's remember when the founders wrote it copyright had a maximum length of 28 years.
  
  - Re: (Score:2)
    
    by quantaman ( 517394 ) writes:
    
    Considering how copyright has been so thoroughly abused by congress and corporations to the point it makes a mockery of its stated goal "To promote the progress of science and useful arts, by securing for limited times to authors and inventors the exclusive right to their respective writings and discoveries", I have little concern about these millionaires and their tears.
    I don't see how you can read that clause and consider the author's life plus 70 as securing a "limited time" to authors. It seems to fly in the face of the wording of the constitution. Let's remember when the founders wrote it copyright had a maximum length of 28 years.
    More to the point.
    To promote the progress of science and useful arts kinda suggests that the US constitution kinda demands that it be possible to train LLMs with relatively few copyright restrictions.
    Of course, what the US constitution says and what US courts say can be very different things.
- Re: (Score:2)
  
  by thegarbz ( 1787294 ) writes:
  
  ... and did not want to collate the training material according to licenses.
  And why would they? What concept in copyright law restricts your ability to learn from what you see? To be clear the case here isn't about how they acquired the material, or replication of it, the case here is based on the idea of training the algorithm and it producing something that resembles the style of another artist. That is not a copyrightable concept.
  - Re: (Score:2)
    
    by StormReaver ( 59959 ) writes:
    
    What concept in copyright law restricts your ability to learn from what you see?
    Nothing in copyright law restricts your ability to learn from what you see. It does, however, restrict your ability to copy copyrighted works without permission. LLMs do not see and learn. They copy, analyze, collate, aggregate, statistically predict, and reproduce. They are, at their core, copying engines. It's their primary function.
Copying vs Learning (Score:2)

by ThosLives ( 686517 ) writes:

If I go to a website and read it, which does involve a "copy" to put it on my screen, but I learn it - is that copyright infringement, or just learning?
I'm pretty sure that reading something and learning it isn't copyright infringement. I'm fairly certain LLMs aren't "copying" the raw material any more than a person memorizing a favorite quote is.
Interesting times, to see how this gets worked out in our legislative and social systems.
- Re:Copying vs Learning (Score:5, Insightful)
  
  by hjf ( 703092 ) writes: on Monday July 10, 2023 @11:20AM (#63673915) Homepage
  
  This is exactly the point. And slashdot's bipolarity about this issue is amazing.
  You have an overlap in "AI haters" and "Copyright haters". They see AI as the greater threat, so they (think) they will ally with copyright holders to destroy AI, and then go back to hating copyright holders.
  The reality is that copyright holders will only get stronger if they win, AND, they will try to use this same concept to reserve their rights to demand compensation from people using their books to do their jobs.
  Saying that OpenAI shouldn't use publicly available (but copyrighted) information without compensating the authors, is like saying an engineer can't build a bridge without paying royalties to the author of the books they learned from.
  
  - Re: (Score:2)
    
    by mobby_6kl ( 668092 ) writes:
    
    Yeah it's pretty wild to see a big chunk of the tech world, like here, on arstechnica etc., suddenly flip to be pro-copyright. Ah yes now copyright holder gets to control all possible future uses of their work, forever. Very cool.
    IMO as long as the models don't reproduce the original work more or less completely and highly accurately (let's say the few overtrained examples in Stable Diffusion, like the Mona Lisa) then there's no real case here. Of course I'm not the one deciding it so who knows what the cour
- Copyright says I have the right to restrict access (Score:2)
  
  by Bruce66423 ( 1678196 ) writes:
  
  At its core, it's an attempt to ensure authors get paid for the books and articles that they publish. If it's on your screen either it's there legally or it's there illegally. If it's there legally, then permission will have been given in some way. If illegally - because the content has been uploaded to a server somewhere without the permission of the author, then you're in violation of copyright.
  The argument of the authors here is, presumably, that they got access to material that was not intended for general consu
- - Re: (Score:3)
    
    by Travelsonic ( 870859 ) writes:
    
    Saying a neural net is replicating how humans, and groups of biological neurons, learn - irrespective of if it is accurate or not - isn't "making it human," it's literally comparing functionality.
    Saying an emulator runs a SEGA Genesis game in a cycle perfect manner isn't saying it's literally a real SEGA Genesis, it's comparing functionality.
    That is, comparing functionality isn't making a greater statement about the thing whose functionality is being compared, it's literally looking at just the functional
Let me guess (Score:3, Funny)

by PPH ( 736903 ) writes: on Monday July 10, 2023 @11:12AM (#63673879)

In the training data set, Silverman's material was tagged as "not funny".

- Re: (Score:2)
  
  by Petersko ( 564140 ) writes:
  
  Women are fighting an uphill battle in comedy, because of the widespread belief that women just aren't funny. Unfortunately, Netflix has released such a torrent of unfunny women specials that the belief is really reinforced. Seriously, Netflix... good god. The litany of awful female standups you have presented to the world is doing real damage. Stop it. Exercise a little restraint.
  I think Sarah Silverman is funny. I've seen "A Speck of Dust" twice. It's at least as good as Patton Oswalt's most recent offeri
  - Ali Wong (Score:2)
    
    by pr0t0 ( 216378 ) writes:
    
    I don't know, Ali Wong: Baby Cobra was pretty damned funny, and I rarely find stand-up all that funny regardless of the gender of the person with a mic in their hand.
  - - Re: (Score:2)
      
      by Petersko ( 564140 ) writes:
      
      What about my post triggered you? I'm curious. Was it that I dared to suggest there are some good female comedians? Because there are.
      - Re: (Score:3, Informative)
        
        by Beyond_GoodandEvil ( 769135 ) writes:
        
        ... right-wing talking points, lame jokes, misogyny .... this must be an incel convention.
        Remember kids, slut shaming is bad, as is using sexual orientation as an insult, but incel is still socially acceptable because there are no bad words, just bad targets.
        
        Re: (Score:2)
        
        by Petersko ( 564140 ) writes:
        
        I see. My question has been answered. Thank you.
    - Re: (Score:2)
      
      by PPH ( 736903 ) writes:
      
      ... right-wing talking points, lame jokes, misogyny .... this must be an incel convention.
      It's the root of "the left can't meme" ideology. When the basis of one's political ideology is to be offended by practically everything, humor goes right out the window.
      I actually think Silverman is pretty funny. Edgy too. But she pushes against some of the left wing's pet dogmas for a laugh. And so she gets labeled as "not funny" by them. Which is even funnier when a woman is doing it. The resulting exploding heads are absolutely hilarious.
  - - Re: (Score:2)
      
      by Petersko ( 564140 ) writes:
      
      I don't mind Patton Oswalt. He's a decent straight up the middle comedian. Not in my top twenty, though. But he's a working professional who puts out a product with decent care about its quality.
      I love standup comedy. I consider it the 20th century's great contribution to art. My tastes are not particularly confined. I love humour that rides the grotesque, but I also admire those who can find a way to be humourous without being edgy.
      If you want to know where my tastes don't lie, I don't think Seinfeld is f
      - Re: (Score:2)
        
        by laxguy ( 1179231 ) writes:
        
        we could be friends
- Re: (Score:2)
  
  by dohzer ( 867770 ) writes:
  
  I don't know... that Paris Hilton roast was funny as fuck.
She's right but... (Score:2)

by rwrife ( 712064 ) writes:

...this is basically biting the hand that feeds you and is a good way to get yourself removed from key services that are used to promote your identity and business.
- Re: (Score:2, Insightful)
  
  by HBI ( 10338492 ) writes:
  
  I don't necessarily agree about infringement but you do have a good point. This lawsuit is a tacit acknowledgement that their careers are over.
- - Re: (Score:2)
    
    by Travelsonic ( 870859 ) writes:
    
    and its derivative 'me too' copycats will have no real power and will have to pay content creators fees and percentages to use their material.
    Isn't that assuming they are successful (when we don't know, and it can go either way at least currently), and also missing that if any model ends up open source, it becomes much closer to impossible to actually eradicate (as opposed to maybe slow down)?
    
    Seems like some big question marks that make certainty about a particular outcome a bit misguided.
Useright (Score:2)

by Sloppy ( 14984 ) writes:

The article's headline says they're suing over copyright infringement. But then by the middle of the article..
In their lawsuit against Meta, the plaintiffs allege that leaked information about the company’s artificial intelligence business shows their work was used without permission.
.. the question of whether or not it was copied seems to have been abandoned, and they're actually suing over how the data was used.
Anyone know for how many years an artist of a creative work, is the sole person who use a
- Re: (Score:2)
  
  by fluffernutter ( 1411889 ) writes:
  
  You can learn from it, you just can't make a digital representation of it for yourself unless you pay for it. The AI learning is still a digital representation.
Did they buy the book? (Score:2)

by superdave80 ( 1226592 ) writes:

Silverman, Kadrey and Golden allege Meta and OpenAI used their books without authorization to develop their so-called large language models
Well, if they purchased a copy of the book, then how did they infringe on the copyright? If I buy a book, I can 'use' the book as much as I want without further 'authorization'. Even funnier in TFA:
“retains knowledge of particular works in the training dataset," the lawsuit says.
Um, am I not allowed to 'retain knowledge' of the shit I bought and paid for? I don't even understand where the infringement is...
Sad times (Score:2)

by sizzlinkitty ( 1199479 ) writes:

These people should be honored that their content was used to train advanced intelligence systems.
- Re: (Score:2)
  
  by Narcocide ( 102829 ) writes:
  
  No, the rights of citizens do not automatically extend to arbitrary software constructs just because your business plan relies on them doing so.
  - Re: (Score:2)
    
    by thegarbz ( 1787294 ) writes:
    
    No, the rights of citizens do not automatically extend to arbitrary software constructs just because your business plan relies on them doing so.
    Conversely, the rights do not magically cease applying simply because a software construct is used. Unless the law specifically allows or disallows the people or the software, the concept in it applies equally to both.
    You're reading this, and may learn something as a result. I do not have the ability to sue you for copyright. Likewise if you were a computer the same concept applies. If you regurgitate it verbatim it falls under copyright. If you quote a section of it for the purpose of discussion it falls under
  - - Re: (Score:3)
      
      by jonsmirl ( 114798 ) writes:
      
      Under copyright law transformative works are allowed as fair use. Running something through a LLM sure seems transformative to me. However, I do see a need to ensure that the LLM can't be convinced to serve up an unaltered copy of the work without proper attribution and permission.
      - Re: (Score:3)
        
        by DarkRookie2 ( 5551422 ) writes:
        
        The output maybe, but this is about the training data they hold on to.
        If they didn't pay for a license for a work, they should not be using it to generate output.
        
        Re: (Score:2)
        
        by jonsmirl ( 114798 ) writes:
        
        In that case they'd just need to own a copy of the book. If this was scraped off the Internet from a site offering illegal copies, then that site should be shut down. I am not a fan of the philosophy that says if you own a physical book you can't make a PDF of it for your own convenience as long as both copies are in your possession. My position is that you bought a copy of the story, not a physical object. As long as only one of the forms is in use at a time, then it is ok.
        Alternatively, they could just
        
        Re: (Score:2)
        
        by DarkRookie2 ( 5551422 ) writes:
        
        I do PDF suck
        
        Doesn't work like that. (Score:2)
        
        by denzacar ( 181829 ) writes:
        
        Neither owning nor renting/borrowing a copy of... say... a Taylor Swift CD, makes it legal for you to remix a version of your own for commercial purposes.
        Which is what any "AI" machine basically does - it produces remixes.
        And all those "AI" companies are running their remix-makers for commercial purposes.
        I can't wait until it dawns on IP companies that AI companies owe them millions.
        
        Re:Doesn't work like that. (Score:4, Insightful)
        
        by Comboman ( 895500 ) writes: on Monday July 10, 2023 @12:13PM (#63674095)
        
        >>Neither owning nor renting/borrowing a copy of... say... a Taylor Swift CD, makes it legal for you to remix a version of your own for commercial purposes.
        Which is what any "AI" machine basically does - it produces remixes.
        If a budding musician listens to Taylor Swift CDs and is inspired to create their own music in that style, that is perfectly legal (and not a remix). That is what AI is doing. All human creators have petabytes of "training data" in their brains from everything they've ever seen and heard, and use that to create new output. AI is no different.
        
        
        Re: (Score:2)
        
        by denzacar ( 181829 ) writes:
        
        A software construct is neither a musician nor even a person.
        Nor is it capable of creativity as it lacks even bare minimum of consciousness, let alone a human level of self-awareness or awareness at all, beyond that which is plugged into it.
        By humans, mind you. For human purposes. Mainly monetary ones.
        None of which the software construct understands, comprehends or values. It is a copy-machine with predictive auto-correct spliced in.
        It is a possession. It has a copyright or a trademark sign next to its name
        
        Re: (Score:2)
        
        by Travelsonic ( 870859 ) writes:
        
        Who is arguing that it is - versus that people are trying to make it operate like one? I mean, let's say we have a hypothetical perfect prosthetic leg that works like a real leg in every way - but was mechanical. Saying it operates like a real leg isn't saying it is a real leg, it's just literally comparing the functionality.
        My point being, I think there is a miscommunication or confusion regarding the difference between comparing something, and part of something.
        Also, a human arguing that human consciousness is nothing but training data, in essence arguing for lack of personal agency
        How so?
        
        Re: (Score:3)
        
        by Comboman ( 895500 ) writes:
        
        >>A software construct is neither a musician nor even a person.
        Why should that matter?
        >>Nor is it capable of creativity as it lacks even bare minimum of consciousness, let alone a human level of self-awareness or awareness at all, beyond that which is plugged into it. By humans, mind you. For human purposes. Mainly monetary ones.
        Again, why should that matter? Creativity (i.e. the ability to create something) is not consciousness. You seem to be arguing that consciousness is a prerequisite for
        
        Re: (Score:2)
        
        by quantaman ( 517394 ) writes:
        
        >>Neither owning nor renting/borrowing a copy of... say... a Taylor Swift CD, makes it legal for you to remix a version of your own for commercial purposes.
        Which is what any "AI" machine basically does - it produces remixes.
        If a budding musician listens to Taylor Swift CDs and is inspired to create their own music in that style, that is perfectly legal (and not a remix). That is what AI is doing. All human creators have petabytes of "training data" in their brains from everything they've ever seen and heard, and use that to create new output. AI is no different.
        It's part of what humans do, a lot of what we do when communicating or being creative is our of auto-complete. But there's another level we have (editorial?) that isn't really in AIs.
        But back to the example, if that budding musician then creates what they think is an original tune, but is really a ripoff of a Taylor Swift song they once heard and forgot about then they get sued.
        But LLMs will happily readily do the same on a scale where it's pretty much impossible to enforce those copyrights.
        There's also the
      - Re: (Score:2)
        
        by pavon ( 30274 ) writes:
        
        That is one of four factors *considered* in determining fair use. Something can be radically transformative and still be considered copyright infringement if it fares poorly on the other factors. The effect upon the work's value is a big one when talking about ML, particularly where the value the model provides is competing directly with the works used to train it (for example, stock art).
- Re:without merit (Score:4, Insightful)
  
  by Joce640k ( 829181 ) writes: on Monday July 10, 2023 @12:05PM (#63674063) Homepage
  
  When you publish a book, you are granting people the right to read it. I would assume this right extends to software as well.
  I wonder what books and TV shows Sarah Silverman has read/watched before becoming famous and writing her own.
  I'm sure her talent didn't develop in a vacuum.
  
  - Re: (Score:2)
    
    by trawg ( 308495 ) writes:
    
    I wonder what books and TV shows Sarah Silverman has read/watched before becoming famous and writing her own.
    I'm sure her talent didn't develop in a vacuum.
    Did she buy the books, or borrow them from a library? Did she watch the TV shows on Netflix, or buy them on DVD, or rent them from Blockbuster? Did she see the movies in a theatre?
    If so, she consumed all the content in accordance with copyright law and in a way that ensures rights holders are paid for their work.
    A better analogy would be if Sarah Silverman downloaded everything off Pirate Bay and then went on to build a career from pirated works, where she contributed nothing to the rights holders and cont
    - Re: (Score:2)
      
      by null etc. ( 524767 ) writes:
      
      Being licensed to consume materials is not the same as being licensed to create derivative works.
- Re: (Score:2)
  
  by whitroth ( 9367 ) writes:
  
  No. It's illegal copying. Backing up is one thing... after you pay for it. Scraping it is reproducing FOR PROFIT, since the chatbots are making money.
  - Re: (Score:2)
    
    by Travelsonic ( 870859 ) writes:
    
    Scraping it is reproducing FOR PROFIT,
    IDK, it sounds like a potentially tenuous link - since the scraping is done for the training - the resulting model may be for profit, but even so, is that alone gonna be significant / can it be ruled as significant without effing up things already done for profit that fall under fair use?
- Re: (Score:2)
  
  by Entrope ( 68843 ) writes:
  
  Reading a book does not involve making a copy [cornell.edu], which under the Copyright Act is an object containing a work that has been "fixed by any method now known or later developed, and from which the work can be perceived, reproduced, or otherwise communicated, either directly or with the aid of a machine or device".
  Legally, loading a book or program into a computer's RAM is making a copy of the work. To riff on the example up-thread, one might have granted Slashdot and its readers licenses to copy one's comments
- Re: (Score:3)
  
  by Bahbus ( 1180627 ) writes:
  
  Defending what reason? And how? Sarah Silverman clearly doesn't understand basic copyright law, since no infringement happened.
  - Re: (Score:2)
    
    by Petersko ( 564140 ) writes:
    
    Sarah Silverman doesn't need to understand copyright law. Her lawyer does.
    - Re: (Score:3)
      
      by Bahbus ( 1180627 ) writes:
      
      Clearly, her lawyer does not.
- Re: (Score:2)
  
  by godrik ( 1287354 ) writes:
  
  My position is that we should blow copyright law entirely. But as long as we are going to have it, we shouldn't make an exception "because AI".
- Re: (Score:2)
  
  by Registered Coward v2 ( 447531 ) writes:
  
  If it's public, it's fair use. I understand the concept of licences, but I also know that they're not enforceable when the text is public.
  Except it is not public, in the sense public means not copyrighted. If the act of making something available for view meant it was now "public" and thus any license is no longer enforceable, the whole concept of copyright and licenses would no longer exist. While some may argue that is great, it has a lot of consequences. The GPL could no longer be enforceable, since it makes the code text
  The AI companies behind chat bot make the same argument you do. I suspect, however, if someone finds a way to use say, M
  - Re: (Score:2)
    
    by Freischutz ( 4776131 ) writes:
    
    If it's public, it's fair use. I understand the concept of licences, but I also know that they're not enforceable when the text is public.
    Except it is not public, in the sense public means not copyrighted. If the act of making something available for view meant it was now "public" and thus any license is no longer enforceable, the whole concept of copyright and licenses would no longer exist. While some may argue that is great, it has a lot of consequences. The GPL could no longer be enforceable, since it makes the code text
    The AI companies behind chat bot make the same argument you do. I suspect, however, if someone finds a way to use say, Meta's, chatbot to recreate the data used to train it to train their ChatBot, Meta would cry foul and unleash the lawyers. I doubt they'd buy the argument that once the ChatBot answered the answer was "public."
    True, they would cry havoc and let loose the dogs of law. Furthermore, with everybody from Tucker Carlson through Fox News, OANN, Breitbart, the Daily Wire to Elon Musk whining about ChatGPT being 'preachy', 'ideological' and 'woke', you'd think that the right-wingnuts would be happy about ChatGPT not being fed any more material from scathing left-wing satirists who've made a successful career of savagely mocking and lampooning the political right with a particular focus on its lunatic fringe. The only thi
- Re: JustStopOligarchy (Score:2)
  
  by blue trane ( 110704 ) writes:
  
  Where is the Tim Berners-Lee of AI?
- Re: (Score:2)
  
  by NoMoreDupes ( 8410441 ) writes:
  
  by suing the hot companies du jour, is what I see.
  And she'll continue to be far more popular than you can ever hope to be.
  - Re: (Score:2)
    
    by _merlin ( 160982 ) writes:
    
    If popularity is correlated with quality, McDonalds serves the best food in America.
    - Re: (Score:3)
      
      by NoMoreDupes ( 8410441 ) writes:
      
      If popularity is correlated with quality, McDonalds serves the best food in America.
      That comment conclusively explains Trumpism.
      - Re: (Score:2)
        
        by NoMoreDupes ( 8410441 ) writes:
        
        Still far preferable to being a Trumper.
- Re: (Score:2)
  
  by ichthus ( 72442 ) writes:
  
  Totally off topic (so, down-mod me if you need to), but...
  
  Has anyone ever seen Sarah Silverman and Eli Roth in the same place at the same time?
- Re: (Score:2)
  
  by fluffernutter ( 1411889 ) writes:
  
  Is AI more like a person reading a passage and memorizing it or a computer memorizing the passage by scanning it and saving it to the drive? The former is not illegal; the latter is. I would say AI is more like the latter because it's simply not biological. It is a digital representation, just not in a digital file format.
- Re: (Score:2)
  
  by fluffernutter ( 1411889 ) writes:
  
  If you took the book, memorized it, then typed it out like AI does then it would still be a copyright violation.

