AI Technology

MIT Apologizes, Permanently Pulls Offline Huge Dataset That Taught AI Systems To Use Racist, Misogynistic Slurs (theregister.com) 128

MIT has taken offline its highly cited dataset that trained AI systems to potentially describe people using racist, misogynistic, and other problematic terms. From a report: The database was removed this week after The Register alerted the American super-college. And MIT urged researchers and developers to stop using the training library, and to delete any copies. "We sincerely apologize," a professor told us. The training set, built by the university, has been used to teach machine-learning models to automatically identify and list the people and objects depicted in still images. For example, if you show one of these systems a photo of a park, it might tell you about the children, adults, pets, picnic spreads, grass, and trees present in the snap. Thanks to MIT's cavalier approach when assembling its training set, though, these systems may also label women as whores or bitches, and Black and Asian people with derogatory language. The database also contained close-up pictures of female genitalia labeled with the C-word.

Applications, websites, and other products relying on neural networks trained using MIT's dataset may therefore end up using these terms when analyzing photographs and camera footage. The problematic training library in question is 80 Million Tiny Images, which was created in 2008 to help produce advanced object detection techniques. It is, essentially, a huge collection of photos with labels describing what's in the pics, all of which can be fed into neural networks to teach them to associate patterns in photos with the descriptive labels. So when a trained neural network is shown a bike, it can accurately predict a bike is present in the snap. It's called Tiny Images because the pictures in the library are small enough for computer-vision algorithms of the late 2000s and early 2010s to digest.
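
To make the mechanics concrete, here is a minimal, hypothetical sketch of how an image-label corpus of this kind is typically consumed. It uses PyTorch, random tensors stand in for the 32x32 pictures, and the three-word label vocabulary is invented for illustration; it shows generic supervised image classification, not MIT's actual training pipeline.

    # Minimal sketch (not MIT's pipeline): pairs of (image, label index) are fed to a
    # classifier so it learns to associate pixel patterns with whatever labels the
    # dataset contains -- including any offensive ones left in it.
    import torch
    import torch.nn as nn
    from torch.utils.data import DataLoader, TensorDataset

    label_vocab = ["bicycle", "tree", "person"]   # hypothetical label set
    num_classes = len(label_vocab)

    # Stand-in for the real corpus: 256 random 32x32 RGB "images" with random labels.
    images = torch.rand(256, 3, 32, 32)
    labels = torch.randint(0, num_classes, (256,))
    loader = DataLoader(TensorDataset(images, labels), batch_size=32, shuffle=True)

    # A tiny convolutional classifier sized for 32x32 inputs.
    model = nn.Sequential(
        nn.Conv2d(3, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        nn.Flatten(),
        nn.Linear(32 * 8 * 8, num_classes),
    )
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
    loss_fn = nn.CrossEntropyLoss()

    for epoch in range(2):  # brief demo run
        for batch_images, batch_labels in loader:
            optimizer.zero_grad()
            loss = loss_fn(model(batch_images), batch_labels)
            loss.backward()
            optimizer.step()

    # After training on real data, model(image) would score every label in the
    # vocabulary, which is why a poisoned label set propagates into applications.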

This discussion has been archived. No new comments can be posted.

  • by Anonymous Coward

    Hope so. Let's keep the internet indelible, if for no other reason than to piss off the easily offended. We can't let these meddlesome fools take over the world

    • by goombah99 ( 560566 ) on Wednesday July 01, 2020 @01:41PM (#60251240)

      Any AI trained on a sanitized database will never be able to pass the Turing test. Nor will it be very good at processing a lot of corpora.

      • I like this variation of "you can't tell jokes any more", despite scores of successful comedians demonstrating otherwise.
      • by vlad30 ( 44644 )
        Sarcasm

        Movie people are liberal-minded and know what to think. Just use movies as a training set, just as many humans do.

        Let's see: historically accurate movies are now defined as racist, as are movies made even a few years back.

        That leaves you with the current crop of SJW-inspired versions.

        The AI now believes the world is full of LGBT and all the other letters. Every couple is married and interracial, but we are all Jewish, Italians are funny and have large families, and the Chinese are all good guys and wh

  • Now, we are teaching AIs how to be racist/misogynist.

    This was one that MIT really fucked up.
    • by Brain-Fu ( 1274756 ) on Wednesday July 01, 2020 @01:20PM (#60251144) Homepage Journal

      This was clearly a problem of quality control. Some pranksters injected these images into the database, against guidelines. It is hard to filter for this sort of thing, since every image must be examined by a human.

      Now, if only they had an AI system that was trained to detect and filter out submissions like that.....

    • Re: (Score:3, Insightful)

      We're not teaching AI to be anything.

      The problem with AI is that nobody is teaching it anything, meaning there is no parent to shape the growth of learning. And unfiltered learning is exactly what you see here, where the connotations are simply excluded from the data set.

      Female genitalia labeled with the "C" word is technically correct but misses the connotation that is learned socially. Without "meaning" it would be easy to see how an AI would refer to women with that word. Technically correct, but completely vo

      • Then it's not AI.

        Oh right... I keep forgetting I am on Slashdot, where

        if (a == b) { print("We Just Invented AI!"); }
        else        { print("We Just Invented AI!"); }

        The things we call AI are a joke!

          • Yes, I am more than well aware of all the efforts around the world to keep calling things what they are not... even if you have to go down the route of altering the definitions of things to fit your narrative.

            If you have to remove a "data set" to resolve a problem of bias then you just do not have any intelligence in the program. Just like with humans... you don't remove data to remove bias... you ADD fucking data!

            Are they adding data here? No... Why? Because this is not fucking AI to any degree that ADDing data r

            • ...you ADD fucking data!

              So to fix a problem with bias, you point the program to pornhub? Not that I'm objecting.

              • Lol, as crazy as it sounds, that is exactly what you do. If you actually have an AI you teach it... you don't just add/subtract data...

                You give a kid a massive multiplication table, but you still have to teach! Also, programs are always biased; the trick is making sure their bias works in our favor. For example... we would want a program to have multiple biases... just like humans. We just want those biases to all be the ones that help make humanity better vs the biases that make humanity worse. A good bias is

            • altering the definitions

              The English language does not have an official authority for definitions. Popular use, alone, alters definitions. This enrages people who prefer the old definitions, but their rage does nothing to prevent the phenomenon.

              The phrase "Artificial Intelligence" is a good example of this, because when John McCarthy originally coined the phrase, it had a really broad meaning that covered several classes of algorithm that existed at the time. The intent was along the lines of "a non-intel

        • Then it's not AI.

          You've been told dozens of times that "AI" is an academic department at Universities that may or may not be part of the Computer Science program.

          It is not a descriptive claim about the intellectual capabilities of a machine.

          You can't comprehend that, because your own "intelligence" is merely an artificial category applied solely due to your taxonomic classification; very much like that machine in that way, in fact.

      • by jythie ( 914043 )
        Well, that is the modern trend. The domains of AI that get all the funding (i.e. ones that are most applicable to shopping recommendations and other such search problems) really come down to combining lots of data with lots of processing power and tweaking things here and there till something passes. Understanding what is happening inside the model is out of fashion
    • Did we really? Seriously, out of a database of 80 million images, how many were offensive? Do you really expect a few MIT students to manually inspect all these images for offense? They probably just scraped the internet for them. Also, from the article, the database included words like pedophile, child molester, molester, and rape suspect, all of which I assume were associated with men. What exactly does a child molester look like?

      An article headline and summary made out to look like someone was against the

    • by Shaitan ( 22585 )

      This IS one MIT really fucked up. A dataset like this doesn't teach AI to be racist, it provides the source material to allow an AI to understand what slang and racist terms identify. An example of a valid use case is differentiating between classic hip-hop music and Klan speech in a massive library of audio recordings.

      These monsters pushing erasure and the rewriting of history need to be stopped. It is a revolution that ends in a single party solution with a heavily walled and divided disarmed populace and

      • Burn History, Burn everything we don't like, burn anything that offends me.

        Yea, that is pretty much what all people in the wrong like to do. Silencing people, cancel culture, and SJW's are all busy spinning their wheels because none of their arguments ever stand up to any meaningful scrutiny.

        The First Lie usually wins the argument so lie first and lie often!

        Make sure they are called a racist, misogynist, homophobe, xenophobe, bigoted prick, anything so long as you do it before they have a chance to prove y

      • A dataset like this doesn't teach AI to be racist, it provides the source material to allow an AI to understand what slang and racist terms identify.

        Actually, good point. You have to have negatives as well as positives, especially if you want an AI/person to discern the difference.

        • by Shaitan ( 22585 )

          "Actually, good point. You have to have negatives as well as positive, esp. if you want an AI/person to discern the difference."

          Yes, they are all just labels and images and a linkage between the two. There is nothing innately evil or racist about a grouping of letters. That is just data. Racism is about intent not the labels which are used. Since "AI" as it exists today lacks the ability to have intent, it is the intent of the creator that matters. Then there is also prejudice/bias but no matter how bad the

    • Except they weren't. This is more whiny xian "facts are racist" garbage. They can't deal with the real world so they have to make up lies about invisible men in the sky talking to them. They'd rather have no technology than have technology that tells the damn truth. Their entire belief system is based on lies.

  • Comment removed based on user account deletion
    • by DarkOx ( 621550 )

      It all depends on where the data comes from. Did the data even all come from people who knew they were producing training data, or do you think someone should be charged if they deliberately fill out a reCAPTCHA wrong?

    • Could have just been a bot that scraped various websites for the images. A little early for pitchforks, I think.

      But this does sound like the kind of prank that 4chan or some similar site would pull. They did something similar to a chat bot AI on Twitter several years ago.

      Incidentally the best way to prevent this in the future is to have a crazy racist AI that would be good at detecting these things.
  • Now we have seen everything!

    • Re: (Score:3, Insightful)

      by hey! ( 33014 )

      As someone who's spent decades working with databases, it surprises me that anyone would be mystified by a database being "racist". Nothing could be more commonplace than a bad system distorting a decision-making process. A system is only as good as people's ability to recognize when it has problems.

      Sure: a database isn't intelligent or self-critical, so it can't *have* racist opinions. But that doesn't mean it can't *embody* or even *enforce* racist attitudes.

      The classic example is redlining, where black nei

  • by Anonymous Coward

    This looks for all the world to me to be almost exactly what Orwell wrote in fiction. ANY history that violates the accepted standards MUST be erased, rephrased, torn down, or otherwise sanitized, or you are going to get yourself torched, figuratively or literally.

  • by Tangential ( 266113 ) on Wednesday July 01, 2020 @01:18PM (#60251124) Homepage
    This is a golden opportunity. Rather than retiring this AI training system, it should be used to develop highly specialized AIs whose whole purpose in life is to answer calls from telemarketers.
    • This is a golden opportunity. Rather than retiring this AI training system, it should be used to develop highly specialized AIs whose whole purpose in life is to answer calls from telemarketers.

      That's too nice. I'd just direct them to a suitable Rickroll.

    • by PPH ( 736903 )

      answer calls from telemarketers

      Lenny from the 'hood.

  • "The database also contained close-up pictures of female genitalia labeled with the C-word". Scientific research is tedious and lonely. Please don't judge.
    • "The database also contained close-up pictures of female genitalia labeled with the C-word". Scientific research is tedious and lonely. Please don't judge.

      How about if, instead of the "C-word", they said pussy? Is that less bad?

  • Rap music (Score:5, Insightful)

    by ichthus ( 72442 ) on Wednesday July 01, 2020 @01:20PM (#60251140) Homepage

    these systems may also label women as whores or bitches

    So, basically, they used rap music lyrics in the language training.

  • This explains so much. [wikipedia.org]
  • "AI Systems use Racially Insensitive Language and the Problem is Unsolvable" because you're not allowed to use insensitive language in training sets to train models what not to do.

    Wasn't there a story a couple years ago about pictures of black folks being pulled out of training sets because AI mischaracterized them? And then a story from earlier this year about a black guy misidentified by facial recognition?

    I guess we'll hear it's systemic racism because the AI systems are poorly trained with only politic

    • and the COMPAS AI that didn't give many black offenders bail because it turned out that black offenders were more likely to have committed crimes associated with offenders jumping bail. But that was racist too.

    • As with all Machine Learning situations... IF you have garbage training data, you will have poor performance on real data.

  • Search and Replace (Score:3, Interesting)

    by nwaack ( 3482871 ) on Wednesday July 01, 2020 @01:37PM (#60251212)
    Apparently search and replace doesn't work in this database? The snowflake cancel culture strikes again!
  • Well, how is an AI supposed to pass the Turing test without being misogynistic and racist?

  • by DaveV1.0 ( 203135 ) on Wednesday July 01, 2020 @01:47PM (#60251266) Journal
    The unpopular truth is that it isn't racist or sexist. It is acting like members of the groups it is offending.

    these systems may also label women as whores or bitches

    Like how rap songs label women as whores and bitches? Like how many women refer to other women as whores and bitches?

    Black and Asian people with derogatory language

    Like how Black and Asian people will call their friends words considered derogatory language, in songs and in person?

    The database also contained close-up pictures of female genitalia labeled with the C-word.

    The "C-word" isn't that offensive in many English speaking countries, especially when referring to female genitalia. That literally falls under "talk dirty to me"

    • It really seems to me that this dataset could also be used to teach the network what words are bad and which words to avoid. Like, I know all the words that are being described here even though some of them aren't being fully spelled out. I also know not to use them, and I know that people who DO use them either are part of some in-group that I'm not, or they're jerks. I've been able to learn how to discern those two groups.

      When I was in grade 9, my French teacher told us that she would teach us French swea

      • by xonen ( 774419 ) on Wednesday July 01, 2020 @03:21PM (#60251664) Journal

        The problem with banning words is that other words will pop up spontaneously to replace them. It's a cat-and-mouse game that can't be won.

        • The problem with banning words is that other words will pop up spontaneously to replace them. It's a cat-and-mouse game that can't be won.

          And they can all be found in a thesaurus. Or a dictionary. Or urban dictionary. If that is how you want to enrich your data.
          Not in direct image-word associations used to train dumb-as-dirt AI pattern recognition systems.

          What if AI is not dumb as dirt? (It is, but what if?) Is there value in this form of knowledge then?
          In what circumstances is it useful to train an adult, a child, an animal, or a machine with picture flash cards labeled c**t or n****r?

          When is that EVER ok, or even in the remotest way usefu

    • The unpopular truth is that it isn't racist or sexist. It is acting like members of the groups it is offending.

      Like how rap songs label women as whores and bitches? Like how many women refer to other women as whores and bitches?

      Who in their right mind would even think to train an image recognition system using rap lyrics? Like they fed it rap music videos... connecting the speech recognition of rap music and the images of a music video... to learn some useful fucking thing, and you suppose that is likely what happened? Are you for real?

      Human beings labeled the pictures in the dataset, and some of them were being fucking dildos. This isn't some "I learned it from you dad" moment for reflection bullshit, it's a g.d. computer,

      • Are you saying that the urban culture from which rap arose is somehow inferior and its word choices should be ignored?

        Are you saying that urban culture should be ignored by AI, effectively creating an AI that is biased towards Eurocentric normative culture?

        Be careful what you wish for; you might find yourself being called a racist for saying rap music and urban slang shouldn't be used to train AIs.
  • AI Safety (Score:2, Interesting)

    by edi_guy ( 2225738 )

    I did not RTFA but it does seem like some rules around AI safety are in order. Unexpected outcomes like the one mentioned will get worse and worse unless the researchers take AI safety more seriously. I like posts from Robert Miles https://www.youtube.com/channe... [youtube.com] who discusses these matters on Computerphile and his own channel. He is of the opinion that AI researchers need to start taking this seriously now, even though it seems like the work isn't that far along. A comparable and topical field might

  • I mean, if I search for pictures of 'pick your expletive', shouldn't I get pictures?
    Is the purpose to allow people to find what they want, or to label things that exist?
    OBVIOUSLY you should not label things in an offensive manner, but if you have a poorly educated person who legitimately is looking for information on female anatomy because they have a medical problem and have always called it 'my c-word', why shouldn't the AI be able to tell what the person is asking about?

    • I disagree; I have the freedom and liberty to label things offensively if I so choose. If I want to call Biden a "demented half-dead meat puppet" or Trump a "goth-eyed orange orangutan" and you are offended, that is your tough shit.

  • This sounds hilarious. Imagine someone making an application based on this data set:

    Maybe a blind guy is using a dating app, and his phone is verbally captioning the images using a computerized voice.
    1. User loads app.
    2. Computerized voice narrates what is on the screen: "Next icon. Previous icon disabled. Like icon. Email icon. Image of sweet milf twerking."
    3. User clicks next.
    4. Computerized voice says: "Next icon. Previous icon. Like icon. Email icon. Image of fat c*nt in a bikini."

    Or a messaging app th

  • Always teaching kids words they shouldn't learn.

  • Thank you for protecting us from bad words at the expense of scientific progress.
    Doing god's work.
  • So it seems a big issue with this is, in part, what the database has been used for, which is probably more than planned. Certainly the inclusion of blatantly racist labels is not something that helps in any way. However, reading this I can see there is a real need to come to an agreement on what things should be excluded from certain databases but must be included in others. For example, the comment by Prabhu and Birhane "You don’t need to include racial slurs, pornographic images, or pictures of children"

  • I don't expect this opinion to be very popular, but any AI that doesn't understand racism, misogyny, misandry, homophobia, or vulgar language will be pretty fucking useless in 2020.
  • ^ this ^ Also, why so many half-formed and tangential arguments on the board when what's problematic are the implications and actual harm stupid humans inflict in the array of ways they choose to apply/use AI? TL;DR: So, you give a chainsaw to a 3-year-old to cut a cake? GTFO
  • So Tay was trained by MIT?
  • Sure those are terms you don't use in polite society.

    They are also correct, in the sense that these are words that are sometimes used to describe the people or objects shown in those pictures.

    We just don't teach our AIs good behaviour and which things to say and which not. That may be a necessary future step: to teach synonyms, including labels stating that this is a synonym it should understand but not use (a rough sketch of that idea follows this comment).

    Except, of course, in its proper context. Most insults are descriptive terms for other things that a
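
A rough sketch of the "understand but never use" idea from the comment above, assuming nothing about the dataset itself: offensive synonyms stay in the recognition vocabulary so the system can map them to a neutral canonical label, but only the neutral term is ever emitted. The word lists here are hypothetical placeholders, not labels from the dataset.

    # Hypothetical sketch: keep offensive synonyms recognizable, but never output them.
    OFFENSIVE_SYNONYMS = {
        # placeholder entries standing in for a curated mapping
        "slur_for_woman": "woman",
        "crude_anatomy_term": "vulva",
    }

    def canonical_label(raw_label: str) -> str:
        """Return a label that is safe to emit: offensive synonyms are understood
        (recognized and mapped) but never repeated verbatim."""
        return OFFENSIVE_SYNONYMS.get(raw_label.lower(), raw_label)

    if __name__ == "__main__":
        for raw in ["tree", "crude_anatomy_term", "bicycle"]:
            print(f"{raw!r} -> {canonical_label(raw)!r}")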

  • If they're removing sources that "...label women as whores or bitches...", isn't that exclusionary of black rap culture?

  • Let trained networks be aware of the words, but make sure they know not to use them. Unless they're really mad.
