AI Hears Your Anger in 1.2 Seconds (venturebeat.com) 51
MIT Media Lab spinoff Affectiva's neural network, SoundNet, can classify anger from audio data in as little as 1.2 seconds regardless of the speaker's language -- just over the time it takes for humans to perceive anger. From a report: Affectiva's researchers describe it ("Transfer Learning From Sound Representations For Anger Detection in Speech") in a newly published paper [PDF] on the preprint server Arxiv.org. It builds on the company's wide-ranging efforts to establish emotional profiles from both speech and facial data, which this year spawned an AI in-car system codeveloped with Nuance that detects signs of driver fatigue from camera feeds. In December 2017, it launched the Speech API, which uses voice to recognize things like laughing, anger, and other emotions, along with voice volume, tone, speed, and pauses.
SoundNet consists of a convolutional neural network -- a type of neural network commonly applied to analyzing visual imagery -- trained on a video dataset. To get it to recognize anger in speech, the team first sourced a large amount of general audio data -- two million videos, or just over a year's worth -- with ground truth produced by another model. Then, they fine-tuned it with a smaller dataset, IEMOCAP, containing 12 hours of annotated audiovisual emotion data including video, speech, and text transcriptions.
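The two-stage recipe described above (pre-train on a large set labelled by another model, then fine-tune on a small hand-annotated set) can be sketched in a few lines. This is only a toy illustration, not Affectiva's method: it uses plain logistic regression instead of a convolutional network, and the features, the labelling rules, the data sizes, and all function names are made-up assumptions.

```python
import math
import random

def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def train(weights, data, epochs, lr):
    """Run plain SGD for logistic regression, starting from `weights`."""
    w = list(weights)  # copy, so the caller's starting weights are untouched
    for _ in range(epochs):
        for features, label in data:
            pred = sigmoid(sum(wi * xi for wi, xi in zip(w, features)))
            for i, xi in enumerate(features):
                w[i] -= lr * (pred - label) * xi
    return w

def accuracy(weights, data):
    hits = 0
    for features, label in data:
        p = sigmoid(sum(wi * xi for wi, xi in zip(weights, features)))
        hits += int((p >= 0.5) == (label == 1))
    return hits / len(data)

def make_example(rng):
    # Hypothetical "clip" features: (bias term, loudness, pitch variation).
    return (1.0, rng.uniform(-1, 1), rng.uniform(-1, 1))

def true_label(x):
    # Stand-in for human annotation (assumed toy rule).
    return 1 if x[1] + x[2] > 0.2 else 0

def teacher_label(x):
    # Stand-in for the imperfect existing model: it only looks at loudness.
    return 1 if x[1] > 0 else 0

rng = random.Random(0)
big_unlabelled = [make_example(rng) for _ in range(2000)]
small_labelled = [make_example(rng) for _ in range(40)]
held_out = [make_example(rng) for _ in range(500)]

pretrain_set = [(x, teacher_label(x)) for x in big_unlabelled]
finetune_set = [(x, true_label(x)) for x in small_labelled]
test_set = [(x, true_label(x)) for x in held_out]

# Stage 1: learn from the large, machine-labelled set.
w_pre = train([0.0, 0.0, 0.0], pretrain_set, epochs=5, lr=0.1)
# Stage 2: continue from those weights on the small, hand-labelled set.
w_fine = train(w_pre, finetune_set, epochs=50, lr=0.1)

print("after pre-training:", accuracy(w_pre, test_set))
print("after fine-tuning: ", accuracy(w_fine, test_set))
```

The only point of the sketch is that stage 2 starts from the stage 1 weights rather than from scratch; how much that helps depends on the data.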
Cortana and Siri (Score:3)
Anytime Cortana or Siri pops up and gets in the way, there will be anger!
AT&T Or Time Warner (Score:2)
That dumb butch they have doesn't understand shit.
Then they go all fucking stupid pretending computers make some kind of bepop noise when thinking.
Stupid fuckers.
Re: (Score:2)
Re: (Score:1)
Every time I get one of those worthless automated "assistants" instead of live customer support there will be anger. I guess the next step is having the automated "assistant" (it's not AI, sorry) determine the reason that I'm angry directly due to the automated "assistant" itself always being programmed to offer only simplistic choices that have nothing to do with my issue and wasting my time instead of getting a live person on the line. If the issue were as simple as the ones proffered by the automated "as
I'm sorry Dave (Score:2)
This conversation can serve no further useful purpose. goodbye.
It's quick, but not quick enough? (Score:2)
I can detect anger in someone's voice practically immediately, even before they've finished the first word because as a human, I use a number of other clues e.g. facial contortion, body positioning, finger pointing etc.
1.2 seconds to detect a change in pitch, volume etc. seems too long, and I think that's the overall problem with artificial intelligence or machine learning - they're great for massive data sets that have common patterns (or used to build
Re: (Score:1)
It's for IVR trees (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Maybe they should consider the fact (Score:2)
that I have a resting bitchy voice. Especially when not talking to a human that speaks English.
"ground truth" (Score:2)
"the team first sourced a large amount of general audio data ... with ground truth produced by another model."
So, actually, the program wasn't detecting anger. The program was modelling what a different program detected in the signal.
Re: (Score:2)
You neglected the next part, where they fine-tuned it using hand-labelled data. If you're training a system that learns (and that includes people) and you've got an automatic system that performs okay, it's often a good idea to do a first round of training on the automatic results. Then you come along with a smaller, higher-quality training set to boost performance over what the existing automatic system can do.
And yes, the term "ground truth" is usually used in a stupid way.
When my wife gets angry (Score:1)
I can code AI (Score:2)
If volume_before * 1.5 < volume_now:
then ANGRY!
My view on AI (Score:2)
I test for this with old man profanity (Score:3)
Even German? (Score:2)
It's about empathy towards computers (Score:2)
Now they can sense the anger you have towards them for whatever reasons and say "No, please don't throw me out the window" while you are throwing it out the window (or smashing it with a hammer).
This isn't AI (Score:3)
Re: (Score:2)
Re: (Score:3)
You've identified the difference: "The terminology 15 years ago was real time monitoring with language recognition heuristics."
Heuristics are a set of rules used for decision making. In the context of algorithms, those heuristics are designed by a human and programmed into the system.
"AI" is a nonspecific term, but if it means anything it means a system that learns from experience. Specifically, it does not use preprogrammed heuristics.
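The distinction drawn here can be made concrete with a toy sketch: one function hard-codes its rule, the other picks its threshold from labelled examples. The 1.5 factor, the example data, and the function names are all made up for illustration.

```python
def heuristic_is_angry(volume_before, volume_now):
    # Hand-written heuristic: a programmer picked the 1.5 factor.
    return volume_now > 1.5 * volume_before

def learn_threshold(examples):
    """Pick the volume ratio that misclassifies the fewest examples.

    `examples` is a list of (ratio, is_angry) pairs. A clip is predicted
    angry when its ratio is at or above the returned threshold.
    """
    best_threshold, best_errors = None, len(examples) + 1
    for t in sorted(ratio for ratio, _ in examples):
        errors = sum((ratio >= t) != is_angry for ratio, is_angry in examples)
        if errors < best_errors:
            best_threshold, best_errors = t, errors
    return best_threshold

# Made-up labelled data: (volume_now / volume_before, annotated as angry?)
labelled = [(0.9, False), (1.1, False), (1.4, False), (1.6, False),
            (2.1, True), (2.4, True), (3.0, True)]

learned = learn_threshold(labelled)
print("learned threshold:", learned)
print("heuristic on ratio 1.6:", heuristic_is_angry(1.0, 1.6))
print("learned rule on ratio 1.6:", 1.6 >= learned)
```

On this data the hard-coded rule flags a ratio of 1.6 as angry, while the learned threshold does not, because no example at that ratio was labelled angry. Change the data and the learned rule changes with it; the heuristic stays put.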
Bruce Banner (Score:1)
Also, does it detect passive-aggressive anger? What if I yell "I LOVE YOU" at a pet, vs I whisper "I'm going to put you in the microwave and set it on high for 4 minutes, ohh yes I am, such a bad doggie you are"? What is the algorithm keying on: volume, facial expressions, changes in skin tone, words spoken? And all they did was get close to what a human could do. Come on, I thought computers were faster. Get it down to 0.000001s and I'll be i
Wells Fargo peeps don't need software to do this. (Score:2)
Wells Fargo customer service reps lately just assume everyone is pissed at them these past few weeks, especially yesterday and today.
[stewie] Where's my money?! *WHAM!* Where's my money?! [/stewie]
German (Score:2)
can classify anger from audio data in as little as 1.2 seconds regardless of the speaker's language
It was just a coincidence that the German speakers were angry 100% of the time ...
What's its false positive record like? (Score:2)
No, it's recognizing arousal (Score:2)
Not possible. If someone screams "Fuuuuuuuck!!!" at the top of their lungs, there is no way AI can distinguish whether it's anger, pain, frustration, surprise, or even joy, because the source signal may be identical for all of them. At best, this system is detecting high arousal and possibly unpleasant mood.
Re: (Score:2)
It'll probably also fail on people who are angry, but aren't shouting it. E.g. "I've said everything that can be said. You will refund me, or you will see your entrails hanging out of your body by tomorrow. Have a good day sir."
Re: (Score:2)
Exactly. There is no single body signature for anger [nytimes.com].
too bad (Score:2)
too bad it can't smell the fart in its general direction,
How could it tell? (Score:2)
"SUPPORT. HELP. HUMAN. OPERATOR. GET ME A FUCKING HUMAN BEING YOU GODDAMN PIECE OF SHIT! "
processing... processing... processing... anger detected 37% probability
(im not yelling slashdot im not yelling... ok i am but its on purpose let this post go through...)