Google's SoundFilter AI Separates Any Sound or Voice From Mixed-Audio Recordings (venturebeat.com) 28
Researchers at Google claim to have developed a machine learning model that can separate a sound source from noisy, single-channel audio based on only a short sample of the target source. In a paper [PDF], they say their SoundFilter system can be tuned to filter arbitrary sound sources, even those it hasn't seen during training. From a report: The researchers believe a noise-eliminating system like SoundFilter could be used to create a range of useful technologies. For instance, Google drew on audio from thousands of its own meetings and YouTube videos to train the noise-canceling algorithm in Google Meet. Meanwhile, a team of Carnegie Mellon researchers created a "sound-action-vision" corpus to anticipate where objects will move when subjected to physical force. SoundFilter treats the task of sound separation as a one-shot learning problem. The model receives as input the audio mixture to be filtered and a single short example of the kind of sound to be filtered out. Once trained, SoundFilter is expected to extract this kind of sound from the mixture if present.
Combine with deepfake tech (Score:1)
So now (Score:5, Insightful)
even if you put the radio on to cover your discussion, the Google surveillance collective can still make out what you're saying.
Everything that company does is creepy on one lever or another...
Re: (Score:2)
Just move the fulcrum.
Re: (Score:3)
Amazon: Let's just put a nice microphone array in our smart speakers so we can always hear what you're saying.
Google: Let's throw the cheapest hardware out there and just fix it with software
If it does what I think it does, this is real AI (Score:4, Insightful)
Re: (Score:2)
The ability to pick out a single speaker from a cacophony filled room with lots of other people talking or making noise such as at a night club or party is actually quite an incredible trait that humans have. If this software can do that from a short audio sample of the target, I'd call that real AI.
I can't do it myself. I'm 45 years old now and have never been able to follow conversations in night clubs, loud parties, nor even most restaurants when I'm in a group and am trying to hear the people on the other side of the table. I can hear quiet sounds well - my brain just doesn't convert the sounds into words. I suspect normal people do better than me maybe because they subconsciously lip-read a little, or maybe because they're used to filling in the blanks much more than I do.
Re: (Score:2)
That may mean you have an underlying issue - e.g. https://www.healthyhearing.com... [healthyhearing.com] (which obviously doesn't mean that you're autistic; this and other hearing issues can happen as part of, or independent from, that).
I have a similar issue, plus mild deafness, so I do tend to benefit from seeing the lips of the speaker too.
Re: (Score:2)
Jungle animals can usually exhibit some type of Cocktail Party Effect too.
Ours is tuned for our language but hearing mating calls over the din has reproduction advantage too.
Re: (Score:1)
No, it isn't. (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
It appears you have invented a rather arbitrary definition of intelligence that has no bearing on how it is ordinarily defined by the rest of the world.
Every creature on the planet with a nervous system is intelligent, but your criteria for intelligence would preclude that.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
And just how long do you think it takes someone who is not a linguist savant to learn a language without having any prior exposure to it?
My point being that it generally requires far more than just a small sample. Yet once a language is known and understood (vast amounts of data), you can effectively communicate a concept in that language simply by knowing what concept to communicate (small amounts of data).
Or do you believe that the ability to coherently communicate ideas is not a demonstration of in
Re: (Score:2)
Re: (Score:2)
Not at all... I am saying that virtually everything that you ever learn how to do takes a shitpile of data input before you know how to do it correctly. Even your example of hypothetically making a sandwich without having ever seen a kitchen would itself draw on a vast breadth of knowledge acquired from numerous sources, most of which would probably not even be consciously aware of.
As it happens, I picked something else also related to listening... it was not a deliberate choice.
Re: (Score:1)
Most importantly (Score:3)
Re: (Score:1)
DMCA proofing videos? (Score:2)
blind signal separation (Score:2)
http://www.irisa.fr/metiss/oze... [irisa.fr]
Another use (Score:2)
This has potential for the hearing impaired. (Score:1)