Microsoft's AI Generates Voices That Sing in Chinese and English

Microsoft's AI Generates Voices That Sing in Chinese and English (venturebeat.com) 32

Posted by msmash on Monday July 13, 2020 @04:44PM from the pushing-the-limits dept.

Researchers at Zhejiang University and Microsoft claim they've developed an AI system -- DeepSinger -- that can generate singing voices in multiple languages by training on data from music websites. From a report: In a paper published on the preprint Arxiv.org, they describe the novel approach, which leverages a specially-designed component to capture the timbre of singers from noisy singing data. The work -- like OpenAI's music-generating Jukebox AI -- has obvious commercial implications. Music artists are often pulled in for pick-up sessions to address mistakes, changes, or additions after a recording finishes. AI-assisted voice synthesis could eliminate the need for these, saving time and money on the part of the singers' employers.

But there's a darker side: It could also be used to create deepfakes that stand in for musicians, making it seem as though they sang lyrics they never did (or put them out of work). In what could be a sign of legal battles to come, Jay-Z's Roc Nation label recently filed copyright notices against videos that used AI to make him rap Billy Joel's "We Didn't Start the Fire." As the researchers explain, singing voices have more complicated patterns and rhythms than normal speaking voices. Synthesizing them requires information to control the duration and the pitch, which makes the task challenging. Plus, there aren't many publicly available singing training data sets, and songs used in training must be manually analyzed at the lyrics and audio level.

Microsoft's AI Generates Voices That Sing in Chinese and English

This discussion has been archived. No new comments can be posted.

Load All Comments

Search 32 Comments Log In/Create an Account

Comments Filter:

Alto Intelligence? (Score:2)

by nospam007 ( 722110 ) * writes:

What's next, tenor?
Ckoe already tried that ... (Score:2)

by CaptainDork ( 3678879 ) writes:

... "I want to teach the world to sing ..."
Meh (Score:2)

by TWX ( 665546 ) writes:

This doesn't bring any Satisfaction.
Can it... (Score:2)

by kryliss ( 72493 ) writes:

sing like Diva Plavalaguna?
- Re: (Score:2)
  
  by RogueWarrior65 ( 678876 ) writes:
  
  Multi-pass
Started out English only... (Score:2)

by SuperKendall ( 25149 ) writes:

AI only started singing in Chinese after they installed TikTok.
Artificial Art (Score:1)

by PsuedoPhil ( 6914240 ) writes:

Very strange time we live in. How do we define art when its categorically artificial that created it? Is this any different than synthesized drums, a digital guitar effects pedal?
- Re: (Score:1)
  
  by KotoKoraBanjo ( 6839470 ) writes:
  
  Humans program the drum sequences and play the guitars that are processed by the pedals. Now AI drummers, that's a good idea. As the joke goes, "You only have to punch in the rhythm once and it won't drink your beer."
President Xi deepfakes (Score:3)

by tlhIngan ( 30335 ) writes: <[ten.frow] [ta] [todhsals]> on Monday July 13, 2020 @06:04PM (#60295156)

I take it this stuff is already banned in China because well, subject line.
Though, if the deepfakes are any good, this might actually be something fun to play with.
President Xi singing the Winnie the Pooh theme song, anyone?

- Re: (Score:2)
  
  by ShanghaiBill ( 739463 ) writes:
  
  Xi's wife is a professional singer.
  When they married, she was far more famous than he was.
  For many years, he was referred to as "Peng Liyuan's husband".
Daisy, daisy, give me your answer do... (Score:2)

by YrWrstNtmr ( 564987 ) writes:

Now, build the whole rest of the AI.
LOL, no, some things are human-specific (Score:2)

by Rick Schumann ( 4662797 ) writes:

But there's a darker side: It could also be used to create deepfakes that stand in for musicians, making it seem as though they sang lyrics they never did (or put them out of work).
Oh, bullshit.
You want to tell me you can create a computer program that can 'sing'? Sure. Whatever. Maybe it'll even be technically accurate according to all standards of voice. But it'll never be a human singer. It'll never have the emotional depth that a human singer brings.
But Rick, autotune!
LOL 'autotune' just makes everyone into carbon-copies of everyone else. The 'flaws' in a singers' performance are just as important, as their technical skills as a vocalist -- and one could validly argue that the 'flaws' are sometimes
- Re: (Score:1)
  
  by Zoomer_baby ( 7045546 ) writes:
  
  Hi Rick, Check out actual cutting edge examples of where this is going here. https://venturebeat.com/2020/0... [venturebeat.com] Ai generated lyrics and music. Still hits the uncanny valley song structure wise but I think you wouldn't notice if it was in the background in a mall (Whatever a mall is ;) ) (you might just thing its a shitty artist you haven't heard of. My prediction: Streaming services will start to experiment with generated for background songs and it will slowly creep in more and more. Why not? one time cos
  - Re: (Score:2)
    
    by Rick Schumann ( 4662797 ) writes:
    
    Thanks but no thanks, they can keep it, zero interest whatsoever.
- Re: (Score:1)
  
  by Zoomer_baby ( 7045546 ) writes:
  
  Just to be clear I wholeheartedly agree with your sentiment :) REAL music will be human but what is popular will be the Taco Bell/McDonalds of music.
  - Re: (Score:2)
    
    by Rick Schumann ( 4662797 ) writes:
    
    I'm not interested in 'popular' music I'm interested in GOOD music made by good talented people. :-)
    - Re: (Score:2)
      
      by gravewax ( 4772409 ) writes:
      
      personally I am just interested in good music, I could not give a shit whether it was created, played, sung by a human, a robot or a well trained french poodle. My enjoyment of music has no relationship with what I think of the author of it.
      - Re: (Score:2)
        
        by Rick Schumann ( 4662797 ) writes:
        
        Missing the point
        I don't think so-called 'AI' created 'music' is going to be any good, and an 'AI' 'singer' even less so. Not going out of my way to even hear it, not even clicking a link. Say what you want about that.
        If and when they crack the code of the human brain and we can create *real* AI and not the ersatz they keep trotting out, and these general AI have actual personalities and are like real people, then maybe I'll give a damn. Otherwise they're just hyping garbage technology.
- Re: (Score:2)
  
  by ShanghaiBill ( 739463 ) writes:
  
  It'll never have the emotional depth that a human singer brings.
  Why not? It seems to me that emotion should be easy to fake. For non-singing voice synthesis, adjusting intonation to simulation emotion is not difficult. This tech is advancing quickly.
  Human singers win in the end.
  Human singers expect to be paid. Software performs for free.
  - Re: (Score:2)
    
    by Rick Schumann ( 4662797 ) writes:
    
    Found the non-music person.
- Re: (Score:2)
  
  by bloodhawk ( 813939 ) writes:
  
  what a load of bullshit lol, there is nothing in the human voice that cannot be replicated and improved upon. emotion is just variations in the voice, it isn't anything special. I am sure every professional that ever had the job automated all said the same garbage of you will never be as good as a human, the reality is humans are pretty fucking flawed at most things, to be the same as a human usually means dumbing down the computer algorithms.
  - Re: (Score:2)
    
    by Rick Schumann ( 4662797 ) writes:
    
    Found the other non-music person.
Pop singers have been faking for many years (Score:2)

by aberglas ( 991072 ) writes:

Manufactured sound, voice enhancers.
They no longer need to sing in tune, the machine will do it for them. Even add a little vibrato, deepen it a bit, get it in properly time with the music.
- Great video of what can be done (Score:2)
  
  by aberglas ( 991072 ) writes:
  
  https://www.youtube.com/watch?... [youtube.com]
  - Re: (Score:2)
    
    by UnknownSoldier ( 67820 ) writes:
    
    Ugh. That sounds like someone auto-tuned the crap out of it.
What until you see the translation what Tay sang (Score:2)

by thesjaakspoiler ( 4782965 ) writes:

Sorry, the Slashdot guidelines prevent me from posting inflammatory and racist lyrics.
Songsmith Deluxe (Score:1)

by CamD ( 964822 ) writes:

Microsoft has finally completed Songsmith.
https://www.youtube.com/watch?... [youtube.com]
I'm gonna sell so many towels.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Microsoft's AI Generates Voices That Sing in Chinese and English (venturebeat.com) 32

Microsoft's AI Generates Voices That Sing in Chinese and English More Login

Microsoft's AI Generates Voices That Sing in Chinese and English

Alto Intelligence? (Score:2)

Ckoe already tried that ... (Score:2)

Meh (Score:2)

Can it... (Score:2)

Re: (Score:2)

Started out English only... (Score:2)

Artificial Art (Score:1)

Re: (Score:1)

President Xi deepfakes (Score:3)

Re: (Score:2)

Daisy, daisy, give me your answer do... (Score:2)

LOL, no, some things are human-specific (Score:2)

Re: (Score:1)

Re: (Score:2)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Pop singers have been faking for many years (Score:2)

Great video of what can be done (Score:2)

Re: (Score:2)

What until you see the translation what Tay sang (Score:2)

Songsmith Deluxe (Score:1)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot