Please create an account to participate in the Slashdot moderation system

 



Forgot your password?
typodupeerror
×
AI China Microsoft Technology

Microsoft's AI Generates Voices That Sing in Chinese and English (venturebeat.com) 32

Researchers at Zhejiang University and Microsoft claim they've developed an AI system -- DeepSinger -- that can generate singing voices in multiple languages by training on data from music websites. From a report: In a paper published on the preprint Arxiv.org, they describe the novel approach, which leverages a specially-designed component to capture the timbre of singers from noisy singing data. The work -- like OpenAI's music-generating Jukebox AI -- has obvious commercial implications. Music artists are often pulled in for pick-up sessions to address mistakes, changes, or additions after a recording finishes. AI-assisted voice synthesis could eliminate the need for these, saving time and money on the part of the singers' employers.

But there's a darker side: It could also be used to create deepfakes that stand in for musicians, making it seem as though they sang lyrics they never did (or put them out of work). In what could be a sign of legal battles to come, Jay-Z's Roc Nation label recently filed copyright notices against videos that used AI to make him rap Billy Joel's "We Didn't Start the Fire." As the researchers explain, singing voices have more complicated patterns and rhythms than normal speaking voices. Synthesizing them requires information to control the duration and the pitch, which makes the task challenging. Plus, there aren't many publicly available singing training data sets, and songs used in training must be manually analyzed at the lyrics and audio level.

This discussion has been archived. No new comments can be posted.

Microsoft's AI Generates Voices That Sing in Chinese and English

Comments Filter:

The biggest difference between time and space is that you can't reuse time. -- Merrick Furst

Working...