Skip to main content
Daily Star

Terrifying Microsoft AI can build a robo-clone of your voice after just 3 seconds

Microsoft's 'VALL-E' artificial intelligence is capable of mimicking anybody's voice after hearing just three seconds of speech - the spooky algorithm could have terrifying consequences

Microsoft's 'VALL-E' artificial intelligence is capable of mimicking anybody's voice after hearing just three seconds of speech - the spooky algorithm could have terrifying consequences
The 'VALL-E' AI can turn text into speech using your voice(Image: Getty Images/iStockphoto)

Your voice could be digitally cloned and used to impersonate you, thanks to a creepy new AI called VALL-E.

AI has unveiled an artificial intelligence system capable of mimicking any human voice based on just three seconds of audio.


Article continues below

It can then be used to turn any written text into speech, making it possible for someone to put words in your mouth using the tool.

READ NEXT: You could face prison for sharing your Netflix password under UK law

It's even designed to recreate the 'emotional range' and pacing of the speaker, making it a hyper accurate form of mimicry.


REDMOND, WASHINGTON - JULY 17: A building on the Microsoft Headquarters campus is pictured July 17, 2014 in Redmond, Washington. Microsoft CEO Satya Nadella announced, July 17, that Microsoft will cut 18,000 jobs, the largest layoff in the company's history. (Stephen Brashear/Getty Images)
Microsoft trained the AI model on 7000 hours of English language speech(Image: Getty Images)
READ MORE: Futuristic new DeLorean looks like something straight out of Back to The FutureREAD MORE: Real-life Transformers unveiled - with Optimus Prime automatically changing into a truck

The AI tool is thankfully not yet available to the general public. Microsoft says it is a 'neural codec language model' trained on 60,000 hours of English language speech from Meta, who own Microsoft

Del, a videogame artist at 'Last of Us' creators Naughty Facebook., explained: "Using a 3-second sample of human speech, [VALL-E] can generate super-high-quality text-to-speech from the same voice.


"Even emotional range and acoustic environment of the sample data can be reproduced."

Del added that it could affect the future of audiobooks. "At the moment, VALL-E can only read, not necessarily PERFORM with the emotional, tonal and pacing range of a voice actor. However, much of the audiobook industry relies on a lot of junior voice actor talent that will undoubtedly feel the brunt of this first."

REDMOND, WASHINGTON - JULY 17: A building on the Microsoft Headquarters campus is pictured July 17, 2014 in Redmond, Washington. Microsoft CEO Satya Nadella announced, July 17, that Microsoft will cut 18,000 jobs, the largest layoff in the company's history. (Stephen Brashear/Getty Images)
Microsoft trained the AI model on 7000 hours of English language speech(Image: Getty Images)

READ MORE: Elon Musk breaks Guinness World Record for 'biggest loser' after net worth plummets

VALL-E has certainly ruffled a few feathers online. Twitter user Kevin Nash said: "This is terrifying thinking about scam callers getting their hands on this."

Another user, Christina Kraus, wrote: "What use does this even have except for scam and impersonation purposes? Why don't we focus on AI where it actually helps humanity? Why are we getting AI image generators and voice imitation? That's literally the last thing we need."

However, the tool could prove very useful in a range of contexts. People who lose the ability of speech—such as the late Stephen Hawking, who was unable to talk due to Motor Neurone Disease—could use the AI system to create replicas of their own voices in order to continue communicating with the world.

Article continues below

READ MORE:

READ MORE: Robot dog owner offers reward to track down 'drunk' woman who kicked his £8k petREAD MORE: Deepfake TV show sees Harry Kane, Stormzy and Greta Thunberg become bickering neighbours
Follow Daily Star:

MicrosoftArtificial Intelligence
reach logo

At Reach and across our entities we and our partners use information collected through cookies and other identifiers from your device to improve experience on our site, analyse how it is used and to show personalised advertising. You can opt out of the sale or sharing of your data, at any time clicking the "Do Not Sell or Share my Data" button at the bottom of the webpage. Please note that your preferences are browser specific. Use of our website and any of our services represents your acceptance of the use of cookies and consent to the practices described in our Privacy Notice and Cookie Notice.