This new AI can mimic human voices with only 3 seconds of training

Humanity has taken yet another step toward the inevitable war against the machines (which we will lose) with the creation of Vall-E, an AI developed by a team of researchers at Microsoft that can produce high quality human voice replications with only a few seconds of audio training.

Vall-E isn’t the first AI-powered voice tool— xVASynth (opens in new tab), for instance, has been kicking around for a couple years now—but it promises to exceed them all in terms of pure capability. In a paper available at Cornell University (opens in new tab) (via Windows Central (opens in new tab) ), the Vall-E researchers say that most current text-to-speech systems are limited by their reliance on “high-quality clean data” in order to accurately synthesize high-quality speech.

Blog