Your data. Your choice.

If you select «Essential cookies only», we’ll use cookies and similar technologies to collect information about your device and how you use our website. We need this information to allow you to log in securely and use basic functions such as the shopping cart.

By accepting all cookies, you’re allowing us to use this data to show you personalised offers, improve our website, and display targeted adverts on our website and on other websites or apps. Some data may also be shared with third parties and advertising partners as part of this process.

News + Trends

Microsoft's VALL-E imitates any voice - three seconds of recording is enough

Martin Jud
11/1/2023
Translation: machine translated

DALL-E is followed by VALL-E: Microsoft and OpenAI have created a new artificial intelligence (AI) that can imitate voices. A voice recording of just three seconds should be enough input for the AI.

Today we know: What photos or videos show doesn't necessarily have to have happened. Since ChatGPT and DALL-E, it's also clear that a text doesn't necessarily have to come from an author's pen or a picture from an artist's brush. Now it's the voice's turn.

Microsoft is aware that the technology also has potential for misuse. For this reason, a protocol in future applications will ensure that content created by VALL-E can be recognised as such.

The AI delivers impressive results with the examples presented by Microsoft. For its training, 60,000 hours of English language recordings were processed. This corresponds to a hundred times the input of existing speech syntheses.

Cover image: shutterstock

52 people like this article


User Avatar
User Avatar

I find my muse in everything. When I don’t, I draw inspiration from daydreaming. After all, if you dream, you don’t sleep through life.


News + Trends

From the latest iPhone to the return of 80s fashion. The editorial team will help you make sense of it all.

Show all