OpenAI has technology for voice cloning but it’s not available for general use

Voice Engine is an expansion of the company’s existing text-to-speech API. It allows users to upload any 15-second voice sample to generate a synthetic copy of that voice

Mathures Paul Published 01.04.24, 10:58 AM
After AI model Sora, OpenAI has built a voice cloning tool. Illustration: The Telegraph

OpenAI, the company behind ChatGPT, has developed a voice-cloning AI model that needs only a 15-second sample to work.

Voice Engine is an expansion of the company’s existing text-to-speech API. It allows users to upload any 15-second voice sample to generate a synthetic copy of that voice. But there’s no date for public availability yet.

“These small-scale deployments are helping to inform our approach, safeguards, and thinking about how Voice Engine could be used for good across various industries,” OpenAI said in its blog post.

Companies that have access include the education technology company Age of Learning, visual storytelling platform HeyGen, frontline health software maker Dimagi, AI communication app creator Livox, and health system Lifespan, according to The Verge. The technology may have huge implications for those who record themselves speaking often, like podcasters, voice-over artists and spoken word performers.
