• Voice Engine, a text-to-voice generation platform developed by OpenAI, is providing access to create synthetic voice.
  • Unethical uses of AI voice technology might result in spams and cause concerns.

The voice clone model Voice Engine has been in development since late 2022, and it can generate a synthetic voice according to a 15-second clip of someone’s voice. However, as generative AI continues to develop, ethical concerns follow as well.

OpenAI introduces Voice Engine

OpenAI has unveiled Voice Engine, a text-to-voice generation platform capable of creating synthetic voices based on short voice clips.

Also read: OpenAI voice-clone tool mimics your voice with 15-second sample

This innovative technology can produce AI-generated voices that read text prompts in multiple languages, offering potential applications across various industries.

Limited access to Voice Engine has been granted to select companies, including Age of Learning, HeyGen, Dimagi, Livox, and Lifespan.

OpenAI’s approach in ethical considerations

Last month, after people received spam calls from an AI-Cloned voice of President Joe Biden, the Federal Communications banned robocalls.

To address concerns, OpenAI introduces ethical guidelines surrounding AI voice technology. Partners are required to adhere to usage policies prohibiting impersonation without consent, obtaining explicit speaker consent, and disclosing AI-generated voices to listeners.

OpenAI also implements watermarking to trace audio origin and actively monitors its usage. The initiative makes a broader effort to mitigate AI-related risks, including phasing out voice-based authentication, implementing policies to safeguard individuals’ voices, enhancing education on AI deepfakes, and developing tracking systems for AI content.