Leon Neal
OpenAI, the maker of the popular ChatGPT chatbot and backed by “billions” from Microsoft (NASDAQ:MSFT), said the AI chatbot is now able to “see, hear and speak” spoken words, thanks to a new update.
In a blog post, the Sam Altman-led company said the update will allow users on the app to opt into conversations and have the app respond. They can share images with ChatGPT and ask it questions, with the chatbot responding.
“The new voice capability is powered by a new text-to-speech model, capable of generating human-like audio from just text and a few seconds of sample speech,” OpenAI wrote in the post.
“We collaborated with professional voice actors to create each of the voices,” OpenAI continued. “We also use Whisper, our open-source speech recognition system, to transcribe your spoken words into text.”
The voice update will be available on the iOS and Android app, with users required to opt-in. The image update will be available on all version, OpenAI added.
The company added that the deployment of image and voice capabilities would come “gradually” and would be available to Plus and Enterprise users in the next two weeks. From there, it would roll out to other users and developers at an unspecified timeline.