OpenAI collaborated with various professional voice actors and used its open-source speech recognition software called Whisper to enable a seamless experience for ChatGPIT users.
Leading Generative Artificial Intelligence (AI) company backed by Microsoft Corporation (NASDAQ: MSFT) OpenAI has announced the addition of more features to its chatgpt This platform is for paid users. According to the announcement, ChatGPT can now see, hear and speak, allowing for a more interactive experience with users. Specifically, new ChatGPT features are expected to be launched in the next two weeks. Previously, ChatGPT had only a text-based generative model, which is now complemented by image search and voice recognition features.
The company noted that the voice recognition feature will be available for iOS and Android users, while image search will be rolled out to all devices across platforms.
“You can now use voice to engage in back-and-forth conversations with your Assistant. “Talk with it on the go, request a bedtime story for your family, or settle an argument at the dinner table,” OpenAI noted In the announcement.
Giving users a choice of five different voices, the company highlighted that the feature is capable of generating human-like audio from text. Additionally, the ChatGPIT voice feature is powered by a new text-to-speech model, whereby the feature also uses the company’s Whisper system to transcribe users’ spoken words into a text-based form.
ChatGPT becomes more personal with new features
For image search on ChatGPT, users can now get detailed information about a given image by combining it with the drawing tools in the mobile application. For example, a ChatGPT user can circle a certain part of the image where the AI needs more detail to help generate better results.
“Image understanding is powered by multimodal GPT-3.5 and GPT-4. “These models apply their language reasoning skills to a wide range of images, such as photos, screenshots, and documents containing both text and images,” the company said.
The company is dedicated to introducing new services where it is needed most, including people living with disabilities, manufacturing companies that need to scale up their operations, and eliminating the language barrier. For example, Spotify is harnessing the power of this technology for a pilot program that helps podcasters expand the reach of their storytelling by translating podcasts into additional languages in their native voices.
OpenAI and Market Outlook
The launch of voice and image recognition features in ChatGPIT will help OpenAI remain competitive in a dynamic environment full of competitors. On Monday, Amazon.com Inc (NASDAQ: AMZN) announced A $4 billion strategic investment in an AI startup called Anthropic. Nonetheless, demand for AI products remains high around the world, so it is able to accommodate even more startups.
Let’s talk crypto, metaverse, NFTs, CedeFi, and stocks and focus on multi-chain as the future of blockchain technology. Let us all win!
Bitcoin Crypto Related Post