ChatGPT Updated With Support for Voice Conversation, Image Recognition Features: Details

ChatGPT will soon be able to 'discuss' images shared by users and hold 'back-and-forth' conversations using five voices.

ChatGPT Updated With Support for Voice Conversation, Image Recognition Features: Details

Photo Credit: Unsplash/ @ilgmyzin

ChatGPT Plus and Enterprise subscribers will have access to the features in the coming weeks

Highlights
  • ChatGPT will soon be able to converse using text-to-speech technology
  • The chatbot is also gaining the ability to analyse and understand images
  • These ChatGPT features will roll out to users in the coming weeks
Advertisement

ChatGPT has been updated with support for voice conversations and image recognition, OpenAI announced on Monday. The company's AI-powered chatbot will soon be able to understand images captured or shared by users and provide details or related information across platforms where the chatbot is available. It will also be capable of back-and-forth conversation using OpenAI's Whisper speech recognition tool and a new text-to-speech (TTS) technology from the company that is claimed to offer "human-like" audio on the company's ChatGPT app for smartphones.

OpenAI revealed in a blog post that the company's new image recognition capability for ChatGPT will be available on all platforms, while the voice conversations feature will be available on iOS and Android via an opt-in setting. These features will be available to ChatGPT Plus and Enterprise subscribers, and there's no word on whether it will roll out to users on the free tier in the future.

The voice conversations coming to ChatGPT can be enabled by going to Settings > New Features and toggling the option to enable voice conversations. You can then select from five voices — OpenAI says it has worked with professional voice actors to offer the new feature. The ChatGPT app will be able to answer questions by converting your spoken queries into text that can be understood by the chatbot, and responses will be turned into audio using the company's new TTS technology.

ChatGPT isn't the only service that will use OpenAI's new TTS technology — Spotify on Monday announced a new AI-based voice translation tool for podcast creators that can automatically translate a podcast from English to French, German, and Spanish. The tool is being tested with a few podcast hosts and translated episodes will be available to all users wherever Spotify is available, according to the streaming platform. 

OpenAI says the new image recognition tool runs on the company's multimodal GPT-3.5 and GPT-4 models and are capable of analysing images and text contained in photographs, screenshots, and documents. Users can either capture an image or share an existing one on their phone with ChatGPT to get insights from the chatbot.

ChatGPT will also allow users to share multiple images that can be discussed with the chatbot, according to OpenAI. If you want it to focus on a specific area, the built-in drawing tool will allow you to mark a part of the image. For example, drawing around a dislodged bicycle chain in a photo shared with ChatGPT might allow the chatbot to show you ways to fix the problem.


Is the Samsung Galaxy Z Flip 5 the best foldable phone you can buy in India right now? We discuss the company's new clamshell-style foldable handset on the latest episode of Orbital, the Gadgets 360 podcast. Orbital is available on Spotify, Gaana, JioSaavn, Google Podcasts, Apple Podcasts, Amazon Music and wherever you get your podcasts.
Affiliate links may be automatically generated - see our ethics statement for details.
Comments

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

David Delima
As a writer on technology with Gadgets 360, David Delima is interested in open-source technology, cybersecurity, consumer privacy, and loves to read and write about how the Internet works. David can be contacted via email at DavidD@ndtv.com, on Twitter at @DxDavey, and Mastodon at mstdn.social/@delima. More
Bitcoin, Ether See No Gains Despite Tether, Chainlink and Other Altcoins Recording Profits
SAG-AFTRA Video Game Performers Vote to Authorise Strike Against Publishers and Studios
Share on Facebook Gadgets360 Twitter Share Tweet Snapchat Share Reddit Comment google-newsGoogle News
 
 

Advertisement

Follow Us

Advertisement

© Copyright Red Pixels Ventures Limited 2024. All rights reserved.
Trending Products »
Latest Tech News »