ChatGPT Updated With Support for Voice Conversation, Image Recognition Features: Details

ChatGPT will soon be able to 'discuss' images shared by users and hold 'back-and-forth' conversations using five voices.

Advertisement
Written by David Delima, Edited by Siddharth Suvarna | Updated: 26 September 2023 12:56 IST
Highlights
  • ChatGPT will soon be able to converse using text-to-speech technology
  • The chatbot is also gaining the ability to analyse and understand images
  • These ChatGPT features will roll out to users in the coming weeks

ChatGPT Plus and Enterprise subscribers will have access to the features in the coming weeks

Photo Credit: Unsplash/ @ilgmyzin

ChatGPT has been updated with support for voice conversations and image recognition, OpenAI announced on Monday. The company's AI-powered chatbot will soon be able to understand images captured or shared by users and provide details or related information across platforms where the chatbot is available. It will also be capable of back-and-forth conversation using OpenAI's Whisper speech recognition tool and a new text-to-speech (TTS) technology from the company that is claimed to offer "human-like" audio on the company's ChatGPT app for smartphones.

OpenAI revealed in a blog post that the company's new image recognition capability for ChatGPT will be available on all platforms, while the voice conversations feature will be available on iOS and Android via an opt-in setting. These features will be available to ChatGPT Plus and Enterprise subscribers, and there's no word on whether it will roll out to users on the free tier in the future.

The voice conversations coming to ChatGPT can be enabled by going to Settings > New Features and toggling the option to enable voice conversations. You can then select from five voices — OpenAI says it has worked with professional voice actors to offer the new feature. The ChatGPT app will be able to answer questions by converting your spoken queries into text that can be understood by the chatbot, and responses will be turned into audio using the company's new TTS technology.

Advertisement

ChatGPT isn't the only service that will use OpenAI's new TTS technology — Spotify on Monday announced a new AI-based voice translation tool for podcast creators that can automatically translate a podcast from English to French, German, and Spanish. The tool is being tested with a few podcast hosts and translated episodes will be available to all users wherever Spotify is available, according to the streaming platform. 

Advertisement

OpenAI says the new image recognition tool runs on the company's multimodal GPT-3.5 and GPT-4 models and are capable of analysing images and text contained in photographs, screenshots, and documents. Users can either capture an image or share an existing one on their phone with ChatGPT to get insights from the chatbot.

ChatGPT will also allow users to share multiple images that can be discussed with the chatbot, according to OpenAI. If you want it to focus on a specific area, the built-in drawing tool will allow you to mark a part of the image. For example, drawing around a dislodged bicycle chain in a photo shared with ChatGPT might allow the chatbot to show you ways to fix the problem.

Advertisement


Is the Samsung Galaxy Z Flip 5 the best foldable phone you can buy in India right now? We discuss the company's new clamshell-style foldable handset on the latest episode of Orbital, the Gadgets 360 podcast. Orbital is available on Spotify, Gaana, JioSaavn, Google Podcasts, Apple Podcasts, Amazon Music and wherever you get your podcasts.
Affiliate links may be automatically generated - see our ethics statement for details.
 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. Android Security Alert: Billions of Devices at High Risk, Warns CERT-In
  2. Samsung Galaxy S26 Ultra and Galaxy S26 Pro First Look Leaked
  3. iPhone 16 Pro Max a Year Later: Still Worth Buying In 2025?
  1. Flipkart Big Billion Days Sale: iPhone 16 Pro Max Price to Drop Under Rs. 1 Lakh
  2. Apple Faces Lawsuit Over Allegedly Training Its AI Models on Copyrighted Books
  3. Samsung Galaxy S26 Ultra and S26 Pro: First Leaked Renders Have Arrived and Here's What You Can Expect
  4. Blink Charging to Support Crypto Payments Across Entire EV Charging Network by 2025-End
  5. Apple's iPhone 17 Launch Spoiled by Case Leak: We Explain How They Do It
  6. Android Security Alert: Billions of Devices at High Risk, Warns CERT-In; Android 15, 16 Affected
  7. Samsung Galaxy S26 Edge CAD Renders Tease iPhone-Like Camera Island, Thinner Body: Report
  8. Who Is Amit Kshatriya: Indian-Origin Appointed as NASA’s Associate Administrator
  9. Astronomers Discover Stellar Graveyard Filled With Black Hole and Neutron Star Collisions
  10. Scientists Visualize New Gold Quantum Needles at Nanoscale
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.