ChatGPT Updated With Support for Voice Conversation, Image Recognition Features: Details

ChatGPT will soon be able to 'discuss' images shared by users and hold 'back-and-forth' conversations using five voices.

Advertisement
Written by David Delima, Edited by Siddharth Suvarna | Updated: 26 September 2023 12:56 IST
Highlights
  • ChatGPT will soon be able to converse using text-to-speech technology
  • The chatbot is also gaining the ability to analyse and understand images
  • These ChatGPT features will roll out to users in the coming weeks

ChatGPT Plus and Enterprise subscribers will have access to the features in the coming weeks

Photo Credit: Unsplash/ @ilgmyzin

ChatGPT has been updated with support for voice conversations and image recognition, OpenAI announced on Monday. The company's AI-powered chatbot will soon be able to understand images captured or shared by users and provide details or related information across platforms where the chatbot is available. It will also be capable of back-and-forth conversation using OpenAI's Whisper speech recognition tool and a new text-to-speech (TTS) technology from the company that is claimed to offer "human-like" audio on the company's ChatGPT app for smartphones.

OpenAI revealed in a blog post that the company's new image recognition capability for ChatGPT will be available on all platforms, while the voice conversations feature will be available on iOS and Android via an opt-in setting. These features will be available to ChatGPT Plus and Enterprise subscribers, and there's no word on whether it will roll out to users on the free tier in the future.

The voice conversations coming to ChatGPT can be enabled by going to Settings > New Features and toggling the option to enable voice conversations. You can then select from five voices — OpenAI says it has worked with professional voice actors to offer the new feature. The ChatGPT app will be able to answer questions by converting your spoken queries into text that can be understood by the chatbot, and responses will be turned into audio using the company's new TTS technology.

Advertisement

ChatGPT isn't the only service that will use OpenAI's new TTS technology — Spotify on Monday announced a new AI-based voice translation tool for podcast creators that can automatically translate a podcast from English to French, German, and Spanish. The tool is being tested with a few podcast hosts and translated episodes will be available to all users wherever Spotify is available, according to the streaming platform. 

Advertisement

OpenAI says the new image recognition tool runs on the company's multimodal GPT-3.5 and GPT-4 models and are capable of analysing images and text contained in photographs, screenshots, and documents. Users can either capture an image or share an existing one on their phone with ChatGPT to get insights from the chatbot.

ChatGPT will also allow users to share multiple images that can be discussed with the chatbot, according to OpenAI. If you want it to focus on a specific area, the built-in drawing tool will allow you to mark a part of the image. For example, drawing around a dislodged bicycle chain in a photo shared with ChatGPT might allow the chatbot to show you ways to fix the problem.

Advertisement


Is the Samsung Galaxy Z Flip 5 the best foldable phone you can buy in India right now? We discuss the company's new clamshell-style foldable handset on the latest episode of Orbital, the Gadgets 360 podcast. Orbital is available on Spotify, Gaana, JioSaavn, Google Podcasts, Apple Podcasts, Amazon Music and wherever you get your podcasts.
Affiliate links may be automatically generated - see our ethics statement for details.
 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. Apple's M5-Powered MacBook Pro 14-inch, iPad Pro Now Available in India
  2. DeepSeek-OCR Could Change How AI Reads Text From Images
  3. These Are the 5 Biggest OxygenOS 16 Features You Should Know About
  4. OnePlus 15 Camera Details Revealed Ahead of October 27 Launch
  5. iQOO Neo 11 Key Specifications Tipped Ahead of Launch in China
  6. Samsung Galaxy XR Headset Launched With Hand Tracking at This Price
  7. OnePlus 15 India Launch Teased; Key Features Revealed Ahead of Launch
  8. YouTube's New Tool Will Detect Deepfakes of Content Creators
  9. JioSaavn Announces 'Limited-Time' Annual Plan: Price, Benefits
  10. Diwali Blackout: How the AWS Outage Crippled Major Apps Across the World
  1. Apple's 18-Inch Foldable iPad Said to Be Delayed; Could Launch in 2029 With Hefty Price Tag
  2. Bitcoin Slumps to $108,000 as Traders Await US CPI Data
  3. Amazon Is Reportedly Planning to Replace Half a Million Workers With Robots and Automation
  4. OnePlus 15 Camera Details Revealed; Confirmed to Sport 50-Megapixel Periscope Telephoto Lens
  5. Nintendo to Host a Second Kirby Air Riders Direct Presentation This Week
  6. Samsung Galaxy S26 Series Could Feature Exynos 2600 With Faster NPU Performance Than Apple's A19 Pro Chip
  7. YouTube Launches Likeness Detection Tool to Protect Creators from AI-Generated Deepfakes
  8. JioSaavn Announces ‘Limited-Time’ Annual Plan With Ad-Free Music Streaming, Offline Playback
  9. Redmi K90 Design, Key Features Revealed; Confirmed to Debut on October 23 With 7,100mAh Battery
  10. Oppo Reno 15 Pro Max Tipped to Launch With 200-Megapixel Triple-Rear Camera Unit
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.