ChatGPT Advanced Voice Mode With Vision Rolling Out to Paid Subscribers

The real-time video feature in ChatGPT will let the AI access a smartphone’s camera to process the visual information.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 13 December 2024 15:52 IST
Highlights
  • The vision feature is available to ChatGPT Plus, Team, and Pro users
  • ChatGPT vision is only available via the mobile apps
  • The feature will be rolled out to Enterprise and Edu users in early 2025
ChatGPT Advanced Voice Mode With Vision Rolling Out to Paid Subscribers

ChatGPT users can use the new Screenshare feature to send feedback on their experience

Photo Credit: OpenAI

OpenAI rolled out the Advanced Voice Mode with Vision feature in ChatGPT on Thursday. The feature, which lets the artificial intelligence (AI) chatbot access the smartphone's camera to capture visual information of the user's surrounding, will be available to all ChatGPT Plus, Team and Pro subscribers. The feature draws on the capabilities of GPT-4o and can provide real-time voice responses on what is being shown in the camera. Vision in ChatGPT was first unveiled in May during the company's Spring Updates event.

ChatGPT Gets Vision Capabilities

The new ChatGPT feature was rolled out on day six of OpenAI's 12-day feature release schedule. The AI firm has so far released the full version of the o1 model, the video generation Sora model, and a new Canvas tool. Now, with the Advanced Voice mode with Vision, users can let the AI see their surroundings and ask questions based on them.

In a demonstration, the OpenAI team members interacted with the chatbot with the camera on, and introduced several people. After that, the AI could answer a quiz on those people even when they were not actively on the screen. This highlights that the vision mode also comes with memory, although the company did not specify how long the memory lasts.

Users can use the ChatGPT vision feature to show the AI their fridge and ask for recipes or by showing their wardrobe and asking for outfit recommendations. They can also show the AI a landmark outside and ask questions about it. This feature is paired with the chatbot's low latency and emotive Advanced Voice mode, making it easier for users to interact in natural language.

Advertisement

Once the feature rolls out to users, they can go to the mobile app of ChatGPT and tap on the Advanced Voice icon. In the new interface, they will now see a video option, tapping which will give the AI access to the user's camera feed. Additionally, a Screenshare feature is also available which can be accessed by tapping the three dot menu.

Screenshare feature will enable the AI to see the user's device and any app or screen they go to. This way, the chatbot can also help users with smartphone-related issues and queries. Notably, OpenAI said that all Team subscribers will get access to the feature within the next week in the latest version of the ChatGPT mobile app.

Advertisement

Most Plus and Pro users will also get the feature, however, users in the European Union region, Switzerland, Iceland, Norway, and Liechtenstein will not get it at present. On the other hand, Enterprise and Edu users will get access to ChatGPT's Advanced Voice with Vision in eary 2025.

 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. Apple Announces iOS 26 With Liquid Glass Design, These New Features
  2. iQOO 13 and More Available With Discounts During iQOO 5th Anniversary Sale
  3. Poco F7 India Launch Teased; Flipkart Availability Confirmed
  4. WWDC 2025 Highlights: Apple Unveils iOS 26, macOS 26 and Liquid Glass UI
  5. Everything We Know About the Vivo T4 Ultra Ahead of Its June 11 Launch
  6. AI+ Smartwatch With Built-in TWS Launching This Month: Report
  7. Oppo K13x 5G Price Range in India, Retail Box Leaked Online
  8. Apple Intelligence Will Now Provide Live Translations on Your iPhone
  1. WWDC 2025: watchOS 26 Offers AI Workout Buddy, Wrist Flick Gesture, Liquid Glass Design, and More
  2. WWDC 2025: Apple Unveils iPadOS 26 With New Windowing System, Liquid Glass UI, and More
  3. WWDC 2025: macOS Tahoe 26 Unveiled With New Design, Continuity Features and Big Update to Spotlight
  4. WWDC 2025: Apple Announces iOS 26 With New Liquid Glass Design, Apple Intelligence Enhancements and More
  5. WWDC 2025: Apple Intelligence Models Expanded to Developers, Live Translation Feature Unveiled
  6. Xbox Chief Phil Spencer Hints at 'Return' of Halo: Combat Evolved Next Year
  7. Vivo X Fold 5 Design Teased; Confirmed to Feature 8T LTPO Panels, Meet IP5X and IPX9+ Certifications
  8. Oppo K13x 5G Price Range in India Tipped; Alleged Retail Box Suggests Flat Display
  9. WWDC 2025: Apple Faces AI, Regulatory Challenges As it Woos Developers at Annual Conference
  10. WazirX Parent Zettai Urges Singapore Court to Review WazirX Restructuring, Extend Moratorium
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.