OpenAI Unveils GPT-4 Turbo With Vision Capabilities in API and ChatGPT

OpenAI claims the improved vision capabilities will allow JSON mode and function calling.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 10 April 2024 19:41 IST
Highlights
  • The GPT-4 Turbo with Vision has a context window of 1,28,000 tokens
  • The OpenAI AI model has a knowledge cut-off of December 2023
  • AI coding assistant Devin is powered by GPT-4 Turbo Vision

GPT-4 Turbo with Vision allows the model to take in images and answer questions about them

Photo Credit: Unsplash/Levart_Photographer

OpenAI announced a major improvement to its latest artificial intelligence (AI) model GPT-4 Turbo on Tuesday. The AI model now comes with computer vision capabilities, allowing it to process and analyse multimedia inputs. It can answer questions about an image, video, and more. The company also highlighted several AI tools which are powered by GPT-4 Turbo with Vision including the AI coding assistant Devin and Healthify's Snap feature. Last week, the AI firm introduced a new feature that would allow users to edit DALL-E 3 generated images within ChatGPT.

The announcement was made by the official account of OpenAI Developers, which said in an X (formerly known as Twitter) post, “GPT-4 Turbo with Vision is now generally available in the API. Vision requests can now also use JSON mode and function calling.” Later, the X account of OpenAI also revealed that the feature is now available in API and it is being rolled out in ChatGPT.

Advertisement

GPT-4 Turbo with Vision is essentially the GPT-4 foundation model with the higher token outputs introduced with the Turbo model, and it now comes with improved computer vision to analyse multimedia files. The vision capabilities can be used in a variety of methods. The end user, for instance, can use this capability by uploading an image of the Taj Mahal on ChatGPT, and asking it to explain what material the building is made up of. Developers can take this a step ahead and fine-tune the capability in their tools for specific purposes.

OpenAI highlighted some of these use cases in the post. Cognition AI's Devin chatbot, which is an AI-powered coding assistant, uses GPT-4 Turbo with Vision to see the complex coding tasks and its sandbox environment to create programmes.

Advertisement

Similarly, the Indian calorie tracking and nutrition feedback platform Healthify has a feature called Snap where users can click a picture of a food item or a cuisine, and the platform reveals the possible calories in it. With GPT-4 Turbo with Vision's capabilities, it now also recommends what the user should do to burn the extra calories or ways to reduce calories in the meal.

Notably, this AI model has a context window of 1,28,000 tokens and its training data runs up to December 2023.


Is the Samsung Galaxy Z Flip 5 the best foldable phone you can buy in India right now? We discuss the company's new clamshell-style foldable handset on the latest episode of Orbital, the Gadgets 360 podcast. Orbital is available on Spotify, Gaana, JioSaavn, Google Podcasts, Apple Podcasts, Amazon Music and wherever you get your podcasts.
Affiliate links may be automatically generated - see our ethics statement for details.
 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Further reading: OpenAI, GPT 4, GPT, Artificial intelligence, AI
Advertisement

Related Stories

Popular Mobile Brands
  1. Xiaomi 17T Key Specifications Revealed Via Benchmarking Site
  2. Alien: Isolation 2 Teaser Revealed by Sega and Creative Assembly
  3. Apple Could Launch These New Devices Once John Ternus Takes Over
  4. Vivo X Fold 6 Could Bring These Upgrades Over the Current Model
  5. Sony Hikes PS5, PS5 Pro and PS Portal Prices Across These Regions
  1. Sony's PS5, PS5 Pro and PS Portal Get Price Hikes Across Southeast Asia; No Price Increase in India Yet
  2. Alien: Isolation 2 Gets Atmospheric Teaser From Creative Assembly and Sega
  3. Apple Said to Plan Launch of Foldable iPad, AI Smart Home Devices, Touchscreen MacBook Under John Ternus
  4. Vivo X500 Pro Max Tipped to Launch With 200-Megapixel Periscope Telephoto Camera, 50-Megapixel LOFIC Camera
  5. MediaTek Dimensity 7450 Chipset Unveiled Alongside Foldable-Ready Dimensity 7450X SoC
  6. Xiaomi 17T With Dimensity 8500 Chip, 12GB of RAM Surfaces on Geekbench Ahead of Launch
  7. Vivo X Fold 6 Could Feature 200-Megapixel Camera, Large Battery; Xiaomi Mix Fold 6 Launch Timeline Tipped
  8. A Visitor from Another Star: Interstellar Comet Reveals Alien Origins
  9. Jerax Season 1 OTT Release: Where to Watch This Kannada Fantasy Comedy Series
  10. Nukkad Naatak Now Streaming Online: Where to Watch Social Drama Online?
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.