OpenAI o3 and o4-Mini AI Models With Visual Reasoning Capabilities Released

OpenAI says o3 and o4-mini can agentically use and combine every tool within ChatGPT.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 17 April 2025 18:15 IST
Highlights
  • The o-series AI models can extract information from even imperfect images
  • OpenAI’s o3 and o4-mini outperform GPT-4o and o1 in several benchmarks
  • OpenAI said the AI models might struggle with perception errors

The o3 and o4-mini AI models are rolling out to ChatGPT’s paid subscribers

Photo Credit: Reuters

OpenAI released two new artificial intelligence (AI) models on Wednesday. Dubbed o3 and o4-mini, these are the company's latest reasoning-focused models with visible chain-of-thought (CoT). The San Francisco-based AI firm stated that these models come with visual reasoning capability, which means they can analyse and “think” about an image to answer more complex user queries. Successor to the o1 and o3-mini, these models will currently be available to the paid subscribers of ChatGPT. Notably, the company also released the GPT-4.1 series of AI models earlier this week.

OpenAI's New Reasoning Models Arrive With Improved Performance

In a post on X (formerly known as Twitter), the official handle of OpenAI announced the release of the new large language models (LLMs). Calling them the company's “smartest and most capable models,” the AI firm highlighted that these models now come with visual reasoning capability.

Advertisement

Visual reasoning essentially means that these AI models can better analyse images to extract contextual and implicit information from them as well. On its website, OpenAI said these are the first models from the company that can agentically use and combine every tool within ChatGPT. These include web search, Python, image analysis, file interpretation, and image generation.

This means the o3 and o4-mini AI models can look up the image on the web, manipulate the image by zooming, cropping, flipping, and enhancing them, and even run a Python code to extract information. OpenAI said this would allow the models to find information even from imperfect images.

Advertisement

Some of the tasks these models can now perform include reading handwriting from a notebook that's upside down, reading a faraway sign with barely readable text, recognising a particular question from a large list, finding a bus schedule from the picture of a bus, solving a puzzle, and more.

Coming to the performance, OpenAI claimed that the o3 and o4-mini AI models outperform GPT-4o and o1 models on the MMMU, MathVista, VLMs are blind, and CharXiv benchmarks. The company did not share any performance comparisons with third-party AI models.

Advertisement

OpenAI also highlighted several limitations of these models. The AI models could perform unnecessary image manipulation steps and tool calls to cause overly long chains of thought. The o3 and o4-mini are also susceptible to perception errors, and they can misinterpret visual information to give incorrect responses. Further, the AI firm highlighted that the models might also have reliability-related issues.

Both o3 and o4-mini AI models are being made available to ChatGPT Plus, Pro, and Team users. They will replace the o1, o3-mini, and o3-mini-high models in the model selector. Enterprise and Edu users will get access to them next week. Developers can access the models via the Chat Completions and Responses application programming interfaces (APIs).

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Oppo Find X9 Ultra With 200-Megapixel Periscope Camera Launched Globally
  2. Vivo X300 FE Roundup: Expected Price in India, Specifications
  3. Motorola Edge 70 Fusion Review
  4. Oppo Find X9s Pro Launched With 200-Megapixel Cameras: See Price, Features
  5. Poco M8s 5G Debuts Globally With 7,000mAh Battery: See Price, Features
  6. Tim Cook to Step Down as Apple CEO as John Ternus Named Successor
  7. Apple's iOS 27 Update Might Drop Support for These iPhone Models
  1. NASA Shuts Down Voyager 1 Instrument to Extend Mission Life in Deep Space
  2. Oppo Enco Clip 2 With Open-Ear Design, Up to 40 Hours Total Battery Life Launched Alongside Oppo Watch X3 Mini
  3. Vivo Y6t Launched With 6,500mAh Battery, Snapdragon 4 Gen 2 SoC: Price, Specifications
  4. OCBC Partners Lion Global Investors and DigiFT to Launch Tokenised Gold Fund With GOLDX Token
  5. Oppo Pad 5 Pro Launched With 13,380mAh Battery, Snapdragon 8 Elite Gen 5 SoC Alongside Oppo Pad Mini: Price, Features
  6. Redmi K90 Max Launched With Dimensity 9500 SoC, 8,550mAh Battery and Active Cooling Fan: Price, Specifications
  7. Oppo Find X9 Ultra Launched With Snapdragon 8 Elite Gen 5 SoC, 200-Megapixel Periscope Camera: Price, Specifications
  8. Oppo Find X9s Pro Launched With 200-Megapixel Cameras, 7,025mAh Battery: Price, Specifications
  9. OnePlus Ace 6 Ultra Geekbench Listing Reveals MediaTek Dimensity 9500 Chip, 16GB RAM
  10. Motorola Edge 70 Pro+ Leaked Renders Hint at Design, Five Colour Options
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.