Google’s Updated Gemini 3 Deep Think Outperforms GPT-5.2 and Claude Opus 4.6

Google says Gemini 3 Deep Think was upgraded in partnership with scientists and researchers to tackle challenging research.

Written by Akash Dutta, Edited by Rohan Pal | Updated: 13 February 2026 15:54 IST
Highlights
  • The updated model is available to Google AI Ultra subscribers
  • Select researchers and enterprises can access the model via API
  • Gemini 3 Deep Think scored 84.6 percent on the ARC-AGI-2 benchmark

Gemini 3 Deep Think also scored 48.4 percent on Humanity’s Last Exam

Photo Credit: Google

Google on Thursday updated its Gemini 3 Deep Think artificial intelligence (AI) model. The frontier model was already the company's most intelligent model when it launched in December 2025. Now, with this upgrade, Google says it can help scientists research challenging problems. The Mountain View-based tech giant highlighted that the update improves the model's performance across all major benchmarks. Most notably, the model set new records on ARC-AGI-2 and Humanity's Last Exam, outperforming both OpenAI's GPT-5.2 and Anthropic's Claude Opus 4.6.

Gemini 3 Deep Think Gets Upgraded

In a blog post, the tech giant said it is releasing a major upgrade to Gemini 3 Deep Think that will allow it to solve modern challenges across science, research, and engineering. The model continues to be available to Google AI Ultra subscribers, and a select group of researchers and enterprises can now also access it via the company's application programming interface (API).

Announcing the update, Google CEO Sundar Pichai said, “Gemini 3 Deep Think is getting a significant upgrade. We've refined Deep Think in close partnership with scientists and researchers to tackle tough, real-world challenges.” Elon Musk called the development “Impressive,” responding to the post.

With the improvement, the AI model is claimed to have scored 84.6 percent on the ARC-AGI-2 benchmark, which measures the reasoning capability of frontier models. Google said the score was verified by the ARC Prize Foundation. The model also set a new record by scoring 48.4 percent (without tools) on Humanity's Last Exam, widely regarded as one of the most difficult AI benchmarks.

Additionally, the company claimed that Gemini 3 Deep Think achieved an Elo score of 3,455 on Codeforces. In each of these tests, the Google model is said to outperform frontier models from OpenAI and Anthropic.

Google also shared how some researchers are using the AI model on real-world scientific problems. It highlighted that Lisa Carbone, a mathematician at Rutgers University, used Gemini 3 Deep Think to review a highly technical mathematics paper. She observed that the model identified a subtle logical flaw that had previously passed through human peer review unnoticed.

 

© Copyright Red Pixels Ventures Limited 2026. All rights reserved.