Anthropic’s Claude Sonnet 4.5 AI Model Introduced as the ‘Best Coding Model in the World’

Anthropic claims that Claude Sonnet 4.5 scored 77.2 percent on the SWE-bench Verified benchmark, beating GPT-5 and Gemini 2.5 Pro.

Written by Akash Dutta, Edited by Ketan Pratap | Updated: 30 September 2025 13:43 IST
Highlights
  • Anthropic says the model shows better reasoning than Opus 4.1
  • Claude Sonnet 4.5 also improves agentic and computer use functions
  • It is currently available to all users

Anthropic is also improving Claude Code, the Claude API, and the Claude for Chrome extension

Photo Credit: Anthropic

Anthropic released the Claude Sonnet 4.5 artificial intelligence (AI) model on Monday. Calling it the “best coding model in the world,” the company highlighted that it has been improved across multiple domains, including coding, agentic operations, computer use, reasoning, and domain-specific knowledge. The new model will be available across the Claude website and mobile apps, Claude Code, the application programming interface (API), as well as the Claude for Chrome extension, which is still in testing. Anthropic also claims that the AI model can work autonomously for 30 hours on a task.

Claude Sonnet 4.5 Features and Capabilities

In a blog post, the AI firm detailed the new model. Based on the company's claims, Claude Sonnet 4.5 is designed to make a massive leap in coding and agentic performance, although other areas have also been upgraded. Despite the upgrades, however, the new model only offers incremental improvements and does not bring any new capability or modality.

Based on internal testing, Anthropic claims that the large language model (LLM) achieved a score of 77.2 percent on the SWE-bench Verified benchmark, which measures the agentic coding capabilities of a model. Notably, this is higher than the scores of OpenAI's GPT-5 and Google's Gemini 2.5 Pro, as well as the company's own Opus 4.1.


Claude Sonnet 4.5 demo

 

Gadgets 360 staff members briefly tested the model's coding capabilities. We asked it to create a WhatsApp-like messaging chatbot, complete with individual and group chats, as well as audio and video calls. In just two minutes, it wrote 436 lines of code in React and generated a preview of the fully functional (minus the server connectivity) interface.


Other benchmarks where the AI model is said to have led the charts include Terminal-Bench, OSWorld for computer use, AIME 2025 for high-school mathematics, and Finance Agent for financial analysis. In the reasoning-based GPQA Diamond, Gemini 2.5 Pro fared better, while GPT-5 led the charts in the MMMU benchmark for visual reasoning and MMMLU for multilingual performance.

Anthropic also claimed that the model surpassed all the company's older models in domain-specific knowledge and reasoning across finance, law, medicine, and STEM fields.


Coming to safety, the AI firm claims that Claude Sonnet 4.5 is its “most aligned frontier model.” Anthropic says it has reduced behaviours such as sycophancy, deception, power-seeking, and the tendency to encourage delusional thinking. Safeguards have also been added to protect it from prompt injection attacks.

 
