Anthropic Releases Claude 4 Series AI Models With Improved Coding Capability and Tool Use

Anthropic said Claude Sonnet 4 achieved state-of-the-art (SOTA) on the SWE-Bench benchmark with a score of 72.7 percent.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 23 May 2025 12:18 IST
Highlights
  • Anthropic also made Claude Code generally available
  • Claude Sonnet 4 is available to those on the free tier
  • Opus 4 comes with improvements in memory and tool use

Both Claude 4 models feature two modes — near instant responses and an Extended Thinking mode

Photo Credit: Anthropic

Anthropic introduced Claude 4 artificial intelligence (AI) models at its inaugural developer conference on Thursday. The San Francisco-based AI firm unveiled Claude Opus 4 and Claude Sonnet 4 models, and announced new capabilities including Extended Thinking with tool use. Opus 4 is said to be state-of-the-art (SOTA) in coding, tool use, and writing. Additionally, Claude Code is now generally available, and individuals can find its beta extensions in VS Code and JetBrains. It is also among the models available on GitHub.

Anthropic Unveils Claude 4 AI Models

In a newsroom post, the AI firm detailed the new models as well as the new features it is rolling out across its chatbot and application programming interface (API). Anthropic's latest large language models (LLMs) put a heavy focus on coding capabilities and agentic functions.

Advertisement

Both Opus 4 and Sonnet 4 are hybrid models with two modes: near-instant responses and Extended Thinking for deeper reasoning. Opus 4 is the company's flagship-tier AI model. Calling it “the best coding model in the world,” Anthropic claimed that it scored 72.5 percent on the SWE-Bench and 43.2 percent on the Terminal-Bench benchmarks. Both of these benchmarks measure the coding capabilities of a model.

Claude 4 models' performance on the SWE-Bench
Photo Credit: Anthropic

Advertisement

 

Similarly, Claude Sonnet 4 is said to be significantly improved compared to its predecessor. Based on internal evaluation, the company claimed it scored 72.7 percent on SWE-Bench (SOTA). While it falls short of Opus 4's score in other domains, Anthropic says the model balances performance and efficiency better than the flagship LLM.

Advertisement

Apart from performance-based improvements, Claude Opus 4 can maintain long-term task awareness with improvements in its memory. Anthropic has also fixed the issue where models take a shortcut or find a loophole to complete a task. During extended thinking, both models can use tools. This will allow the models to alternate between native reasoning and exploring external information (such as web search) to improve responses. Other improvements include the ability to use tools in parallel and greater prompt adherence.

Currently, the Opus 4 and Sonnet 4 models with both modes are available to Claude Pro, Max, Team, and Enterprise subscribers. Sonnet 4 is also available to the free users. Additionally, developers can access these LLMs via the Anthropic API, as well as on Amazon Bedrock and Google Cloud's Vertex AI. The company said the pricing is being kept the same as the previous generation.

Advertisement

Opus 4 will cost developers $15 (roughly Rs. 1,290) per million of input tokens and $75 (roughly Rs. 6,440) per million of output tokens. On the other hand, Sonnet 4 is priced at $3 (roughly Rs. 260) per million input, and $15 (roughly Rs. 1,290) per million output tokens.

Beyond the new AI models, Anthropic also announced new features, and made Claude Code generally available. First introduced in February as a research preview, it is an agentic coding tool that can perform a wide range of coding tasks. Beta extension of the feature is now available in VS Code and JetBrains. Additionally, the company is also releasing a Claude Code software development kit (SDK), which is available in beta on GitHub.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. This Is How Samsung's Wide-Folding Handset Might Look Like in Real Life
  2. Microsoft Unveils Surface Laptop Ultra as Its Most Powerful Laptop to Date
  3. Aspire X 16 AI, Aspire 18 AI Debut Alongside New All-in-One Desktops
  4. HP OmniBook X 14, Ultra 16 Refreshed With Nvidia RTX Spark 'Superchip'
  5. Acer Predator Helios 18 AI (2026) Debuts With an Intel Core Ultra 9 CPU
  6. Dell XPS 13 Refreshed With Intel Panther Lake CPUs to Rival MacBook Neo
  7. Google Drive's Document Scanner Gets Updated With These New Features
  8. Apple's Meta Ray-Ban Rivalling Smart Glasses Delayed Till Next Year
  9. Acer Swift Air 14 Launched With Intel Core Series 3 CPU, Lightweight Design
  10. Fable Delayed to February 2027 to Avoid Clash With GTA 6 Release
  1. HP OmniBook Ultra 16 (2026), OmniBook X 14 (2026) Unveiled With Nvidia's RTX Spark 'Superchip'
  2. Acer Swift Air 14 Launched With Intel Core Series 3 CPU, Lightweight Design at Computex 2026
  3. Microsoft Surface Laptop Ultra Announced With Blackwell RTX GPU, Nvidia RTX Spark Superchip
  4. Acer Aspire X 16, Aspire 18 AI Copilot+ PCs Launched Alongside Aspire C27 AI, Aspire C24 AI All-in-One Desktops
  5. Acer Predator Helios 18 AI (2026) Launched With Intel Core Ultra 9 CPU, Up to Nvidia GeForce RTX 5090 GPU
  6. Computex 2026: Samsung Display to Showcase 4K 360Hz QD-OLED, Handheld Gaming OLED Panels
  7. Unreleased Beats Headphones Spotted in Lamine Yamal's Instagram Post After Visiting US FCC Database
  8. Google Drive's Document Scanner Gets Major Refresh With Support for Detecting Duplicates, Multiple Page Scanning
  9. Dell XPS 13 (2026) Launched With 2.5K Display, Up to Intel Core Ultra 7 Series 3 Processor: Price, Specifications
  10. Xbox Delays Fable to February 2027 to Give It a Window 'All to Itself', Avoid Clash With GTA 6
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.