Anthropic Releases Claude 4 Series AI Models With Improved Coding Capability and Tool Use

Anthropic said Claude Sonnet 4 achieved state-of-the-art (SOTA) on the SWE-Bench benchmark with a score of 72.7 percent.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 23 May 2025 12:18 IST
Highlights
  • Anthropic also made Claude Code generally available
  • Claude Sonnet 4 is available to those on the free tier
  • Opus 4 comes with improvements in memory and tool use

Both Claude 4 models feature two modes — near instant responses and an Extended Thinking mode

Photo Credit: Anthropic

Anthropic introduced Claude 4 artificial intelligence (AI) models at its inaugural developer conference on Thursday. The San Francisco-based AI firm unveiled Claude Opus 4 and Claude Sonnet 4 models, and announced new capabilities including Extended Thinking with tool use. Opus 4 is said to be state-of-the-art (SOTA) in coding, tool use, and writing. Additionally, Claude Code is now generally available, and individuals can find its beta extensions in VS Code and JetBrains. It is also among the models available on GitHub.

Anthropic Unveils Claude 4 AI Models

In a newsroom post, the AI firm detailed the new models as well as the new features it is rolling out across its chatbot and application programming interface (API). Anthropic's latest large language models (LLMs) put a heavy focus on coding capabilities and agentic functions.

Advertisement

Both Opus 4 and Sonnet 4 are hybrid models with two modes: near-instant responses and Extended Thinking for deeper reasoning. Opus 4 is the company's flagship-tier AI model. Calling it “the best coding model in the world,” Anthropic claimed that it scored 72.5 percent on the SWE-Bench and 43.2 percent on the Terminal-Bench benchmarks. Both of these benchmarks measure the coding capabilities of a model.

Claude 4 models' performance on the SWE-Bench
Photo Credit: Anthropic

Advertisement

 

Similarly, Claude Sonnet 4 is said to be significantly improved compared to its predecessor. Based on internal evaluation, the company claimed it scored 72.7 percent on SWE-Bench (SOTA). While it falls short of Opus 4's score in other domains, Anthropic says the model balances performance and efficiency better than the flagship LLM.

Advertisement

Apart from performance-based improvements, Claude Opus 4 can maintain long-term task awareness with improvements in its memory. Anthropic has also fixed the issue where models take a shortcut or find a loophole to complete a task. During extended thinking, both models can use tools. This will allow the models to alternate between native reasoning and exploring external information (such as web search) to improve responses. Other improvements include the ability to use tools in parallel and greater prompt adherence.

Currently, the Opus 4 and Sonnet 4 models with both modes are available to Claude Pro, Max, Team, and Enterprise subscribers. Sonnet 4 is also available to the free users. Additionally, developers can access these LLMs via the Anthropic API, as well as on Amazon Bedrock and Google Cloud's Vertex AI. The company said the pricing is being kept the same as the previous generation.

Advertisement

Opus 4 will cost developers $15 (roughly Rs. 1,290) per million of input tokens and $75 (roughly Rs. 6,440) per million of output tokens. On the other hand, Sonnet 4 is priced at $3 (roughly Rs. 260) per million input, and $15 (roughly Rs. 1,290) per million output tokens.

Beyond the new AI models, Anthropic also announced new features, and made Claude Code generally available. First introduced in February as a research preview, it is an agentic coding tool that can perform a wide range of coding tasks. Beta extension of the feature is now available in VS Code and JetBrains. Additionally, the company is also releasing a Claude Code software development kit (SDK), which is available in beta on GitHub.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Best Mobiles Under Rs. 40,000 in India
  2. OnePlus Pad 4 to Launch in India With a 13,380mAh Battery on This Date
  3. Realme Buds T500 Pro Debut in India With Up to 56 Hours Total Battery Life
  4. YouTuber Demonstrates Flaw That Allows Money to Be Stolen From Locked iPhone
  5. Motorola Razr 70 Ultra Specifications Surface via Certification Site
  6. OnePlus Nord CE 6 Lite Appears on Geekbench With This MediaTek Chip
  1. OnePlus Nord CE 6 Lite Appears on Geekbench With Dimensity 7400 Chip, Android 16
  2. Meta’s Planned Facial Recognition Feature for Smart Glasses Faces Opposition From Privacy Orgs
  3. Vivo X300 Ultra Pricing Surfaces Online via Retail Listing in Europe
  4. YouTube's New Option Lets Users Effectively Turn Off Shorts From Their Feed
  5. South Korea Plans Blockchain-Based Payments for Government Spending
  6. Amazon Launches AI Store to Help Users Discover and Shop AI-Powered Devices
  7. Motorola Razr Fold, Lenovo Legion Y70 to Launch Alongside Y900 Tablet During Lenovo's May 19 Event
  8. Apple Tap-to-Pay Vulnerability Demonstrated on Video as YouTuber Steals $10,000 From a Locked iPhone
  9. Adobe’s New Firefly AI Assistant Can Perform Complex Design Tasks With Text Prompts
  10. Crimson Desert Has Sold Over 5 Million Copies, Pearl Abyss Confirms
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.