Anthropic Introduces Claude Opus 4.6 AI Model With Improved Agentic Performance

Anthropic’s Claude Opus 4.6 AI model brings improved coding, long-context handling, and agentic workflows.

Advertisement
Written by Akash Dutta, Edited by Rohan Pal | Updated: 6 February 2026 13:48 IST
Highlights
  • Claude Opus 4.6 gets a context window of 1 million tokens
  • It outperforms Claude Opus 4.5 on benchmarks like Terminal-Bench 2.0
  • The new model has safety features for cybersecurity

Claude Opus 4.6 powers the new agent teams feature in Claude Code

Photo Credit: Anthropic

Anthropic released the Claude Opus 4.6 artificial intelligence (AI) model on Thursday, bringing a major update to its Opus 4 foundation model. The new model addresses key limitations in the predecessor by improving sustained performance across complex tasks, particularly in software engineering and knowledge-intensive domains. The Claude Opus 4.5 model was adept at handling advanced reasoning-based tasks, but often struggled with long-horizon contexts and edge cases in large databases. A beta version of Opus 4.6 fixes that with a context window of one million tokens.

Claude Opus 4.6 AI Model Is Here

In a newsroom post, Anthropic announced and detailed the latest frontier model. Marking a first for the Opus lineup, a beta version of the model now supports a context window of up to one million tokens, allowing it to process vast amounts of information while minimising performance degradation during long interactions. This is a step up from the 2,00,000 token limits in earlier Opus models.

Advertisement

Claude Opus 4.6 also comes with new features like context compaction that summarise and refresh older data during prolonged tasks. The architecture incorporates adaptive thinking, where the model assesses query complexity to allocate deeper reasoning as needed, and effort controls ranging from low to max for optimising speed, intelligence and cost.

On benchmarks, Claude Opus 4.6 sets new highs. Based on internal evaluations from the company, the post claims that the AI model leads frontier models on Terminal-Bench 2.0 for command-line proficiency, and Humanity's Last Exam for multidisciplinary reasoning. In agentic evaluations like GDPval-AA, it surpasses OpenAI's GPT-5.2 by about 144 Elo points and its own Opus 4.5 by 190 points, focusing on finance and legal tasks. SWE-bench Verified scores average 81.42 percent with optimised prompting, while CyberGym tests show strong no-thinking baseline performance.

Advertisement

Anthropic highlights that safeguards remain a core priority with the new model. It is said to match or exceed peers in safety audits, with low rates of deception or sycophancy and the lowest over-refusal tendencies among recent releases. The company said it added six new cybersecurity probes to detect potential misuse, accelerating defensive applications like vulnerability hunting in open-source code.

In coding, Opus 4.6 manages large repositories autonomously, conducts code reviews and debugs with high accuracy. It also assembles agent teams for parallel development via Claude Code's research preview. For business workflows, it runs financial analyses, generates documents and handles multi-step searches in tools like Claude in Excel, now upgraded for unstructured data and long-running tasks. A research preview for Claude in PowerPoint extends this to presentations. In domains like computational biology, it delivers nearly double the performance of Opus 4.5, aiding scientific discovery.

Advertisement

Claude Opus 4.6 is available now via the website, mobile and desktop apps, Anthropic's application programming interface (API), and major cloud providers. API pricing starts at $5 (roughly Rs. 453) per million input tokens and $25 (roughly Rs. 2,300) for output, with premiums for extended contexts.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement
Popular Mobile Brands
  1. Samsung Galaxy S26 FE Korean Variant Surfaces on Geekbench
  2. Here's What Apple Could Launch Over the Next Two Years
  3. Oppo Reno 16 Series to Launch in India Soon Alongside These Accessories
  4. Sennheiser HD 480 Pro Plus Review
  1. Samsung Galaxy Watch 9, Watch Ultra 2 Leak Shows New Design, Colours and Features
  2. Oppo Reno 16 Series India Launch Teased; Could Arrive Alongside Oppo Bubble Accessory and New Enco Series Headset
  3. Samsung Galaxy S26 FE Korean Variant Spotted on Geekbench With Exynos 2500 SoC, 8GB RAM
  4. Apple’s Future Product Roadmap to Reportedly Include Foldable iPhone, AI Glasses, and Camera AirPods
  5. JWST Reveals Salt Clouds in the Atmosphere of the ‘Pink Planet’ GJ 504b
  6. The East Palace 2026 OTT Release Date: When and Where to Watch it Online?
  7. Manithan Deivamagalam Now Available For Streaming Online: What You Need to Know
  8. Cup: Love All Play OTT Release Date: Know When and Where to Watch it Online
  9. Sugar Season 2 OTT Release: Where to Watch Colin Farrell’s Sci-Fi Thriller Series
  10. IceCube Traces High-Energy Neutrino to Distant Starburst Galaxy
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.