Anthropic Introduces Claude Opus 4.6 AI Model With Improved Agentic Performance

Anthropic’s Claude Opus 4.6 AI model brings improved coding, long-context handling, and agentic workflows.

Photo Credit: Anthropic

Claude Opus 4.6 powers the new agent teams feature in Claude Code

Highlights
  • Claude Opus 4.6 gets a context window of 1 million tokens
  • It outperforms Claude Opus 4.5 on benchmarks like Terminal-Bench 2.0
  • The new model has safety features for cybersecurity

Anthropic released the Claude Opus 4.6 artificial intelligence (AI) model on Thursday, bringing a major update to its Opus 4 foundation model. The new model addresses key limitations of its predecessor by improving sustained performance across complex tasks, particularly in software engineering and knowledge-intensive domains. The Claude Opus 4.5 model was adept at handling advanced reasoning-based tasks, but often struggled with long-horizon contexts and edge cases in large databases. Opus 4.6 aims to address that with a context window of one million tokens, which is currently available in beta.

Claude Opus 4.6 AI Model Is Here

In a newsroom post, Anthropic announced and detailed the latest frontier model. In a first for the Opus lineup, a beta version of the model now supports a context window of up to one million tokens, allowing it to process vast amounts of information while minimising performance degradation during long interactions. This is a step up from the 200,000-token limit in earlier Opus models.

Claude Opus 4.6 also comes with new features such as context compaction, which summarises and refreshes older data during prolonged tasks. The model also incorporates adaptive thinking, where it assesses query complexity to allocate deeper reasoning as needed, and effort controls ranging from low to max for balancing speed, intelligence and cost.
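To make the idea of context compaction concrete, here is a minimal Python sketch of the general technique: once a conversation exceeds a token budget, older messages are collapsed into a single summary while the most recent ones are kept verbatim. This is only an illustration, not Anthropic's implementation. The function names, the whitespace-based token estimate and the placeholder summariser are all assumptions made for this example.

```python
from typing import Callable, List


def rough_token_count(text: str) -> int:
    """Crude stand-in for a tokenizer: counts whitespace-separated words."""
    return len(text.split())


def compact_history(
    messages: List[str],
    budget: int,
    keep_recent: int,
    summarise: Callable[[List[str]], str],
) -> List[str]:
    """Collapse older messages into one summary when the history exceeds the budget.

    Hypothetical helper: the last `keep_recent` messages are kept verbatim and
    everything before them is replaced by a single summary string.
    """
    total = sum(rough_token_count(m) for m in messages)
    if total <= budget or len(messages) <= keep_recent:
        return list(messages)  # nothing to compact
    split = len(messages) - keep_recent
    older, recent = messages[:split], messages[split:]
    return [summarise(older)] + recent


def naive_summary(older: List[str]) -> str:
    """Placeholder summariser; a real system would call a model here."""
    return f"[summary of {len(older)} earlier messages]"


if __name__ == "__main__":
    history = ["a b c", "d e f", "g h i", "j k"]
    print(compact_history(history, budget=5, keep_recent=2, summarise=naive_summary))
```

In this sketch the summary replaces the older turns entirely, so the trade-off is between how much recent detail is preserved (`keep_recent`) and how often compaction is triggered (`budget`).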

On benchmarks, Claude Opus 4.6 sets new highs. Based on the company's internal evaluations, the post claims that the AI model leads frontier models on Terminal-Bench 2.0 for command-line proficiency, and on Humanity's Last Exam for multidisciplinary reasoning. On GDPval-AA, an agentic evaluation that focuses on finance and legal tasks, it surpasses OpenAI's GPT-5.2 by about 144 Elo points and its own Opus 4.5 by 190 points. SWE-bench Verified scores average 81.42 percent with optimised prompting, while CyberGym tests show strong no-thinking baseline performance.
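For readers unfamiliar with Elo gaps, the short Python sketch below converts a rating difference into an expected head-to-head score. It assumes the conventional Elo logistic curve with a 400-point scale. The post does not say exactly how GDPval-AA derives its ratings, so the resulting figures should be read as a rough interpretation rather than a reported result.

```python
def elo_expected_score(rating_gap: float) -> float:
    """Expected score of the higher-rated side under the standard Elo formula.

    Assumes the conventional 400-point logistic scale; the benchmark's own
    rating method may differ.
    """
    return 1.0 / (1.0 + 10 ** (-rating_gap / 400.0))


if __name__ == "__main__":
    for gap in (144, 190):
        print(f"{gap}-point gap -> expected score {elo_expected_score(gap):.2f}")
```

Under that assumption, a 144-point lead corresponds to an expected score of roughly 0.70, and a 190-point lead to roughly 0.75.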

Anthropic highlights that safeguards remain a core priority with the new model. It is said to match or exceed peers in safety audits, with low rates of deception or sycophancy and the lowest over-refusal tendencies among recent releases. The company said it added six new cybersecurity probes to detect potential misuse, accelerating defensive applications like vulnerability hunting in open-source code.

In coding, Opus 4.6 manages large repositories autonomously, conducts code reviews and debugs with high accuracy. It also assembles agent teams for parallel development via Claude Code's research preview. For business workflows, it runs financial analyses, generates documents and handles multi-step searches in tools like Claude in Excel, now upgraded for unstructured data and long-running tasks. A research preview for Claude in PowerPoint extends this to presentations. In domains like computational biology, it delivers nearly double the performance of Opus 4.5, aiding scientific discovery.

Claude Opus 4.6 is available now via the website, mobile and desktop apps, Anthropic's application programming interface (API), and major cloud providers. API pricing starts at $5 (roughly Rs. 453) per million input tokens and $25 (roughly Rs. 2,300) per million output tokens, with premiums for extended contexts.
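As a quick illustration of what the base rates mean in practice, the Python sketch below estimates the cost of a single request from its token counts. It uses only the $5 and $25 per-million figures quoted above. The function name and the example token counts are made up for illustration, and the premium for extended contexts is not modelled because the rates are not given here.

```python
INPUT_USD_PER_MILLION = 5.0    # base input rate quoted in the article
OUTPUT_USD_PER_MILLION = 25.0  # base output rate quoted in the article


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost at base rates (extended-context premiums ignored)."""
    return (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    ) / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 200,000 input tokens and 4,000 output tokens.
    print(f"${estimate_cost_usd(200_000, 4_000):.2f}")  # prints $1.10
```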


Akash Dutta
Akash Dutta is a Chief Sub Editor at Gadgets 360. He is particularly interested in the social impact of technological developments and loves reading about emerging fields such as AI, metaverse, and fediverse. In his free time, he can be seen supporting his favourite football club - Chelsea, watching movies and anime, and sharing passionate opinions on food.

© Copyright Red Pixels Ventures Limited 2026. All rights reserved.