Claude Jailbroken by Chinese Hackers to Orchestrate First-of-Its-Kind AI Cyberattack

Anthropic said this is the first documented case of a large-scale cyberattack executed with minimal human intervention.

Advertisement
Written by Akash Dutta, Edited by Rohan Pal | Updated: 14 November 2025 16:47 IST
Highlights
  • The threat actors used jailbreaking techniques to manipulate Claude
  • The cyberattack targeted multiple large companies and government agencies
  • Anthropic said hackers completed 80-90 percent of attack using Claude

Anthropic banned the hackers’ accounts, notified the impacted entities, and coordinated with authorities

Photo Credit: Unsplash/Desola Lanre-Ologun

Claude was used for a large-scale agentic cyberattack in September, Anthropic admitted on Thursday. This attack was largely carried out by the artificial intelligence (AI) system with only minimal human intervention, making it the first-of-its-kind incident. The San Francisco-based AI firm claimed that the threat actor behind the operation was a Chinese state-sponsored group that targeted multiple large corporations and government agencies. Despite strict guardrails, the hackers were able to push Claude to perform the cyberattack by using jailbreaking techniques, the company stated.

The World's First Agentic AI-Driven Cyberattack Uses Anthropic's Claude

In a newsroom post, Anthropic made a startling disclosure that its large language model (LLM) platform, Claude Code, was manipulated by a Chinese state-sponsored adversary to carry out an agentic cyber-espionage campaign. The company shared the details of the case publicly to help stakeholders strengthen its cybersecurity measures and prepare for more such AI-driven attacks in the future.

The incident unfolded in mid-September 2025 when the threat actor “jail-broke” Claude by breaking its guardrails. They did this by decomposing their instructions into seemingly benign subtasks, presenting the model with the fake identity of a legitimate cybersecurity contractor. Once trust was established, Claude was used as an autonomous tool, scanning target networks, writing exploit code, harvesting credentials, extracting data and producing documentation of the hack. Humans were involved only at a handful of critical decision-points (estimated four to six per campaign).

Advertisement

The report indicates roughly 30 global targets across technology firms, financial institutions, chemical-manufacturing companies and government agencies. In some cases, infiltration succeeded. Crucially, the bulk of the work, around 80-90 percent, was undertaken by the AI model itself.

Advertisement

The distinguishing element here is the model's autonomous role. While previous cyber-incidents have involved AI in support of human hackers, this is the first documented case in which a model executed a large-scale operation with minimal human intervention. Anthropic highlighted that advanced models today have grown sophisticated enough to carry out such attacks, and the agentic ability to invoke external tools only multiplies this ability.

Anthropic warns that the lowering of barriers to entry for high-end cyberattacks is now real. Even less-resourced adversaries could now use agentic models to scale operations. The firm highlighted the need for improved detection systems, threat-sharing across industry and government, and strong safety controls built into AI platforms.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. OTT Releases of the Week: Dude, Nishaanchi, Jolly LLB 3, and More
  2. OnePlus 15R Confirmed to Launch Soon: Know Expected Features
  3. Oppo Find X9 Series Could Launch in India at This Price
  4. OnePlus 15 Review
  5. Google Could Release Gemini 3 Pro AI Model Alongside Nano Banana 2
  6. Oppo Reno 15 Series to Launch in These Storage Variants, Colourways
  7. ChatGPT Will Now Let You Create a Group Chat With Your Friends
  8. Itel Launches 128GB Variant of Itel A90 Limited Edition in India
  1. Google Expands Native Call Recording to Older Pixel Phones With Latest Update
  2. Google DeepMind Introduces SIMA 2, a Gemini-Powered AI Agent That Can Play Video Games
  3. Vivo S50 Series Tipped to Launch Next Month With a Snapdragon Chip
  4. Qualcomm Unveils Dragonwing IQ-X Series Industrial Chipsets, Supports AI Workflows for Smart Industries
  5. Vivo X300 Series Specs Confirmed, India-Exclusive Red Colour Teased
  6. Scammers Exploit Australia’s Cybercrime Portal to Impersonate Police and Steal Crypto, AFP Warns
  7. Ubisoft Delays Earnings Release on Due Date, Requests Trading of Its Shares Be Halted
  8. Claude Jailbroken by Chinese Hackers to Orchestrate First-of-Its-Kind AI Cyberattack
  9. Oppo Reno 15 Series Storage Variants, Colourways Revealed Ahead of China Launch
  10. Centre Notifies DPDP Rules 2025, RTI Amendment 2025 Comes Into Force
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.