Anthropic Thwarts Hacker Attempts to Misuse Claude AI for Cybercrime

Anthropic's report said its internal systems had stopped the attacks and it was sharing the case studies to help others understand the risks.

Advertisement
By Reuters | Updated: 28 August 2025 18:19 IST
Highlights
  • Anthropic said it had banned the accounts involved, tightened its filters
  • AI tools are being increasingly exploited in cybercrime
  • Attackers attempted to use Claude to produce harmful content

Anthropic said it follows strict safety practices, including regular testing and outside reviews

Photo Credit: Anthropic

Anthropic said on Wednesday it had detected and blocked hackers attempting to misuse its Claude AI system to write phishing emails, create malicious code and circumvent safety filters.

The company's findings, published in a report, highlight growing concerns that AI tools are increasingly exploited in cybercrime, intensifying calls for tech firms and regulators to strengthen safeguards as the technology spreads.

Advertisement

Anthropic's report said its internal systems had stopped the attacks and it was sharing the case studies - showing how attackers had attempted to use Claude to produce harmful content - to help others understand the risks.

The report cited attempts to use Claude to draft tailored phishing emails, write or fix snippets of malicious code and sidestep safeguards through repeated prompting.

Advertisement

It also described efforts to script influence campaigns by generating persuasive posts at scale and helping low-skill hackers with step-by-step instructions.

The company, backed by Amazon.com and Alphabet, did not publish technical indicators such as IPs or prompts, but said it had banned the accounts involved and tightened its filters after detecting the activity.

Advertisement

Experts say criminals are increasingly turning to AI to make scams more convincing and to speed up hacking attempts. These tools can help write realistic phishing messages, automate parts of malware development and even potentially assist in planning attacks.

Security researchers warn that as AI models become more powerful, the risk of misuse will grow unless companies and governments act quickly.

Advertisement

Anthropic said it follows strict safety practices, including regular testing and outside reviews, and plans to keep publishing reports when it finds major threats.

Microsoft and SoftBank-backed OpenAI and Google have faced similar scrutiny over fears their AI models could be exploited for hacking or scams, prompting calls for stronger safeguards.

Governments are also moving to regulate the technology, with the European Union moving forward with its Artificial Intelligence Act and the United States pushing for voluntary safety commitments from major developers.

© Thomson Reuters 2025

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Further reading: Anthropic, Claude, AI, Cyberattack, Cybercrime
Advertisement

Related Stories

Popular Mobile Brands
  1. DJI Osmo Pocket 4 Debuts With 1-inch CMOS Sensor, Improved Stabilisation
  2. Best Mobiles Under Rs. 40,000 in India
  3. OnePlus Pad 4 to Launch in India With a 13,380mAh Battery on This Date
  4. Intel Launches Core Series 3 Processors With Up to 40 TOPS AI Compute
  5. YouTube Finally Lets You Turn Off Shorts From Your Feed With This Setting
  6. Huawei Watch Fit 5, Watch Fit 5 Pro Price, Specifications Leaked
  7. Oppo Find X10 Key Specifications Leak as Find X9 Ultra Launch Nears
  8. Vivo X300 Ultra Price Leaked: Here's How Much It Might Cost in Europe
  9. Samsung Galaxy A27 Renders Hint at This Notable Change to Its Display
  1. Apple Marketing Chief for Watch, AirPods, Home and Health Retires After 31 Years
  2. Huawei Watch Fit 5, Watch Fit 5 Pro Price and Features Leak Online Ahead of Anticipated Launch
  3. Samsung Galaxy A27 Renders Indicate a Hole Punch Display Cutout Is Finally Coming; Triple Rear Cameras Expected
  4. DJI Osmo Pocket 4 Launched With 1-Inch CMOS Sensor, Improved Gimbal Stabilisation: Price, Specifications
  5. Intel Core Series 3 Processors Launched With Xe3 GPU, 40 TOPS AI Compute: Availability, Specifications
  6. OnePlus Nord CE 6 Lite Appears on Geekbench With Dimensity 7400 Chip, Android 16
  7. Meta’s Planned Facial Recognition Feature for Smart Glasses Faces Opposition From Privacy Orgs
  8. Vivo X300 Ultra Pricing Surfaces Online via Retail Listing in Europe
  9. YouTube's New Option Lets Users Effectively Turn Off Shorts From Their Feed
  10. South Korea Plans Blockchain-Based Payments for Government Spending
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.