OpenAI Says Prompt Injections a Challenge for AI Browsers, Builds an Attacker to Train ChatGPT Atlas

OpenAI says prompt injections remain a key risk for AI browsers and is using an AI attacker to train ChatGPT Atlas.

Advertisement
Written by Akash Dutta, Edited by Ketan Pratap | Updated: 23 December 2025 12:31 IST
Highlights
  • OpenAI flags prompt injections as an ongoing security issue
  • ChatGPT Atlas is trained using an AI-powered attacker
  • The company says the battle against this attack will be long-term

The AI attacker will help ChatGPT Atlas learn to tackle evolving prompt injection techniques

Photo Credit: Unsplash/Glenn Carstens-Peters

OpenAI called prompt injections “one of the most significant risks” and a “long-term AI security challenge” for artificial intelligence (AI) browsers with agentic capabilities, on Monday. The San Francisco-based AI giant highlighted how the cyberattack technique impacts its ChatGPT Atlas browser and shared a new approach to tackle it. The company is using an AI-powered attacker that simulates real-world prompt injection attempts to train browsers. OpenAI said the goal is not to eliminate the threat, but to continuously harden the system as new attack patterns emerge.

OpenAI Is Using AI to Fight Against Prompt Injections

Prompt injection is a technique where an attacker hides instructions using HTML tricks, such as zero font, white-on-white text, or out-of-margin text. This is hidden within normal-looking content that an AI agent is meant to read, such as a webpage, document, or snippet of text. When it processes that content, it may mistakenly treat the hidden instruction as a legitimate command, even though it was not issued by the user. It can then carry out malicious acts due to the access privilege of the AI browser.

In a post, OpenAI explained that prompt injections can be direct, where an attacker clearly tries to override the model's instructions, or indirect, where malicious prompts are embedded inside otherwise normal content. Because ChatGPT Atlas reads and reasons over third-party webpages, it may encounter instructions that were never intended for it but are crafted to influence its behaviour.

Advertisement

To address this, the AI giant has built an automated AI attacker, effectively a system that continuously generates new prompt injection attempts as a simulation. This attacker is used during training and evaluation to stress-test Atlas, exposing weaknesses before they are exploited outside the lab. OpenAI said this allows its teams to identify vulnerabilities faster and update defences more frequently than relying on manual testing alone.

Advertisement

“Prompt injection, like scams and social engineering, is not something we expect to ever fully solve,” OpenAI wrote in the post, adding that the challenge evolves as AI systems become more capable, gaining more permissions and the ability to take more actions. Instead, the company is focusing on layered defences, combining automated attacks, reinforcement learning and policy enforcement to reduce the impact of malicious instructions.

The company said its AI attacker helps create a rapid feedback loop, where new forms of prompt injection discovered by the system can be used to immediately retrain and adjust Atlas. This mirrors how security teams respond to evolving threats on the web, where attackers constantly adapt to new safeguards.

Advertisement

OpenAI did not claim that Atlas is immune to prompt injections. Instead, it framed the work as part of an ongoing effort to keep pace with a problem that changes alongside the technology itself. As AI browsers become more capable and more widely used, the company said sustained investment in automated testing and defensive training will be necessary to limit abuse.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. OTT Releases of the Week: The Raja Saab, Kis Kisko Pyaar Karoon 2, Parasakthi, and More
  2. Claude Opus 4.6 vs GPT-5.3-Codex: Best Agentic Coding AI Model in 2026
  3. Brave Ark 2-in-1 Android PC With Snapdragon 8s Gen 3 Launched in India
  4. Poco X8 Pro Series Price, Colours Inadvertently Listed on Xiaomi's Website
  5. GPT-5.3-Codex Arrives as OpenAI's First AI Model That Helped Build Itself
  6. Samsung Galaxy S26 Ultra 3D Render Offers a 360-Degree Look at Its Design
  7. Google Pixel 10a Spotted in Leaked Images in These Four Colour Options
  1. WhatsApp Will Soon Let You Add a 'Close Friends' Status, Just Like Instagram: Report
  2. Poco X8 Pro Series Price, Colourways Inadvertently Listed on Xiaomi Website in Europe: Expected Specifications
  3. Itel A100 Confirmed to Launch in India Soon; Colourways, Battery Capacity and Durability Teased
  4. Google's February 2026 Discover Core Update to Focus on Local Content, Reduce Clickbait
  5. Apple Eyes Retail Expansion in India, New Job Listings Hint at Apple Store in Hyderabad
  6. After The Last of Us, HBO Is Adapting Baldur's Gate 3 for TV With Craig Mazin as Creator
  7. Oppo Find N6 China Launch Timeline, Durability Improvements Teased: Expected Features, Specifications
  8. GPT-5.3-Codex Released as OpenAI’s First AI Model to Assist in Its Own Development
  9. Unfamiliar Now Streaming on Netflix: Everything You Need to Know About Plot, Cast, and More
  10. Lava Yuva Star 3 Will Reportedly Launch in India Soon With Redesigned Camera Module
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.