OpenAI Releases Operator AI Agent in Preview, Can Independently Perform Tasks on the Web

Operator is powered by Computer-Using Agent (CUA), an agentic model with GPT-4o’s vision capabilities.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 24 January 2025 14:19 IST
Highlights
  • Operator is currently available to ChatGPT Pro users in the US
  • OpenAI plans to launch the AI agent to more subscription tiers eventually
  • CUA can interact with graphical user interfaces (GUIs)
OpenAI Releases Operator AI Agent in Preview, Can Independently Perform Tasks on the Web

OpenAI’s Operator comes with its own dedicated web browser

Photo Credit: Unsplash/Levart_Photographer

OpenAI released its first artificial intelligence (AI) agent, Operator, on Thursday. Currently available as a research preview, the agent comes with a dedicated web browser. It is a general-purpose AI agent that can autonomously perform tasks online based on prompts given by the user. The AI firm said the tool can be used to book tickets online, reserve a table in a restaurant, or buy a product online. Currently, Operator is only available in the US to ChatGPT Pro subscribers, but the company plans to expand it to other subscription tiers in the future.

OpenAI Introduces Operator AI Agent

In a live stream, OpenAI CEO Sam Altman introduced the company's first AI agent. Explaining what agents are, Altman said, “AI agents are AI systems that do work for you independently. You give them a task, and they go off and do it. We think it will be a big trend in AI.”

The Operator AI agent interface
Photo Credit: OpenAI

 

Operator is powered by the Computer-Using Agent (CUA), an AI model that combines vision capabilities from GPT-4o with advanced reasoning, an OpenAI blog post explained. The AI agent was post-trained using reinforcement learning. It can interact with graphical user interfaces (GUIs) including buttons, menus, and text fields on the screen. With its dedicated browser, the agent can perform tasks behind the scenes while freeing up the screen for the user.

Advertisement

The AI agent accepts both text and images as input. To complete tasks, the CUA processes raw pixel data of the screen and uses a virtual keyboard and mouse to execute actions. OpenAI claims it can navigate multi-step tasks, handle errors, and can also adapt to unexpected changes.

Advertisement

Use Cases of the Operator AI Agent

Rowan Cheung, founder of the AI newsletter The Rundown AI, had early access to Operator and highlighted some of its use cases in a series of posts on X (formerly known as Twitter). The AI agent was able to plan a weekend trip based on advice from Reddit, a specific budget, and interests. Interestingly, when the agent was blocked from accessing Reddit, it completed the task by running a Bing search with Reddit as a keyword.

In another instance, Cheung asked the Operator to find cryptocurrency tokens worth looking into. During its research, the agent got stuck on an “Are you human” CAPTCHA and immediately pinged the user to take control to confirm. Once Cheung confirmed, the AI agent took control and continued with the task.

Advertisement

The AI agent can seamlessly allow the user to jump in and take control at any given time and edit or change the task. Once the user is done, they can also give the control back to the agent. This ensures that the user has control over the AI agent at all times.

OpenAI also stated that it is collaborating with companies such as DoorDash, eBay, Instacart, and Uber to ensure that Operator respects the terms of service agreements of these businesses while accessing the platforms.

Operator's Safety Risks and Mitigation

Coming to safety, the AI firm claimed that it has run extensive safety testing and has implemented mitigations against three safety classes — misuse, model mistakes, and frontier risks.

To reduce the risk of misuse, OpenAI has trained the CUA model to refuse harmful tasks and illegal or regulated activities. The company has also blocked gambling, adult entertainment, as well as drug and gun retailer websites. In addition, the company has also implemented automated and human-based reviews of user interactions.

For model mistakes or hallucinations, the AI agent is trained to ask for user confirmation before finalising tasks with external side effects. The CUA also declines to help with tasks such as banking transactions and while accessing sensitive websites, the agent requires active user supervision.

Frontier risks are the unexpected actions taken by a state-of-the-art AI model as it is generally not tested exhaustively. OpenAI said the CUA model has been evaluated against its Preparedness Framework, and the Operator System Card provides full details into the safety approach and ongoing improvements.

Currently, Operator is only available via the operator.chatgpt.com URL to ChatGPT Pro subscribers in the US. The company has stated that it plans to integrate the AI agent with all ChatGPT clients in the future. Notably, a ChatGPT Pro subscription is priced at $200 (roughly Rs. 17,200) a month.

 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. OTT Releases This Week: Pattth, Stolen, Jaat, Bhool Chuk Maaf, and More
  2. Motorola Edge 60 Will Launch in India on This Date
  3. iQOO Z10 Lite 5G India Launch Date, Design and Battery Size Confirmed
  4. Realme GT 7 and GT 7T Review
  5. iPhone 17 May Support 50W Wireless Charging With New MagSafe Chargers
  6. Huawei Band 10 With Up to 14 Days Battery Life Launched in India: See Price
  7. Vivo Y-Series Phone With Curved Display Tipped to Launch Soon in India
  8. Oppo K13x 5G India Launch Teased; to Go on Sale via Flipkart
  1. Xiaomi Smart Band 10 Leaked Marketing Images Suggest Design and Key Features
  2. 'We're Not Done Yet': CD Projekt Red Confirms Cyberpunk 2077 Is Getting Another Update Later This Month
  3. Microsoft Introduces Copilot Shopping With Native Checkout Capability in App
  4. Vivo Y-Series Smartphone With Curved Display Said to Launch in India; Colour Options Leaked
  5. Uber Reportedly Exploring Stablecoin Adoption to Cut Cross-Border Transfer Costs
  6. Tecno Pova 7 Neo 4G Design Spotted in Leaked Hands-On Images; Key Features Surface Online
  7. PhonePe to Launch UPI Payments App for Feature Phones With P2P Transfers, Offline QR Payments
  8. Huawei Mate XT 2 Tipped to Launch in H2 2025 With Upgraded Chipset, Cameras
  9. EA Sports FC 25, FBC: Firebreak and More Join Xbox Game Pass in June
  10. Razer Phantom Collection with Chroma RGB, Dynamic Lighting Support Launched in India: Check Price, Features
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.