Google DeepMind Introduces SIMA 2, a Gemini-Powered AI Agent That Can Play Video Games

Google DeepMind’s SIMA 2 AI agent is the successor to SIMA, which was released in March 2024.

Written by Akash Dutta, Edited by Ketan Pratap | Updated: 14 November 2025 18:39 IST
Highlights
  • SIMA stands for Scalable Instructable Multiworld Agent
  • Google says SIMA 2 can reason about its goals
  • The AI agent can also improve itself over time

SIMA 2 can also interact with the user and explain what it intends to do

Photo Credit: Google

Google DeepMind introduced Scalable Instructable Multiworld Agent (SIMA) 2, an artificial intelligence (AI) agent, on Thursday. It is the successor to SIMA, which was unveiled in March 2024, and comes with several improvements over it. SIMA 2 is powered by Gemini models and can now think about its actions, reason about them, and even interact with the user via a text interface. The core functionality remains the same: it is designed to play 3D open-world video games, but it now does so more effectively. The company says SIMA 2 also improves over time, learning from its experiences.

SIMA 2 Can Now Reason, Interact, and Play Games Better

In a blog post, Google DeepMind introduced and detailed the SIMA 2 AI agent. Powered by Gemini, it is not only able to execute tasks given by humans but also understand what is being asked, reason about the environment, and plan its next steps accordingly.

The system ingests visual input (the game screen or virtual world imagery) and a human-issued goal (for example: “build a shelter” or “find the red house”). The agent then interprets that goal, breaks it down into intermediate actions, and performs them via keyboard- and mouse-style outputs.
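The flow described above (screen frame plus goal in, intermediate steps, then keyboard- and mouse-style outputs) can be sketched in a few lines of Python. This is only an illustrative toy, not DeepMind's implementation: the names `Action`, `run_agent_step`, `toy_planner` and `toy_controller` are hypothetical, and the planner here is a hard-coded stand-in for the Gemini-based reasoning the article describes.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    """A keyboard/mouse-style output (hypothetical representation)."""
    device: str   # "keyboard" or "mouse"
    command: str  # e.g. "hold W" or "pan right"


def run_agent_step(
    frame: str,
    goal: str,
    plan_fn: Callable[[str, str], List[str]],
    act_fn: Callable[[str], Action],
) -> List[Action]:
    """One pass: (frame, goal) -> intermediate steps -> low-level actions."""
    steps = plan_fn(frame, goal)          # interpret the goal, plan steps
    return [act_fn(step) for step in steps]  # map each step to an input


def toy_planner(frame: str, goal: str) -> List[str]:
    """Assumption: a hard-coded stand-in for the model's reasoning."""
    if goal == "find the red house":
        return ["look around", "walk toward red house"]
    return ["wait"]


def toy_controller(step: str) -> Action:
    """Assumption: a trivial mapping from a step to an input command."""
    if step.startswith("look"):
        return Action("mouse", "pan right")
    if step.startswith("walk"):
        return Action("keyboard", "hold W")
    return Action("keyboard", "no-op")


actions = run_agent_step("frame_0", "find the red house",
                         toy_planner, toy_controller)
```

In this toy, the only inputs are what a human player would also have: what is on screen and what they were asked to do. The outputs are ordinary input-device commands rather than calls into a game's internals.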

One of the biggest improvements in SIMA 2 is its ability to familiarise itself with new games and environments it has not been trained on. The system was evaluated in previously unseen games, such as MineDojo (a research version of Minecraft) and ASKA, a Viking survival game, and achieved better success rates compared with its predecessor.

It also accepts multimodal prompts (sketches, emojis, different languages) and can transfer concepts. For instance, it may learn “mining” in one game and apply its learnings to the notion of “harvesting” in another, without having to start from zero.

As for the training setup, SIMA 2's dataset combines human demonstration data with annotations auto-generated by Gemini. Additionally, whenever the agent learns a new motion or skill in a novel environment, that data is collected and fed back to train subsequent generations of the agent. DeepMind says this reduces reliance on human-labelled data and allows SIMA 2 to continue improving from its own play.
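The feedback loop described above can be pictured with a small Python sketch. It is a simplified illustration under stated assumptions, not DeepMind's training pipeline: the `Episode` record and the `next_generation_dataset` function are hypothetical, and filtering self-play data by task success is an assumption made here to stand in for "learning a new skill".

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Episode:
    """One recorded play session (hypothetical record format)."""
    source: str      # "human_demo" or "self_play"
    task: str
    succeeded: bool


def next_generation_dataset(
    current: List[Episode], self_play: List[Episode]
) -> List[Episode]:
    """Keep the existing data and add self-play episodes that succeeded.

    Assumption: success is used as the signal that a skill was learned.
    """
    new_skills = [ep for ep in self_play if ep.succeeded]
    return current + new_skills


human_data = [Episode("human_demo", "chop tree", True)]
own_play = [
    Episode("self_play", "harvest crops", True),
    Episode("self_play", "build a shelter", False),
]
dataset = next_generation_dataset(human_data, own_play)
```

Each round, the share of the dataset that comes from the agent's own play grows, which is the sense in which reliance on human-labelled data is said to fall.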

SIMA 2 still has limitations: the model's memory of past interactions remains constrained, very long-horizon reasoning (many steps ahead) remains challenging, and precise low-level actions (such as robot-style joint control) are not addressed within this game-world framework.

Despite its prowess in video games, SIMA 2 is not being developed to become a gaming assistant. DeepMind believes that by training and testing the agent in unique 3D worlds, the learnings can then be applied to embodied AI, which powers robots that work in the real world. Ultimately, the goal is to create a general-purpose robot capable of handling multiple tasks and controllable via natural language instructions.

 
