Google DeepMind Unveils Genie 2 AI Model, Can Generate Playable 3D Worlds to Train AI Agents

Google said these action-controllable, playable 3D environments can be played by humans or AI agents.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 5 December 2024 19:22 IST
Highlights
  • The new AI model is the successor to Genie which was unveiled in February
  • Google DeepMind’s Genie 2 accepts images as an input
  • The tech giant describes Genie 2 as an AI “world model”

Google says Genie 2 can generate consistent worlds for up to a minute

Photo Credit: Google

Google DeepMind unveiled the successor to the Genie artificial intelligence (AI) model, which could generate endless 2D game worlds, on Wednesday. Dubbed Genie 2, the new AI model is capable of generating unique action-controllable, playable 3D environments based on a single image prompt. Calling Genie 2 an AI “world model”, the company stated that it can generate up to minute-long environments with consistent objects. The company said these generated worlds could be played by humans or can be used to train AI agents.

Google DeepMind Unveils Genie 2 AI Model

In a blog post, the company detailed the new AI model and its capabilities. While its predecessor could only generate game worlds for 2D platformer games, the Genie 2 AI model can generate 3D worlds complete with consistent models that can be interacted with. This means humans or AI agents can walk, run, swim, climb, and perform more actions in these environments.

Genie 2's generative capabilities allow it to generate routes, buildings, and objects that cannot be seen in the input image. These elements are designed and rendered by the model from scratch. Additionally, the foundation model is also capable of maintaining consistency in these environments. This means even when a player moves away from one area and returns back, the environments remain the same.

Advertisement

Apart from this, Genie 2 is capable of generating different perspectives such as first-person views, isometric views, or third-person views. Further, users can also interact with the objects in the generated worlds and can perform actions such as opening a door, bursting a balloon, or climbing a ladder. The model can also be prompted to generate physics-related effects such as water ripples, smoke, gravity, directional lighting, reflections, and more.

Advertisement

Coming to the technical details, DeepMind explained that Genie 2 is an autoregressive latent diffusion model and has been trained on a large video dataset. The transformer architecture also includes an autoencoder which enables frame-by-frame generation of these worlds.

Notably, DeepMind also released an AI model dubbed Scalable Instructable Multiworld Agent or SIMA earlier this year, which is essentially capable of agentic AI functions in 3D worlds. The company says Genie 2 is capable of providing unique environments to similar AI agents and training them for various real-life scenarios.

Advertisement

Since the world model can generate unique environments, Google says this will eliminate the risk of data contamination and will allow developers to correctly assess an AI agent's capabilities.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Further reading: Google, Genie 2, AI, Artificial Intelligence, 3D
Advertisement

Related Stories

Popular Mobile Brands
  1. Motorola Edge 70 Ultra Camera Configuration, Other Key Features Leaked
  2. Hogwarts Legacy Is Currently Free on Epic Games Store: How to Redeem
  3. The Game Awards 2025: See the Full List of Winners
  4. Dominic and the Ladies' Purse OTT Release Date: When and Where to Watch it Online?
  5. WhatsApp Brings a Voicemail-like Feature for Missed Voice and Video Calls
  6. Tomb Raider, Star Wars, Divinity: Everything Announced at The Game Awards
  7. Realme Narzo 90 Series Price in India Leaked; Will Come in These Colourways
  8. Webb Telescope Confirms the Oldest Known Supernova in the Universe
  9. Star's Wobble Around Black Hole Confirms Einstein's Century-Old Prediction
  1. Astronomers Observe Star’s Wobbling Orbit, Confirming Einstein’s Frame-Dragging
  2. Galaxy Collisions Found to Activate Supermassive Black Holes, Euclid Data Shows
  3. JWST Detects Oldest Supernova Ever Seen, Linked to GRB 250314A
  4. Chandra’s New X-Ray Mapping Exposes the Invisible Engines Powering Galaxy Clusters
  5. Blue Origin to Fly First Wheelchair User to Space on New Shepard NS-37
  6. Chandra’s New X-Ray Mapping Exposes the Invisible Engines Powering Galaxy Clusters
  7. Sasivadane Now Streaming on Amazon Prime Video: Everything You Need to Know
  8. Kuttram Purindhavan Now Streaming Online: What You Need to Know?
  9. Lyne Lancer 19 Pro With 2.01-Inch Display, SpO2 Monitoring Launched in India
  10. OpenAI and Disney Reach Licensing Agreement to Bring Its Characters to the Sora App
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.