Google DeepMind Unveils Gemini Robotics 1.5 AI Models to Power General-Purpose Robots

Google DeepMind introduced the Gemini Robotics-ER 1.5 and Gemini Robotics 1.5 AI models.

Written by Akash Dutta, Edited by Rohan Pal | Updated: 26 September 2025 15:22 IST
Highlights
  • Both of these models are built on the core Gemini family of models
  • Gemini Robotics-ER 1.5 is capable of reasoning and acts as the orchestrator
  • Gemini Robotics 1.5 is a vision-language-action (VLA) model

Gemini Robotics-ER 1.5 is available to developers via the Gemini API in Google AI Studio

Photo Credit: Google

Google DeepMind on Thursday unveiled two new artificial intelligence (AI) models in the Gemini Robotics family. Dubbed Gemini Robotics-ER 1.5 and Gemini Robotics 1.5, the two models work in tandem to power general-purpose robots. Compared with the embodied AI models previously created by the Mountain View-based tech giant, these models are said to offer stronger reasoning, vision, and action capabilities across various real-world scenarios. The ER 1.5 model is designed to be the planner or orchestrator, whereas the 1.5 model performs tasks based on natural language instructions.

Google DeepMind's Gemini AI Models Can Act as the Brain of a Robot

In a blog post, DeepMind introduced and detailed the two new Gemini Robotics models, which are designed for general-purpose robots operating in the physical world. Generative AI technology has brought about a major breakthrough in robotics, replacing the traditional interfaces used to communicate with a robot with natural language instructions.

However, when it comes to implementing AI models as the brain of a robot, many challenges remain. For instance, large language models on their own struggle to understand spatial and temporal dimensions or to make precise movements for objects of different shapes. The issue arises because a single AI model both devises the plan and executes it, making the process error-prone and laggy.


Google's solution to this problem is a two-model setup. Here, the Gemini Robotics-ER 1.5, a vision-language model (VLM), comes with advanced reasoning and tool-calling capabilities. It can create multi-step plans for a task. The company says the model excels in making logical decisions within physical environments, and can natively call tools like Google Search to search for information. It is also said to achieve state-of-the-art (SOTA) performance on various spatial understanding benchmarks.


After the plan has been created, the Gemini Robotics 1.5 springs into action. The vision-language-action (VLA) model can turn visual information and instructions into motor commands, enabling a robot to perform tasks. The model first thinks and creates the most efficient path towards completing an action, and then executes it. It can also explain its thinking process in natural language to bring more transparency.

Google claims this system will allow robots to better understand complex, multi-step commands and then execute them in a single flow. For instance, if a user asks a robot to sort multiple objects into the correct compost, recycling, and trash bins, the AI system can first search the Internet for local recycling guidelines, analyse the objects in front of it, make a plan to sort them, and execute the action.
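
The division of labour in the sorting example can be illustrated with a minimal Python sketch. This is not Google's implementation or the Gemini API: every name here (Step, plan_sorting, execute_step, run_task, fake_local_rules) is hypothetical, the rules lookup stands in for a tool call such as a web search, and the executor returns a text log where a real VLA model would issue motor commands. It only shows the idea of one component producing a multi-step plan and a second component carrying out each step.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Step:
    """One plan step: move a named object into a named bin."""
    obj: str
    bin: str


def plan_sorting(objects: List[str],
                 lookup_rules: Callable[[], Dict[str, str]]) -> List[Step]:
    """Hypothetical planner role: consult a tool, then emit a multi-step plan."""
    rules = lookup_rules()  # stand-in for a tool call such as a web search
    # Objects with no matching rule default to the trash bin (an assumption).
    return [Step(obj, rules.get(obj, "trash")) for obj in objects]


def execute_step(step: Step) -> str:
    """Hypothetical executor role: a real VLA model would emit motor commands."""
    return f"moved {step.obj} to {step.bin}"


def run_task(objects: List[str],
             lookup_rules: Callable[[], Dict[str, str]]) -> List[str]:
    """Plan first, then execute each step in order."""
    plan = plan_sorting(objects, lookup_rules)
    return [execute_step(step) for step in plan]


def fake_local_rules() -> Dict[str, str]:
    """Made-up recycling guidelines used only for this illustration."""
    return {"banana peel": "compost", "soda can": "recycling"}


log = run_task(["banana peel", "soda can", "chip bag"], fake_local_rules)
for line in log:
    print(line)
```

Keeping the planner and executor behind separate functions mirrors the article's point that planning and execution are handled by different models, so either side could be swapped out without changing the other.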


Notably, the tech giant says that the AI models were designed to work in robots of any shape and size due to their high spatial understanding and wider adaptability. Currently, the orchestrator Gemini Robotics-ER 1.5 is available to developers via the Gemini application programming interface (API) in Google AI Studio. The VLA model, on the other hand, is only available to select partners.

 
