MIT Unveils Novel Method of Training General-Purpose Robots Using Generative AI Techniques

With this method, MIT researchers will align data from varied domains into a shared language that AI models can process.

Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 4 November 2024 12:35 IST
Highlights
  • MIT says this could be faster and cheaper than traditional techniques
  • The method is also said to outperform training from scratch by more than 20 percent
  • Researchers drew inspiration from AI models such as GPT-4 to develop the technique

To unify data from different domains, MIT used the Heterogeneous Pretrained Transformers (HPT) architecture

Photo Credit: Unsplash/Andy Kelly

Massachusetts Institute of Technology (MIT) unveiled a new method to train robots last week that uses generative artificial intelligence (AI) models. The new technique relies on combining data across different domains and modalities and unifying them into a shared language which can then be processed by large language models (LLMs). MIT researchers claim that this method can give rise to general-purpose robots that can handle a wide range of tasks without needing to individually train each skill from scratch.

MIT Researchers Develop AI-Inspired Technique to Train Robots

In a newsroom post, MIT detailed the novel methodology to train robots. Currently, teaching a certain task to a robot is a difficult proposition as a large amount of simulation and real-world data is required. This is necessary because if the robot does not understand how to perform the task in a given environment, it will struggle to adapt to it.

This means that for every new task, new sets of data comprising every simulation and real-world scenario are needed. The robot then undergoes a training period where the actions are optimised and errors and glitches are removed. As a result, robots are generally trained on a specific task, and the multi-purpose robots seen in science fiction movies have not yet become a reality.


However, MIT researchers say a new technique they developed can bypass this challenge. In a paper published on the preprint server arXiv (note: it is not peer-reviewed), the scientists highlighted that generative AI can assist with this problem.


For this, data across different domains, such as simulations and real robots, and different modalities such as vision sensors and robotic arm position encoders, were unified into a shared language that can be processed by an AI model. A new architecture dubbed Heterogeneous Pretrained Transformers (HPT) was also developed to unify the data.

Interestingly, the lead author of the study, Lirui Wang, an electrical engineering and computer science (EECS) graduate student, said that the inspiration for this technique was drawn from AI models such as OpenAI's GPT-4.


The researchers placed a transformer, the type of neural network architecture that underlies GPT models, in the middle of their system, where it processes both vision and proprioception (sense of self-movement, force, and position) inputs.
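To make the idea concrete, here is a minimal Python sketch of how inputs from different modalities might be mapped into one shared token space and then processed together. It is an illustration under stated assumptions, not the researchers' implementation: the dimensions, the random projections standing in for learned encoders, the sample values, and every function name are hypothetical, and a single simplified self-attention step stands in for the full transformer.

```python
import math
import random

SHARED_DIM = 4  # hypothetical width of the shared token space


def make_projection(in_dim, out_dim, rng):
    """Random linear map standing in for a learned, modality-specific encoder."""
    return [[rng.uniform(-1, 1) for _ in range(in_dim)] for _ in range(out_dim)]


def project(matrix, vector):
    """Map one raw input vector into the shared token space."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Single-head self-attention with identity query/key/value maps.

    A simplified stand-in for the shared transformer: every token
    attends to every other token, whatever modality it came from.
    """
    scale = math.sqrt(len(tokens[0]))
    mixed = []
    for query in tokens:
        scores = [sum(a * b for a, b in zip(query, key)) / scale for key in tokens]
        weights = softmax(scores)
        mixed.append([
            sum(w * value[i] for w, value in zip(weights, tokens))
            for i in range(len(query))
        ])
    return mixed


rng = random.Random(0)
vision_proj = make_projection(6, SHARED_DIM, rng)   # assumes 6-dim image features
proprio_proj = make_projection(3, SHARED_DIM, rng)  # assumes 3 joint readings

# Made-up sample inputs: two image patches and one arm-position reading.
vision_patches = [
    [0.1, 0.5, 0.2, 0.9, 0.3, 0.7],
    [0.4, 0.4, 0.8, 0.1, 0.6, 0.2],
]
joint_angles = [0.0, 1.2, -0.5]

# Both modalities end up as tokens of the same width in one sequence.
tokens = [project(vision_proj, patch) for patch in vision_patches]
tokens.append(project(proprio_proj, joint_angles))

mixed = self_attention(tokens)
print(len(mixed), len(mixed[0]))  # 3 tokens, each SHARED_DIM wide
```

The point of the sketch is that once camera and position-encoder readings share the same token format, a single shared model can process them in one sequence, regardless of which robot or sensor produced them.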

The MIT researchers state that this new method could make training robots faster and less expensive than traditional methods. This is largely because less task-specific data is required to train the robot on various tasks. Further, the study found that this method outperformed training from scratch by more than 20 percent in both simulation and real-world experiments.

 
