MIT Unveils Novel Method of Training General-Purpose Robots Using Generative AI Techniques

With this method, MIT researchers will align data from varied domains into a shared language that AI models can process.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 4 November 2024 12:35 IST
Highlights
  • MIT says this could be faster and cheaper than traditional techniques
  • The robot training method is also said to be 20 percent more efficient
  • Researchers looked into GPT-4 architecture to develop the technique

To unify data from different domains, MIT used Heterogeneous Pretrained Transformers (HPT) architecture

Photo Credit: Unsplash/Andy Kelly

Massachusetts Institute of Technology (MIT) unveiled a new method to train robots last week that uses generative artificial intelligence (AI) models. The new technique relies on combining data across different domains and modalities and unifying them into a shared language which can then be processed by large language models (LLMs). MIT researchers claim that this method can give rise to general-purpose robots that can handle a wide range of tasks without needing to individually train each skill from scratch.

MIT Researchers Develop AI-Inspired Technique to Train Robots

In a newsroom post, MIT detailed the novel methodology to train robots. Currently, teaching a certain task to a robot is a difficult proposition as a large amount of simulation and real-world data is required. This is necessary because if the robot does not understand how to perform the task in a given environment, it will struggle to adapt to it.

This means for every new task, new sets of data comprising every simulation and real-world scenario are needed. The robot then undergoes a training period where the actions are optimised and errors and glitches are removed. As a result, robots are generally trained on a specific task, and those multi-purpose robots seen in science fiction movies, have not been seen in reality.

Advertisement

However, a new technique developed by researchers at MIT claims to bypass this challenge. In a paper published in the pre-print online journal arXIv (note: it is not peer-reviewed), the scientists highlighted that generative AI can assist with this problem.

Advertisement

For this, data across different domains, such as simulations and real robots, and different modalities such as vision sensors and robotic arm position encoders, were unified into a shared language that can be processed by an AI model. A new architecture dubbed Heterogeneous Pretrained Transformers (HPT) was also developed to unify the data.

Interestingly, the lead author of the study, Lirui Wang, an electrical engineering and computer science (EECS) graduate student, said that the inspiration for this technique was drawn from AI models such as OpenAI's GPT-4.

Advertisement

The researchers added an LLM model called a transformer (similar to the GPT architecture) in the middle of their system and it processes both vision and proprioception (sense of self-movement, force, and position) inputs.

The MIT researchers state that this new method could be faster and less expensive to train robots compared to the traditional methods. This is largely due to the lesser amount of task-specific data required to train the robot in various tasks. Further, the study found that this method outperformed training from scratch by more than 20 percent in both simulation and real-world experiments.

 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. OnePlus 15: Everything We Know Ahead of Its Upcoming Launch in China
  2. We Live in Time OTT Release: When, Where to Watch the Andrew Garfield Starrer
  3. How to Spot Comet SWAN During Its Close Flyby of Earth
  1. NASA Experiment Shows Martian Ice Could Preserve Signs of Ancient Life
  2. MIT Detects Traces of a Lost ‘Proto Earth’ Deep Beneath Our Planet’s Surface
  3. Astronomers Detect Heavy Water in Planet-Forming Disk Around Young Star
  4. Global Projects Aim to Save Sinking Cities From Rising Seas and Climate Change
  5. NASA Confirms Brightening Comet SWAN Could Be Visible With Binoculars: When and Where to See It
  6. We Live in Time OTT Release: When, Where to Watch the Andrew Garfield and Florence Pugh Romance
  7. Imbam Is Now Streaming Online: Know Everything About This Deepak Parambol Starrer Malayali Drama
  8. Mysterious Asteroid Impact Found in Australia, But the Crater is Missing
  9. Thanal Comes to OTT: Everything You Need to Know About This Tamil Action Thriller
  10. Madam Sengupta Is Now Streaming: Know Where to Watch This Bangla Crime Thriller
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.