MIT Unveils Novel Method of Training General-Purpose Robots Using Generative AI Techniques

With this method, MIT researchers will align data from varied domains into a shared language that AI models can process.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 4 November 2024 12:35 IST
Highlights
  • MIT says this could be faster and cheaper than traditional techniques
  • The robot training method is also said to be 20 percent more efficient
  • Researchers looked into GPT-4 architecture to develop the technique
MIT Unveils Novel Method of Training General-Purpose Robots Using Generative AI Techniques

To unify data from different domains, MIT used Heterogeneous Pretrained Transformers (HPT) architecture

Photo Credit: Unsplash/Andy Kelly

Massachusetts Institute of Technology (MIT) unveiled a new method to train robots last week that uses generative artificial intelligence (AI) models. The new technique relies on combining data across different domains and modalities and unifying them into a shared language which can then be processed by large language models (LLMs). MIT researchers claim that this method can give rise to general-purpose robots that can handle a wide range of tasks without needing to individually train each skill from scratch.

MIT Researchers Develop AI-Inspired Technique to Train Robots

In a newsroom post, MIT detailed the novel methodology to train robots. Currently, teaching a certain task to a robot is a difficult proposition as a large amount of simulation and real-world data is required. This is necessary because if the robot does not understand how to perform the task in a given environment, it will struggle to adapt to it.

This means for every new task, new sets of data comprising every simulation and real-world scenario are needed. The robot then undergoes a training period where the actions are optimised and errors and glitches are removed. As a result, robots are generally trained on a specific task, and those multi-purpose robots seen in science fiction movies, have not been seen in reality.

However, a new technique developed by researchers at MIT claims to bypass this challenge. In a paper published in the pre-print online journal arXIv (note: it is not peer-reviewed), the scientists highlighted that generative AI can assist with this problem.

Advertisement

For this, data across different domains, such as simulations and real robots, and different modalities such as vision sensors and robotic arm position encoders, were unified into a shared language that can be processed by an AI model. A new architecture dubbed Heterogeneous Pretrained Transformers (HPT) was also developed to unify the data.

Interestingly, the lead author of the study, Lirui Wang, an electrical engineering and computer science (EECS) graduate student, said that the inspiration for this technique was drawn from AI models such as OpenAI's GPT-4.

Advertisement

The researchers added an LLM model called a transformer (similar to the GPT architecture) in the middle of their system and it processes both vision and proprioception (sense of self-movement, force, and position) inputs.

The MIT researchers state that this new method could be faster and less expensive to train robots compared to the traditional methods. This is largely due to the lesser amount of task-specific data required to train the robot in various tasks. Further, the study found that this method outperformed training from scratch by more than 20 percent in both simulation and real-world experiments.

 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. Vivo T4 Ultra to Launch in India on This Date
  2. OnePlus 13s Set to Launch in India Tomorrow: Know Price, Specifications
  3. iOS 26 to Arrive With These Upgrades for Messages, Apple Music and CarPlay
  4. ChatGPT Will Now Reference Past Chats to Even Those on the Free Tier
  5. Pixel 10 Series Said to Offer Gimbal-Like Video Stabilisation
  6. iPhone 17, iPhone 17 Air Might Feature 120Hz Displays, But There's a Catch
  7. Apple Announces Design Awards 2025 Winners and Finalists: Check List
  1. Nizharkudai Now Streaming on Aha Tamil: What You Need to Know About Tamil Family Drama
  2. Sony's State of Play Broadcast Announced for June 4: How to Watch, What to Expect
  3. OnePlus 13s India Launch Tomorrow: Expected Price, Specifications and How to Watch Livestream
  4. Apple Vision Pro to Get Native Support for PlayStation, Xbox and Spatial Controllers With visionOS 26: Report
  5. Qualcomm Fixes Zero-Day Security Vulnerabilities Used By Hackers, Cybercriminals
  6. OpenAI Is Rolling Out ChatGPT’s Memory Improvements to Free Users; Codex Gets Full Internet Access
  7. Motorola Razr 60 Now Available for Purchase in India: Price, Offers, Specifications
  8. Bitcoin Stabilises at Around $105,500, Most Altcoins See Minor Profits
  9. Apple Design Awards 2025 Winners Announced: CapWords, Speechify and Neva Bag Top Spots
  10. Honor Magic V5 Allegedly Listed on Geekbench, Suggesting Key Specifications 
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.