AgiBot Robotics Firm Open Sources Massive Dataset to Train Humanoid Robots

The AgiBot World Alpha dataset contains more than one million trajectories from 100 robots.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 31 December 2024 16:50 IST
Highlights
  • AgiBot World spans over 100 real-world scenarios on five target domains
  • The dataset is only available for non-commercial usage
  • AgiBot claimed that this dataset can help researchers in training robots

The dataset also includes fine-grained manipulation, tool usage, and multi-robot collaboration

Photo Credit: AgiBot

AgiBot, a Chinese artificial intelligence (AI) and robotics firm open-sourced a massive dataset containing high-quality data on training humanoid robots on Monday. Dubbed AgiBot World Alpha, the dataset is said to be collected from more than 100 robots in real-life scenarios. The company stated that this dataset can help researchers and developers accelerate the training process of humanoid robots by using AI models to feed this information to specific robotics software. Notably, the dataset is currently being hosted on both GitHub and Hugging Face.

Massive Training Dataset for Humanoid Robots Released

In a press release, the company announced its decision to release AgiBot World. It is said to be a large-scale robotic learning dataset designed for multi-purpose humanoid robots. Apart from the dataset, the open-sourced system also includes foundational models, standardised benchmarks, and a framework to help researchers access the data.

Advertisement

With the rise of generative AI, the robotics space has also witnessed a significant boost. While humanoid robotics hardware has existed for a long time, training these machines for tasks has been complicated. This is because the intelligent software that acts as the mind of the robot has to learn and understand different scenarios and how to navigate through them. This includes learning thousands of movements and combinations of movements and understanding when to apply which movement.

Due to this, the training process used to be very slow and usually focused towards one specialised task instead of general-purpose movements. However, generative AI has given researchers the option to make the software more intelligent by using neural frameworks. This allows robots to understand the context of a situation and solve it by processing a large volume of information in near real-time.

Advertisement

But this growth has also highlighted another gap in the robotics space — the lack of high-quality data. The training process of robots typically takes place in controlled environments and in isolated areas to allow researchers to monitor the robots and make required changes. Due to this, training data involving real-world scenarios is scarce.

The AgiBot World dataset fills this important gap. The company claimed that the open-source dataset includes more than one million trajectories from 100 robots. It also spans more than 100 real-world scenarios across five target domains. It also includes complex movements such as fine-grained manipulation, tool usage, and multi-robot collaboration.

Advertisement

This dataset can be accessed from either AgiBot's GitHub listing or its Hugging Face page. However, the dataset is only available under the Creative Commons CC BY-NC-SA 4.0 licence, which allows for academic and research-related usage but does not permit commercial use cases.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement
Popular Mobile Brands
  1. OpenAI and Amazon Announce a Multi-Year Strategic Partnership on AI
  2. YouTube's 'Ask YouTube' AI Chatbot Offers Smart Replies With Videos, Shorts
  3. Qualcomm Rises on Smartphone Rebound Hopes, Data-Centre Chip Push
  4. Anthropic's New Connectors Will Make Claude More Creative
  1. Intel's 'Wildcat Lake' Processor Shows Up on Benchmark Database, Could Rival A18 Pro Chip on MacBook Neo
  2. Qualcomm Rises on Smartphone Rebound Hopes, Data-Centre Chip Push
  3. Gemini Now Lets Users Generate and Export Files With Support for PDF, Word Formats
  4. Memory Component to Reportedly Account for 45 Percent Value of an iPhone Due to Supply Chain Issues
  5. AirDrop via Quick Share Reportedly Expands to Oppo Find X9 Ultra, Vivo X300 Ultra
  6. OpenAI, Amazon Announce Multi-Year Strategic Partnership as Microsoft’s Exclusive Deal Ends
  7. US Judge Rejects Former FTX CEO Sam Bankman-Fried’s Bid for New Trial
  8. Valve Says It's 'Hard at Work' on Steam Deck 2
  9. OnePlus Nord CE 6, Nord CE 6 Lite Availability Details Announced Ahead of May 7 Launch Date
  10. Smartphone Buyers in India Prioritise AI and Real-World Usage, Flipkart Report Shows
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.