AgiBot Robotics Firm Open Sources Massive Dataset to Train Humanoid Robots

The AgiBot World Alpha dataset contains more than one million trajectories from 100 robots.

Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 31 December 2024 16:50 IST
Highlights
  • AgiBot World spans over 100 real-world scenarios across five target domains
  • The dataset is only available for non-commercial usage
  • AgiBot claimed that this dataset can help researchers in training robots

The dataset also includes fine-grained manipulation, tool usage, and multi-robot collaboration

Photo Credit: AgiBot

On Monday, AgiBot, a Chinese artificial intelligence (AI) and robotics firm, open-sourced a massive dataset containing high-quality data for training humanoid robots. Dubbed AgiBot World Alpha, the dataset is said to have been collected from more than 100 robots in real-life scenarios. The company stated that the dataset can help researchers and developers accelerate the training of humanoid robots by using AI models to feed this data into robotics software. Notably, the dataset is hosted on both GitHub and Hugging Face.

Massive Training Dataset for Humanoid Robots Released

In a press release, the company announced its decision to release AgiBot World. It is said to be a large-scale robotic learning dataset designed for multi-purpose humanoid robots. Apart from the dataset, the open-sourced system also includes foundational models, standardised benchmarks, and a framework to help researchers access the data.

With the rise of generative AI, the robotics space has also witnessed a significant boost. While humanoid robotics hardware has existed for a long time, training these machines for tasks has been complicated. This is because the intelligent software that acts as the mind of the robot has to learn and understand different scenarios and how to navigate through them. This includes learning thousands of movements and combinations of movements and understanding when to apply which movement.


Due to this, the training process used to be very slow and usually focused on a single specialised task instead of general-purpose movements. However, generative AI has given researchers the option to make the software more intelligent by using neural frameworks. This allows robots to understand the context of a situation and respond to it by processing a large volume of information in near real-time.


But this growth has also highlighted another gap in the robotics space: the lack of high-quality data. Robots are typically trained in controlled environments and isolated areas so that researchers can monitor them and make the required changes. Due to this, training data involving real-world scenarios is scarce.

The AgiBot World dataset fills this important gap. The company claimed that the open-source dataset includes more than one million trajectories from 100 robots, spanning more than 100 real-world scenarios across five target domains. It also covers complex tasks such as fine-grained manipulation, tool usage, and multi-robot collaboration.


This dataset can be accessed from either AgiBot's GitHub listing or its Hugging Face page. However, the dataset is only available under the Creative Commons CC BY-NC-SA 4.0 licence, which allows for academic and research-related usage but does not permit commercial use cases.
