Hugging Face Expands LeRobot Platform With Multimodal Dataset for AI-Powered Cars

The new dataset, dubbed Learning to Drive (L2D), was developed in partnership with the AI startup Yaak.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 12 March 2025 18:25 IST
Highlights
  • The L2D dataset is more than 1PB in size
  • L2D was collected from sensors installed on 60 EVs
  • The sensors include cameras, GPS, IMUs, and more

Hugging Face called L2D the world’s largest open-source multimodal dataset for spatial intelligence

Photo Credit: Hugging Face

Hugging Face announced the expansion of its LeRobot platform on Wednesday with a large dataset aimed at automotive automation. The online artificial intelligence (AI) and machine learning (ML) repository said that the dataset was created in collaboration with the AI startup Yaak. Dubbed Learning to Drive (L2D), the dataset was collected from a suite of sensors installed on 60 electric vehicles (EVs) over a period of three years. The open-source dataset is aimed at enabling developers and the robotics community to build spatial intelligence solutions for the automobile industry.

Hugging Face Adds L2D Dataset to LeRobot

In a blog post, the company detailed the new AI dataset, calling it “the world's largest multimodal dataset aimed at building an open-sourced spatial intelligence for the automotive domain.” The entire dataset is more than 1PB (one PetaByte) in size, and was collected using sensor suites installed on 60 EVs operated by driving schools in 30 German cities for three years. Identical sensors were used to ensure consistency in the data collected.

The LeRobot platform was launched last year as a collection of open-source AI models, datasets, and accompanying tools that can help developers build AI-powered robotics systems.

Advertisement

The Learning to Drive dataset
Photo Credit: Hugging Face

Advertisement

 

The policies in the dataset are divided into two groups of expert policies and student policies. The former is comprised of data from driving instructors while the latter comes from learner drivers. Hugging Face stated that the expert policy has zero driving mistakes and is considered optimal, whereas the student policy contains known sub-optimalities. Both groups include natural language instructions for driving tasks.

Advertisement

Each group features all driving scenarios that are necessary for completion to obtain a driving licence in the European Union (EU). Some of these driving tasks include overtaking, roundabout handling, and track driving.

Detailing the sensor suite used to capture the L2D data, Hugging Face said that each of the 60 Kia Niro EV models were equipped with six RGB cameras to capture the vehicle's surrounding in 360p, on-board GPS for vehicle location and mapping, an inertial measurement unit (IMU) to capture vehicle dynamics. All the data was captured with timestamps.

Advertisement

Notably, the dataset is aimed at helping developers and robotics scientists build end-to-end self-driving AI models that can eventually be used to build fully autonomous vehicle systems.

Hugging Face highlighted that the L2D dataset will be released in a phased manner, where each successive release will be a superset of the previous releases to ensure ease of access. The platform is also inviting the community to submit models for closed loop testing of the dataset with a safety driver. This will begin in summer 2025.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Bridgerton Season 4 Premieres in Two Parts on Netflix: See Details
  2. Sister Midnight Streaming Online: Everything You Need to Know
  3. All the Details About Kunal Khemu's Comedy Drama 'Single Papa'
  4. Scientists Track Glowing Green Comet 3I/ATLAS as It Nears Earth
  5. Nandamuri Balakrishna's Akhanda 2 Arrives on OTT in 2026
  1. Early Earth’s Deep Mantle May Have Held More Water Than Previously Believed, Study Finds
  2. Nandamuri Balakrishna's Akhanda 2 Arrives on OTT in 2026: When, Where to Watch the Film Online?
  3. Single Papa Now Streaming on OTT: All the Details About Kunal Khemu’s New Comedy Drama Series
  4. Scientists Study Ancient Interstellar Comet 3I/ATLAS, Seeking Clues to Early Star System Formation
  5. Bridgerton Season 4 to Release in Two Parts on OTT: When and Where to Watch It Online?
  6. Spider-Like Scar on Jupiter’s Moon Europa Could Indicate Subsurface Salty Water
  7. Wake Up Dead Man: A Knives Out Mystery Now Streaming on Netflix: Everything You Need to Know
  8. Secret Rain Pattern May Have Driven Long Spells of Dry and Wetter Periods Across Horn of Africa: Study
  9. Sister Midnight Out on OTT: Know Where to Watch This Radhika Apte-Starrer Online
  10. JWST Detects Thick Atmosphere on Ultra-Hot Rocky Exoplanet TOI-561 b
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.