Hugging Face Is Trying to Build a Fully Open-Source Version of DeepSeek-R1 AI Model

While DeepSeek-R1 model weights are available in the public domain, the datasets and code used to train the model are not.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 29 January 2025 14:14 IST
Highlights
  • Hugging Face is trying to replicate building DeepSeek-R1
  • The high-quality datasets might have played a big part in optimising R1
  • The method will help researchers understand the training methods

DeepSeek-R1 AI model has surpassed OpenAI’s o1 in several benchmarks

Photo Credit: Unsplash/Markus Winkler

Hugging Face announced a new initiative on Tuesday to build Open-R1, a fully open reproduction of the DeepSeek-R1 model. The hedge fund-backed Chinese AI firm released the DeepSeek-R1 artificial intelligence (AI) model in the public domain last week, sending shockwaves across Silicon Valley and NASDAQ. A big reason was that such an advanced and large-scale AI model, that could overtake OpenAI's o1 model, has not yet been released in open-source. However, the model was not fully open-source, and Hugging Face researchers are now trying to find the missing pieces.

Why Is Hugging Face Building Open-R1?

In a blog post, Hugging Face researchers detailed their reason behind replicating DeepSeek's famed AI model. Essentially, DeepSeek-R1 is what is known as a “black-box” release, meaning that the code and other assets needed to run the software are available however, the dataset as well as training code are not. This means anyone can download and run the AI model locally, but the information needed to replicate a model like it is not possible.

Some of the unreleased information includes the reasoning-specific datasets used to train the base model, the training code used to create the hyperparameters that allow the model to break down and process complex queries, and the compute and data trade-offs used in the training process.

Advertisement

The researchers said that the aim behind building a fully open-source version of DeepSeek-R1 is to provide transparency about reinforcement learning's enhanced outcome and to share reproducible insights with the community.

Advertisement

Hugging Face's Open-R1 Initiative

Since DeepSeek-R1 is available in the public domain, researchers were able to understand some aspects of the AI model. For instance, DeepSeek-V3, the base model used to create R1, was built with pure reinforcement learning without any human supervision. However, the reasoning-focused R1 model used several refinement steps that reject low-quality outputs, and produces polished and consistent answers.

To do this, Hugging Face researchers have developed a three-step plan. First, a distilled version of R1 will be created using its dataset. Then, the researchers will try to replicate the pure reinforcement learning pattern, and then the researchers will include supervised fine-tuning and further reinforcement learning till they adjust the responses on par with R1.

Advertisement

The synthetic dataset derived from distilling the R1 model as well as the training steps will then be released to the open-source community to allow developers to transform existing large language models (LLMs) into reasoning models just by fine-tuning them.

Notably, Hugging Face used a similar process to distil the Llama 3B AI model to show that test time compute (also known as inference time compute) can significantly enhance small language models.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Vivo Y51 Pro 5G Launched With 7,200mAh Battery at This Price in India
  2. Canva's AI-Powered Magic Layers Turns Images Into Editable Designs
  3. Samsung Galaxy A57 Renders Leak Online Again; Launch Expected Soon
  4. DxOMark Ranks iPhone 17 Pro Above Galaxy S26 Ultra in Camera Performance
  5. Xiaomi 17 Ultra Finally Arrives in India at This Price
  6. Xiaomi 17 Launched in India With Snapdragon 8 Elite Gen 5, Leica Cameras
  7. Exclusive: iQOO to Skip Neo Series Launch in India in 2026
  8. Poco X8 Pro Series Confirmed to Launch in India With This Battery
  9. Samsung Galaxy S26 Series Goes on Sale in India: See Price, Features
  1. WhatsApp Adds Support for Parent-Managed Accounts With Stricter Controls for Children Under 13
  2. Crimson Desert PC and Console Specs Revealed: Here's How the Game Will Run on PS5 and Xbox Series S/X
  3. Perplexity Ordered to Stop Deploying Shopping AI Agents on Amazon: Report
  4. Sonos Play and Sonos Era 100 SL Launched With Wi-Fi 6 Connectivity, AirPlay 2 Support: Price, Features
  5. Oppo Find N6 Colourways, Storage Variants Revealed as Company Teases Crease-Free Display's Components
  6. Canva’s New AI-Powered Magic Layers Feature Turns Images Into Editable Designs
  7. Tokenised Real-World Assets See 66 Percent Jump in 2026, DeFiLlama Data Shows
  8. YouTube’s Likeness Detection Tool Expanded to Government Officials and Journalists
  9. GainBitcoin Crypto Scam Case: CBI Arrests Darwin Labs CTO and Co-Founder Ayush Varshney
  10. Realme P4 Lite 5G India Launch Teased as Company Hints at Design and Availability
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.