DeepSeek and Tsinghua Developing Self-Improving AI Models

DeepSeek is calling these new models DeepSeek-GRM — short for “generalist reward modeling”.

By Saritha Rai, Bloomberg | Updated: 7 April 2025 13:38 IST
Highlights
  • DeepSeek is exploring ways to make AI models more efficient
  • The aim is to bring AI models in alignment with human preferences
  • DeepSeek's AI revamp strategy uses fewer computing resources

DeepSeek is working with Tsinghua University on reducing the training its AI models need in an effort to lower operational costs.

The Chinese startup, which roiled markets with its low-cost reasoning model that emerged in January, collaborated with researchers from the Beijing institution on a paper detailing a novel approach to reinforcement learning to make models more efficient.

The new method aims to help artificial intelligence models better adhere to human preferences by offering rewards for more accurate and understandable responses, the researchers wrote. Reinforcement learning has proven effective in speeding up AI tasks in narrow applications and spheres. However, expanding it to more general applications has proven challenging, and that is the problem DeepSeek's team is trying to solve with something it calls self-principled critique tuning. The strategy outperformed existing methods and models on various benchmarks, and the results showed better performance with fewer computing resources, according to the paper.

DeepSeek is calling these new models DeepSeek-GRM, short for “generalist reward modeling”, and will release them on an open source basis, the company said. Other AI developers, including Chinese tech giant Alibaba Group Holding Ltd. and San Francisco-based OpenAI, are also pushing into a new frontier of improving reasoning and self-refining capabilities while an AI model is performing tasks in real time.

Menlo Park, California-based Meta Platforms Inc. released its latest family of AI models, Llama 4, over the weekend and marked them as its first to use the Mixture of Experts (MoE) architecture. DeepSeek's models rely significantly on MoE to make more efficient use of resources, and Meta benchmarked its new release against the Hangzhou-based startup. DeepSeek hasn't specified when it might release its next flagship model.

© 2025 Bloomberg LP

(This story has not been edited by NDTV staff and is auto-generated from a syndicated feed.)

 
