DeepSeek and Tsinghua Developing Self-Improving AI Models

DeepSeek is calling these new models DeepSeek-GRM — short for “generalist reward modeling”.

Advertisement
By Saritha Rai, Bloomberg | Updated: 7 April 2025 13:38 IST
Highlights
  • DeepSeek is exploring ways make AI models more efficient
  • The aim is to bring AI models in alignment with human preferances
  • DeepSeek's AI revamp strategy uses fewer computing resources

DeepSeek roiled markets with its low-cost reasoning AI model back in January this year

Photo Credit: Reuters

DeepSeek is working with Tsinghua University on reducing the training its AI models need in an effort to lower operational costs.

The Chinese startup, which roiled markets with its low-cost reasoning model that emerged in January, collaborated with researchers from the Beijing institution on a paper detailing a novel approach to reinforcement learning to make models more efficient.

The new method aims to help artificial intelligence models better adhere to human preferences by offering rewards for more accurate and understandable responses, the researchers wrote. Reinforcement learning has proven effective in speeding up AI tasks in narrow applications and spheres. However, expanding it to more general applications has proven challenging — and that's the problem that DeepSeek's team is trying to solve with something it calls self-principled critique tuning. The strategy outperformed existing methods and models on various benchmarks and the result showed better performance with fewer computing resources, according to the paper.

Advertisement

DeepSeek is calling these new models DeepSeek-GRM — short for “generalist reward modeling” — and will release them on an open source basis, the company said. Other AI developers, including Chinese tech giant Alibaba Group Holding. and San Francisco-based OpenAI, are also pushing into a new frontier of improving reasoning and self-refining capabilities while an AI model is performing tasks in real time.

Advertisement

Menlo Park, California-based Meta Platforms Inc. released its latest family of AI models, Llama 4, over the weekend and marked them as its first to use the Mixture of Experts (MoE) architecture. DeepSeek's models rely significantly on MoE to make more efficient use of resources, and Meta benchmarked its new release against the Hangzhou-based startup. DeepSeek hasn't specified when it might release its next flagship model.

© 2025 Bloomberg LP

(This story has not been edited by NDTV staff and is auto-generated from a syndicated feed.)

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Apple's iOS 26.1 May Launch on This Date, Followed By iOS 26.2 Beta Rollout
  2. Here Are the Best Smartphones Under Rs 20,000 With AMOLED Display
  3. Apple is Expected to Launch These Products Next Year
  4. NASA's JWST Produces First-Ever 3D Map of Distant Planet WASP-18b
  5. Lava Agni 4 Will Be Launched on This Date
  6. Apple Enters List of Top 5 Phone Makers in India in Q3 2025: Counterpoint
  7. Oppo Reno 15 Series Might Launch in India Next Month
  8. Realme GT 8 Pro Aston Martin F1 Limited Edition Launch Date Revealed
  9. Red Magic 11 Pro Launched in Global Markets With Slightly Smaller Battery
  1. Japan’s Akatsuki Spacecraft Declared Inoperable, Marking End of Dedicated Venus Missions
  2. NASA’s JWST Produces First-Ever 3D Map of Distant Planet WASP-18b
  3. Bad Girl OTT Release Date Revealed: Know When and Where to Watch This Tamil Movie Online
  4. Dhoolpet Police Station OTT Release: Know When and Where to Watch This Upcoming Crime Series Online
  5. Rockstar Games Co-Founder Says GTA Games Won't Work if Set Outside the US
  6. Iran Tackles Unauthorised Crypto Mining After 95 Percent of Bitcoin Mining Devices Found Operating Illegally
  7. Red Magic 11 Pro Launched Globally With Snapdragon Elite Gen 5, Slightly Smaller Battery: Price, Specifications
  8. Microsoft AI Chief Mustafa Suleyman Calls the Idea of Conscious AI ‘Absurd’: Report
  9. Poco F8 Ultra, Poco F8 Pro Global Launch Around the Corner, Tipster Claims
  10. India’s Smartphone Shipments Grew 5 Percent YoY in Q3 2025; Apple Enters List of Top 5 Phone Makers: Counterpoint
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.