Nvidia Releases Llama Nemotron AI Reasoning Models for Agentic Workflows

Llama Nemotron AI models are based on Meta’s Llama 3 series of models, with post-training enhancements added by Nvidia.

Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 20 March 2025 13:20 IST
Highlights
  • The Nvidia reasoning models are aimed at developers and enterprises
  • Llama Nemotron is available in Nano, Super and Ultra size variants
  • It is available via Nvidia’s NIM microservices platform

The Llama Nemotron models are available as an API on Nvidia’s platform and on Hugging Face

Photo Credit: Nvidia

Nvidia released a new family of artificial intelligence (AI) models on Tuesday at its GPU Technology Conference (GTC) 2025. Dubbed Llama Nemotron, these are the company's latest reasoning-focused large language models (LLMs), designed to offer a foundation for agentic AI workflows. The Santa Clara-based tech giant said the models are aimed at developers and enterprises, enabling them to build advanced AI agents that can work either independently or as connected teams to perform complex tasks. The Llama Nemotron models are currently available via Nvidia's platform and Hugging Face.

Nvidia Introduces New Reasoning-Focused AI Models

In a newsroom post, the tech giant detailed the new AI models. The Llama Nemotron reasoning models are based on Meta's Llama 3 series models, with post-training enhancements added by Nvidia. The company highlighted that the family of AI models displays improved capabilities in multistep mathematics, coding, reasoning, and complex decision-making.

The company highlighted that the post-training process improved the accuracy of the models by up to 20 percent compared to the base models. Inference speed is also said to be five times faster than that of similar-sized open-source reasoning models. Nvidia claimed that “the models can handle more complex reasoning tasks, enhance decision-making capabilities, and reduce operational costs for enterprises.” With these advancements, the LLMs can be used to build and power AI agents.

Llama Nemotron reasoning models are available in three sizes: Nano, Super, and Ultra. The Nano model is best suited for on-device and edge-based tasks that require high accuracy. The Super variant sits in the middle, offering high accuracy and throughput on a single GPU. Finally, the Ultra model is meant to run on multi-GPU servers and is positioned as the most accurate option for agentic tasks.

The reasoning models were post-trained on the Nvidia DGX Cloud with curated synthetic data generated by the Nemotron platform as well as other open models. The tech giant is also making the tools, datasets, and post-training optimisation techniques used to develop the Llama Nemotron models available to the open-source community.

Nvidia is also working with enterprise partners to bring the models to developers and businesses. These reasoning models and the NIM microservices can be accessed via Microsoft's Azure AI Foundry, and are also offered as an option in the Azure AI Agent Service. SAP is also using the models for its Business AI solutions and its AI copilot, Joule, the company said. Other enterprises using Llama Nemotron models include ServiceNow, Accenture, and Deloitte.

The Llama Nemotron Nano and Super models and NIM microservices are available to businesses and developers as an application programming interface (API) via Nvidia's platform as well as its Hugging Face listing. They are offered under the permissive Nvidia Open Model License Agreement, which allows both research and commercial use.

 
