Mistral Small 3.1 AI Model With Improved Text and Multimodal Performance Released

Mistral Small 3.1 AI model is a 24 billion parameter model with a context window of 1,28,000 tokens.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 18 March 2025 14:20 IST
Highlights
  • Mistral Small 3.1 can run on a single RTX 4090 or a Mac with 32GB RAM
  • It offers function calling and function execution for agentic workflow
  • Mistral Small 3.1 models are available on Hugging Face

Mistral is offering the AI models with an Apache 2.0 licence

Photo Credit: Unsplash/Solen Feyissa

Mistral Small 3.1 artificial intelligence (AI) model was released on Monday. The Paris-based AI firm introduced two open-source variants of the latest model — chat and instruct. The model comes as the successor to the Mistral Small 3, and offers improved text performance and multimodal understanding. The company claims that it outperforms comparable models such as Google's Gemma 3 and OpenAI's GPT-4o mini on several benchmarks. One of the key advantages of the newly introduced model is its rapid response times.

Mistral Small 3.1 AI Model Released

In a newsroom post, the AI firm detailed the new models. The Mistral Small 3.1 comes with an expanded context window of up to 1,28,000 tokens and is said to deliver inference speeds of 150 tokens per second. This essentially means the response time of the AI model is quite fast. It arrives in two variants of chat and instruct. The former works as a typical chatbot whereas the latter is fine-tuned to follow user instructions and is useful when building an application with a specific purpose.

Mistral Small 3.1 benchmark
Photo Credit: Mistral

Advertisement

 

Similar to its previous releases, the Mistral Small 3.1 is available in the public domain. The open weights can be downloaded from the firm's Hugging Face listing. The AI model comes with an Apache 2.0 licence which allows academic and research usage but forbids commercial use cases.

Advertisement

Mistral said that the large language model (LLM) is optimised to run on a single Nvidia RTX 4090 GPU or a Mac device with 32GB RAM. This means enthusiasts without an expensive setup to run AI models can also download and access it. The model also offers low-latency function calling and function execution which can be useful for building automation and agentic workflows. The company also allows developers to fine-tune the Mistral Small 3.1 to fit the use cases of specialised domains.

Advertisement

Coming to performance, the AI firm shared various benchmark scores based on internal testing. The Mistral Small 3.1 is said to outperform Gemma 3 and GPT-4o mini on the Graduate-Level Google-Proof Q&A (GPQA) Main and Diamond, HumanEval, MathVista, and the DocVQA benchmarks. However, GPT-4o mini performed better on the Massive Multitask Language Understanding (MMLU) benchmark, and Gemma 3 outperformed it on the MATH benchmark.

Apart from Hugging Face, the new model is also available via the application programming interface (API) on Mistral AI's developer playground La Plateforme, as well as on Google Cloud's Vertex AI. It will also be made available on Nvidia's NIM and Microsoft's Azure AI Foundry in the coming weeks.

 

Catch the latest from the Consumer Electronics Show on Gadgets 360, at our CES 2026 hub.

Advertisement

Related Stories

Popular Mobile Brands
  1. Apple Unveils Second-Generation AirTag With New UWB Chip, These Features
  2. NASA Gears Up for SpaceX Crew-12 Mission Ahead of February Launch
  1. Apple Launches Second-Generation AirTag With Higher Precision Finding Range, Louder Speaker
  2. Salliyargal Now Streaming Online: Where to Watch Karunaas and Sathyadevi Starrer Online?
  3. NASA’s Chandra Observatory Reveals 22 Years of Cosmic X-Ray Recordings
  4. Space Gen: Chandrayaan Now Streaming on JioHotstar: What You Need to Know About Nakuul Mehta and Shriya Saran Starrer
  5. NASA Evaluates Early Liftoff for SpaceX Crew-12 Following Rare ISS Medical Evacuation
  6. Raakaasa OTT Release Details: What You Need to Know About Niharika Konidela’s Horror Fantasy Film
  7. Sinking Calcium Carbonate Locked Away Greenhouse Gases, Reveals New Study
  8. Sarvam Maya Set for OTT Release on JioHotstar: All You Need to Know About Nivin Pauly’s Horror Comedy
  9. Europa’s Hidden Ocean Could Be ‘Fed’ by Sinking Salted Ice; New Study Boosts Hopes for Alien Life
  10. The Rookie Season 8 Now Available for Streaming Online: Where to Watch Nathan Fillion-Starrer Cop Drama Online?
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.