Mistral Small 3.1 AI Model With Improved Text and Multimodal Performance Released

Mistral Small 3.1 is a 24 billion parameter AI model with a context window of 128,000 tokens.

Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 18 March 2025 14:20 IST
Highlights
  • Mistral Small 3.1 can run on a single RTX 4090 or a Mac with 32GB RAM
  • It offers function calling and function execution for agentic workflows
  • Mistral Small 3.1 models are available on Hugging Face

Mistral is offering the AI models with an Apache 2.0 licence

Photo Credit: Unsplash/Solen Feyissa

The Mistral Small 3.1 artificial intelligence (AI) model was released on Monday. The Paris-based AI firm introduced two open-source variants of the latest model: chat and instruct. The model succeeds Mistral Small 3 and offers improved text performance and multimodal understanding. The company claims that it outperforms comparable models such as Google's Gemma 3 and OpenAI's GPT-4o mini on several benchmarks. One of the key advantages of the newly introduced model is its rapid response time.

Mistral Small 3.1 AI Model Released

In a newsroom post, the AI firm detailed the new models. Mistral Small 3.1 comes with an expanded context window of up to 128,000 tokens and is said to deliver inference speeds of 150 tokens per second, which means the model generates its responses quickly. It arrives in two variants, chat and instruct. The former works as a typical chatbot, whereas the latter is fine-tuned to follow user instructions and is useful when building an application with a specific purpose.
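
As a rough illustration of what 150 tokens per second means in practice, the sketch below converts a response length into generation time. It assumes a constant decode rate and ignores prompt processing and network latency. The function name and the example response lengths are illustrative and do not come from Mistral's post.

```python
TOKENS_PER_SECOND = 150  # inference speed quoted in Mistral's post


def generation_seconds(num_tokens: int, tokens_per_second: float = TOKENS_PER_SECOND) -> float:
    """Time to generate num_tokens, assuming a constant decode rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


# Hypothetical response lengths, chosen only for illustration.
for tokens in (150, 600, 3000):
    print(f"{tokens} tokens -> {generation_seconds(tokens):.1f} s")
```

Under these assumptions, a 600-token answer would take about four seconds to generate.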

Mistral Small 3.1 benchmark
Photo Credit: Mistral

As with its previous releases, Mistral Small 3.1 is openly available. The open weights can be downloaded from the firm's Hugging Face listing. The AI model comes with an Apache 2.0 licence, a permissive licence that allows both research and commercial use.

Mistral said that the large language model (LLM) is optimised to run on a single Nvidia RTX 4090 GPU or a Mac device with 32GB RAM. This means enthusiasts without an expensive AI setup can also download and run it. The model also offers low-latency function calling and function execution, which can be useful for building automation and agentic workflows. The company also allows developers to fine-tune Mistral Small 3.1 for specialised domains.
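
A back-of-the-envelope estimate helps show why a 24 billion parameter model is within reach of this hardware. The sketch below counts memory for the weights alone at a few common precisions and compares it with the 24GB of VRAM on an RTX 4090 and the 32GB Mac mentioned above. The precisions are assumptions for illustration, since the article does not say which one Mistral had in mind. The estimate also ignores the KV cache, activations, and runtime overhead, so real requirements are higher.

```python
PARAMS = 24e9  # 24 billion parameters, as stated in the article

# Bytes per parameter at common precisions (assumed for illustration).
BYTES_PER_PARAM = {"fp16/bf16": 2.0, "int8": 1.0, "int4": 0.5}

# Memory budgets in GB: the RTX 4090 has 24GB of VRAM; the Mac figure is from the article.
BUDGETS_GB = {"RTX 4090 (24GB)": 24, "Mac (32GB)": 32}


def weights_gb(params: float, bytes_per_param: float) -> float:
    """Weights-only footprint in GB (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


for precision, nbytes in BYTES_PER_PARAM.items():
    size = weights_gb(PARAMS, nbytes)
    # "Fits" here means strictly below the budget, leaving some headroom.
    fits = [name for name, budget in BUDGETS_GB.items() if size < budget]
    print(f"{precision}: {size:.0f} GB of weights; leaves headroom on: {fits or 'neither'}")
```

On these assumptions, half-precision weights (about 48GB) exceed both budgets, while a 4-bit version (about 12GB) fits on either device, which suggests the single-GPU claim depends on quantisation.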

On performance, the AI firm shared various benchmark scores based on internal testing. Mistral Small 3.1 is said to outperform Gemma 3 and GPT-4o mini on the Graduate-Level Google-Proof Q&A (GPQA) Main and Diamond, HumanEval, MathVista, and DocVQA benchmarks. However, GPT-4o mini performed better on the Massive Multitask Language Understanding (MMLU) benchmark, and Gemma 3 outperformed Mistral Small 3.1 on the MATH benchmark.

Apart from Hugging Face, the new model is also available via the application programming interface (API) on Mistral AI's developer playground La Plateforme, as well as on Google Cloud's Vertex AI. It will also be made available on Nvidia's NIM and Microsoft's Azure AI Foundry in the coming weeks.

 
