Mistral Small 3.1 AI Model With Improved Text and Multimodal Performance Released

Mistral Small 3.1 AI model is a 24 billion parameter model with a context window of 1,28,000 tokens.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 18 March 2025 14:20 IST
Highlights
  • Mistral Small 3.1 can run on a single RTX 4090 or a Mac with 32GB RAM
  • It offers function calling and function execution for agentic workflow
  • Mistral Small 3.1 models are available on Hugging Face

Mistral is offering the AI models with an Apache 2.0 licence

Photo Credit: Unsplash/Solen Feyissa

Mistral Small 3.1 artificial intelligence (AI) model was released on Monday. The Paris-based AI firm introduced two open-source variants of the latest model — chat and instruct. The model comes as the successor to the Mistral Small 3, and offers improved text performance and multimodal understanding. The company claims that it outperforms comparable models such as Google's Gemma 3 and OpenAI's GPT-4o mini on several benchmarks. One of the key advantages of the newly introduced model is its rapid response times.

Mistral Small 3.1 AI Model Released

In a newsroom post, the AI firm detailed the new models. The Mistral Small 3.1 comes with an expanded context window of up to 1,28,000 tokens and is said to deliver inference speeds of 150 tokens per second. This essentially means the response time of the AI model is quite fast. It arrives in two variants of chat and instruct. The former works as a typical chatbot whereas the latter is fine-tuned to follow user instructions and is useful when building an application with a specific purpose.

Advertisement

Mistral Small 3.1 benchmark
Photo Credit: Mistral

 

Similar to its previous releases, the Mistral Small 3.1 is available in the public domain. The open weights can be downloaded from the firm's Hugging Face listing. The AI model comes with an Apache 2.0 licence which allows academic and research usage but forbids commercial use cases.

Mistral said that the large language model (LLM) is optimised to run on a single Nvidia RTX 4090 GPU or a Mac device with 32GB RAM. This means enthusiasts without an expensive setup to run AI models can also download and access it. The model also offers low-latency function calling and function execution which can be useful for building automation and agentic workflows. The company also allows developers to fine-tune the Mistral Small 3.1 to fit the use cases of specialised domains.

Advertisement

Coming to performance, the AI firm shared various benchmark scores based on internal testing. The Mistral Small 3.1 is said to outperform Gemma 3 and GPT-4o mini on the Graduate-Level Google-Proof Q&A (GPQA) Main and Diamond, HumanEval, MathVista, and the DocVQA benchmarks. However, GPT-4o mini performed better on the Massive Multitask Language Understanding (MMLU) benchmark, and Gemma 3 outperformed it on the MATH benchmark.

Apart from Hugging Face, the new model is also available via the application programming interface (API) on Mistral AI's developer playground La Plateforme, as well as on Google Cloud's Vertex AI. It will also be made available on Nvidia's NIM and Microsoft's Azure AI Foundry in the coming weeks.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. iQOO Z11 Global Variant Visits Geekbench With a Different Snapdragon Chip
  2. These Four Xiaomi Phones Are Now Eligible to Get Android 17 Beta Updates
  3. You Can Now Turn Your PS5 Into a Linux Gaming PC
  4. Valathu Vashathe Kallan OTT Release: Where to Watch Malayalam Crime Thriller Online
  5. Exam OTT Release Date Confirmed: All You Need to Know About This Series
  6. EA Sports FC 26, Wuchang: Fallen Feathers and Nine Sols Join PS Plus in May
  7. Sony Issues Statement on New DRM Check for PS5, PS4 Games After Backlash
  8. OnePlus Pad 4 Launched in India With Flagship Chip and These Features
  9. Raakaasa OTT Release Date Confirmed: Know When and Where to Watch it Online
  10. How to Prepare Your MacBook for Sale or Trade-In: A Step-by-Step Guide
  1. ULA Atlas V Launches 29 Amazon Kuiper Satellites in Return Mission
  2. Moto Buds 2 Plus Launched in India With Hi-Res Audio, Up to 40 Hours of Total Playback Time: Price, Features
  3. iQOO Z11 Global Variant Spotted on Geekbench Database With Snapdragon Chipset, Unlike Chinese Model
  4. Samsung Reportedly Plans to Launch Galaxy Book Models With Android-Based One UI 9 Soon
  5. PS5 Linux Loader Gets Public Release, Allowing Users to Run Steam and PC Games on Console
  6. Nine Crypto Scam Centres Targeting US Users Shut Down in Joint Operation Involving UAE, US and China
  7. Google Photos Unveils New AI-Powered Wardrobe Feature to Help You Decide What to Wear
  8. OpenAI CEO Sam Altman Teases GPT-5.5 Cyber AI Model Rollout, Could Take On Anthropic’s Claude Mythos
  9. Vivo X Fold 6 Leaks Hint at 200-Megapixel Camera, MediaTek Dimensity 9500 Chip and 7,000mAh Battery
  10. Raakaasa OTT Release Date Confirmed: Know When and Where to Watch it Online
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.