Mistral Small 3.1 AI Model With Improved Text and Multimodal Performance Released

Mistral Small 3.1 is a 24-billion-parameter AI model with a context window of up to 128,000 tokens.

Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 18 March 2025 14:20 IST
Highlights
  • Mistral Small 3.1 can run on a single RTX 4090 or a Mac with 32GB RAM
  • It offers function calling and function execution for agentic workflows
  • Mistral Small 3.1 models are available on Hugging Face

Mistral is offering the AI models with an Apache 2.0 licence

The Mistral Small 3.1 artificial intelligence (AI) model was released on Monday. The Paris-based AI firm introduced two open-source variants of the latest model: base and instruct. The model succeeds Mistral Small 3 and offers improved text performance and multimodal understanding. The company claims that it outperforms comparable models such as Google's Gemma 3 and OpenAI's GPT-4o mini on several benchmarks. One of the key advantages of the new model is its fast response time.

Mistral Small 3.1 AI Model Released

In a newsroom post, the AI firm detailed the new models. Mistral Small 3.1 comes with an expanded context window of up to 128,000 tokens and is said to deliver inference speeds of 150 tokens per second, which means it generates responses quickly. It arrives in two variants, base and instruct. The former is the pre-trained model, typically used as a starting point for further fine-tuning, whereas the latter is fine-tuned to follow user instructions and is useful when building an application with a specific purpose.
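
To put the quoted speed in perspective, a back-of-the-envelope calculation helps. The sketch below assumes a steady generation rate of 150 tokens per second, as claimed by the company, and ignores prompt processing and network latency. The function name and the 500-token example are illustrative and do not come from the announcement.

```python
def generation_time_seconds(num_tokens: int, tokens_per_second: float = 150.0) -> float:
    """Estimate how long it takes to generate num_tokens at a steady rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


# Hypothetical example: a 500-token answer at the claimed 150 tokens per second.
print(f"{generation_time_seconds(500):.1f} seconds")
```

Under these assumptions, a 500-token reply would take a little over three seconds to generate. Real-world timings depend on the hardware and the serving setup.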

Figure: Mistral Small 3.1 benchmark (Photo Credit: Mistral)

Similar to its previous releases, Mistral Small 3.1 is openly available. The open weights can be downloaded from the firm's Hugging Face listing. The AI model comes with an Apache 2.0 licence, a permissive licence that allows both research and commercial use.

Mistral said that the large language model (LLM) is optimised to run on a single Nvidia RTX 4090 GPU or a Mac device with 32GB RAM. This means enthusiasts without an expensive setup can also download and run it locally. The model also offers low-latency function calling and function execution, which can be useful for building automation and agentic workflows. Developers can also fine-tune Mistral Small 3.1 for specialised domains.
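
The hardware claim can be sanity-checked with simple arithmetic. The sketch below estimates the memory needed for the weights alone at different precisions, assuming 24 billion parameters as stated above. It ignores the memory used by activations and the context cache, and the function name is illustrative. The article does not say which precision Mistral's hardware figures assume.

```python
GB = 1e9  # decimal gigabyte, in bytes


def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate memory needed for the model weights alone, in GB."""
    return num_params * bits_per_param / 8 / GB


PARAMS = 24e9  # 24 billion parameters, as stated for Mistral Small 3.1

# Compare common precisions: 16-bit, and 8-bit or 4-bit quantised weights.
for bits in (16, 8, 4):
    print(f"{bits}-bit weights: ~{weight_memory_gb(PARAMS, bits):.0f} GB")
```

By this estimate, 16-bit weights need about 48GB, which exceeds the 24GB of memory on an RTX 4090, while 4-bit weights need about 12GB. This suggests that running the model on such hardware would rely on a quantised build.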

On performance, the AI firm shared various benchmark scores based on internal testing. Mistral Small 3.1 is said to outperform Gemma 3 and GPT-4o mini on the Graduate-Level Google-Proof Q&A (GPQA) Main and Diamond, HumanEval, MathVista, and DocVQA benchmarks. However, GPT-4o mini performed better on the Massive Multitask Language Understanding (MMLU) benchmark, and Gemma 3 outperformed Mistral Small 3.1 on the MATH benchmark.

Apart from Hugging Face, the new model is also available via the application programming interface (API) on Mistral AI's developer playground La Plateforme, as well as on Google Cloud's Vertex AI. It will also be made available on Nvidia's NIM and Microsoft's Azure AI Foundry in the coming weeks.

 
