Mistral Releases Voxtral, Its First Open-Source Speech Generation AI Models With Native Language Understanding

Mistral’s Voxtral AI model is available in two sizes of 24 billion parameters and three billion parameters.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 16 July 2025 13:11 IST
Highlights
  • Voxtral is available with a permissive Apache 2.0 licence
  • It can transcribe 30-minute-long audio or understand 40-minute-long audio
  • Mistral’s new speech generation model can detect multiple languages

Voxtral Small outperforms GPT-4o-mini and Gemini 2.5 Flash in speech translation

Photo Credit: Unsplash/Solen Feyissa

Mistral released its first speech understanding models on Tuesday. Dubbed Voxtral, it is an open-source audio generation artificial intelligence (AI) model that not only turns text into speech but can also understand text to generate speech as a response natively. These models are available in two sizes of 24 billion parameters and three billion parameters. The Paris-based AI firm highlighted that not only is Voxtral available to download for free, but the company is also making it available at an affordable rate via application programming interface (API).

Mistral Brings an Open Solution for Native Speech Generation

In a newsroom post, Mistral calls voice “humanity's first interface,” highlighting it as a foundational pillar of communication. As AI models become more capable, the French AI company said it was important to bring human-computer interactions to this natural interface.

Advertisement

However, there are some gaps in this effort. Mistral claimed today's voice-focused AI models can be grouped in two categories: open-source models that have a high word error rate and limited semantic understanding; and closed proprietary models that are very expensive and not accessible to all.

Voxtral, an open-source model with native semantic understanding, is aimed at closing this gap, the company added. There are three models in total — Voxtral Small with 24B parameters, Voxtral Mini with 3B parameters, and Voxtral Mini Transcribe with 3B parameters. All of these models are available to the open community with the Apache 2.0 license that allows both academic and commercial usage.

Advertisement

Mistral claims Voxtral offers the best balance between performance and cost efficiency
Photo Credit: Mistral

Advertisement

 

Notably, Voxtral Small is the company's premium model aimed at production-scale applications, while the Voxtral Mini is designed for local and edge deployments. The Voxtral Mini Transcribe is focused on transcription-related tasks and is said to outperform OpenAI Whisper.

Advertisement

Voxtral models have a context window of 32,000 tokens, which translates to up to 30 minutes of transcription or 40 minutes of voice understanding. It can also answer questions about audio content and generate summaries natively. Additionally, Voxtral is also capable of detecting multiple languages, including English, Spanish, French, Portuguese, Hindi, German, Dutch, Italian, and more.

These models are built on top of Mistral Small 3.1, Voxtral models also offer function calling via voice, so users can command the AI system without having to type anything. Mistral claims that the Vostral Small model outperforms GPT-4o mini Transcribe and Gemini 2.5 Flash across tasks, and surpasses ElevenLabs Scribe in multilingual capabilities.

The Voxtral models can be downloaded from the company's Hugging Face listing, accessed via API at a starting price of $0.001 (roughly Re. 1) per minute, or can be tried out via Mistral's Le Chat platform.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Oppo Find X9 Ultra With 200-Megapixel Periscope Camera Launched Globally
  2. Poco M8s 5G Debuts Globally With 7,000mAh Battery: See Price, Features
  3. Oppo Find X9s Pro Launched With 200-Megapixel Cameras: See Price, Features
  4. Motorola Edge 70 Pro+ Leaked Renders Hint at Design, Five Colour Options
  5. Vivo X300 FE Roundup: Expected Price in India, Specifications
  6. Oppo Pad 5 Pro With 13,380mAh Battery Debuts Alongside Pad Mini: See Prices
  7. Jailer 2 OTT Release Date Reportedly Revealed Online: When and Where to Watch it Online?
  1. NASA Shuts Down Voyager 1 Instrument to Extend Mission Life in Deep Space
  2. Oppo Enco Clip 2 With Open-Ear Design, Up to 40 Hours Total Battery Life Launched Alongside Oppo Watch X3 Mini
  3. Vivo Y6t Launched With 6,500mAh Battery, Snapdragon 4 Gen 2 SoC: Price, Specifications
  4. OCBC Partners Lion Global Investors and DigiFT to Launch Tokenised Gold Fund With GOLDX Token
  5. Oppo Pad 5 Pro Launched With 13,380mAh Battery, Snapdragon 8 Elite Gen 5 SoC Alongside Oppo Pad Mini: Price, Features
  6. Redmi K90 Max Launched With Dimensity 9500 SoC, 8,550mAh Battery and Active Cooling Fan: Price, Specifications
  7. Oppo Find X9 Ultra Launched With Snapdragon 8 Elite Gen 5 SoC, 200-Megapixel Periscope Camera: Price, Specifications
  8. Oppo Find X9s Pro Launched With 200-Megapixel Cameras, 7,025mAh Battery: Price, Specifications
  9. OnePlus Ace 6 Ultra Geekbench Listing Reveals MediaTek Dimensity 9500 Chip, 16GB RAM
  10. Motorola Edge 70 Pro+ Leaked Renders Hint at Design, Five Colour Options
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.