Search

Mistral Announces Pixtral 12B Multimodal AI Model With 'Computer Vision' Feature

Mistral’s Pixtral 12B AI model can accept images as input and answer queries about them.

Advertisement
Highlights
  • Pixtral 12B cannot generate images
  • Mistral’s new AI model has a size of 24GB
  • Pixtral 12B will also be available on le Chat and la Plateforme soon
Mistral Announces Pixtral 12B Multimodal AI Model With 'Computer Vision' Feature

Pixtral 12B is built on Mistral’s Nemo 12B large language model

Photo Credit: Unsplash/Solen Feyissa

Mistral released its first multimodal artificial intelligence (AI) model dubbed Pixtral 12B on Wednesday. The AI firm, known for its open-source large language models (LLMs), has also made the latest AI model available on GitHub and Hugging Face for users to download and test out. Notably, despite being multimodal, Pixtral can only process images using computer vision technology and answer queries about them. Two special encoders have been added for this functionality. It cannot generate images like the Stable Diffusion models or Midjourney's Generative Adversarial Networks (GANs).

Mistral Releases Pixtral 12B

Gaining a reputation for minimalist announcements, the official account of Mistral on X (formerly known as Twitter) released the AI model in a post by sharing its magnet link. The total file size of Pixtral 12B is 24GB, and it will require an NPU-enabled PC or one with a powerful GPU to run the model.

The Pixtral 12B comes with 12 billion parameters and is built using the company's existing Nemo 12B AI model. Mistral highlights users will also need the Gaussian Error Linear Unit (GeLU) as the vision adapter and 2D Rotary Position Embedding (RoPE) as the vision encoder.

Notably, users can upload image files or URLs to the Pixtral 12B and it should be able to answer queries about the image such as identifying the objects, counting the number of objects, and sharing additional information. Since it is built on Nemo, the model will also be adept at completing all the typical text-based tasks as well.

A Reddit user posted an image about the benchmarking scores of Pixtral 12B, and it appears that the LLM outperforms Claude-3 Haiku and Phi-3 Vision in multimodal capabilities on the ChartQA bench. It also outperforms both rival AI models on the Massive Multitask Language Understanding (MMLU) bench for multimodal knowledge and reasoning.

Citing the company spokesperson, TechCrunch reports that the Mistral AI model can be fine-tuned and used under an Apache 2.0 license. This means the outputs from the model can be used for personal or commercial usage without restrictions. Additionally, Sophia Yang, the Head of Developer Relations at Mistral clarified in a post that Pixtral 12B will soon be available on Le Chat and Le Platforme.

For now, users can directly download the AI model using the magnet link provided by the company. Alternatively, the model weights have also been hosted on Hugging Face and GitHub listings.

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

 
Show Full Article
Please wait...
Advertisement

Related Stories

Popular Mobile Brands
  1. OnePlus Nord CE 5 Review
  2. OnePlus Nord 5, OnePlus Nord CE 5 Launched in India at These Prices
  3. OnePlus Nord 5 Review
  4. Ai+ Wearbuds Smartwatch Launched in India With Built-In TWS Earbuds
  5. Samsung Galaxy Buds 3 Pro's Amazon Prime Day 2025 Offer Revealed
  6. Realme 15 5G, 15 Pro 5G to Launch in India on This Date
  7. Samsung Galaxy Z Fold 7, Z Flip 7, Z Flip 7 FE Specifications Leaked
  8. WhatsApp's AI-Powered Chat Wallpaper Feature Is Coming to iOS
  9. Oppo Reno 14 Gets a New Variant With a Colour Changing Rear Panel
  10. AI+ Pulse, AI+ Nova 5G With 50-Megapixel Rear Cameras Launched in India
  1. Vivo V60 Reportedly Listed on SIRIM and TUV Websites, Could Launch Soon
  2. Amazon Prime Day 2025 Sale: iQOO 13, iQOO Neo 10R, iQOO Z10x and More to Go on Sale at Discounted Prices
  3. Swiggy Instamart Teams Up With Jio for Instant Delivery of JioBharat V4 and JioPhone Prima 2
  4. Apple Maps in iOS 26 Beta Version Come With An Upgraded Search Feature: Report
  5. WhatsApp Rolls Out AI-Powered Chat Wallpaper Feature; Threaded Message Replies Spotted in Development
  6. Samsung Galaxy Watch 8 Series Could Launch With Gemini Voice Assistant
  7. Amazon Prime Day 2025 Sale: Samsung Galaxy Buds 3 Pro to Be Available at a Discounted Price
  8. Oppo Reno 14 Launched in New Finish With Temperature-Sensitive Colour Changing Rear Panel
  9. Microsoft Edge Can Now Load Websites Faster After Migration to WebUI 2.0, Says Company
  10. Samsung Galaxy S25 FE to Sport a 6.7-Inch Flexible OLED Display: Report
Gadgets 360 is available in
Download Our Apps
App Store App Store
Available in Hindi
App Store
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.
Trending Products »
Latest Tech News »