Cohere for AI Releases Open-Source Aya Vision Models for Computer Vision-Based Tasks

Cohere is also making its models accessible via WhatsApp for free.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 5 March 2025 17:07 IST
Highlights
  • Cohere’s Aya Vision models can generate output in 23 languages
  • Aya Vision is available in 8B and 32B parameter sizes
  • The models are said to outperform Meta’s Llama-3.2 90B Vision

The open-source models are available for non-commercial purposes

Photo Credit: Cohere

Cohere For AI, the firm's open research division, released new state-of-the-art (SOTA) vision models on Tuesday. Dubbed Aya Vision, the artificial intelligence (AI) models are available in two parameter sizes. The company's latest frontier models address the inconsistent performance of existing large language models (LLMs) across different languages, especially for multimodal tasks. Aya Vision models can generate outputs in 23 languages and can perform both text-based and image-based tasks. However, it cannot generate images. Cohere has made the AI models available on open-source repositories as well as via WhatsApp.

Cohere Releases Aya Vision AI Models

In a blog post, the AI firm detailed the new vision models. Aya Vision is available in 8B and 32B parameter sizes. These models can generate text, translate text and images across 23 languages, analyse images and answer queries about them, as well as caption images. Both models can be accessed via Cohere's Hugging Face page and on Kaggle.

Additionally, general users can try out Cohere's models via a dedicated WhatsApp chat account that can be accessed here. The company says the Aya Vision models are useful for instances when people come across images or artworks they would like to learn more about.

Advertisement

Based on the company's internal testing, the Aya Vision 8B model outperforms Qwen2.5-VL 7B, Gemini Flash 1.5 8B, and Llama 3.2 11B Vision models on the AyaVisionBench and m-WildVision benchmarks. Notably, the AyaVisionBench benchmark was also developed by Cohere, and its details have been shared in the public domain.

Advertisement

Coming to the Aya Vision 32B model, the company claimed that it outperformed Llama 3.2 90B Vision and Qwen2-VL 72B models on the same benchmarks.

To achieve frontier performance, Cohere claimed that several algorithmic innovations were developed. The Aya Vision models were fed synthetic annotations, developers scaled up multilingual data through translation and rephrasing, and multiple multimodal models were merged in separate steps. The developers observed that in each step, the performance was significantly improved.

Advertisement

Notably, developers can access the open weights of the Aya Vision models from Kaggle and Hugging Face, however, these models are available with a Creative Commons Attribution Non Commercial 4.0 license. It allows for academic and research-based usage but prohibits commercial use cases.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Xiaomi 17 Ultra Finally Arrives in India at This Price
  2. DxOMark Ranks iPhone 17 Pro Above Galaxy S26 Ultra in Camera Performance
  3. Samsung Galaxy A57 Renders Leak Online Again; Launch Expected Soon
  4. Samsung Galaxy S26 Series Goes on Sale in India: See Price, Features
  5. Vivo Y51 Pro 5G Launched With 7,200mAh Battery at This Price in India
  6. Xiaomi 17 Launched in India With Snapdragon 8 Elite Gen 5, Leica Cameras
  7. Canva's AI-Powered Magic Layers Turns Images Into Editable Designs
  8. Poco X8 Pro Series Confirmed to Launch in India With This Battery
  1. James Webb Telescope Captures Rare Infrared Footprints of Io and Ganymede Inside Jupiter’s Auroras
  2. WhatsApp Adds Support for Parent-Managed Accounts With Stricter Controls for Children Under 13
  3. Crimson Desert PC and Console Specs Revealed: Here's How the Game Will Run on PS5 and Xbox Series S/X
  4. Perplexity Ordered to Stop Deploying Shopping AI Agents on Amazon: Report
  5. Sonos Play and Sonos Era 100 SL Launched With Wi-Fi 6 Connectivity, AirPlay 2 Support: Price, Features
  6. Oppo Find N6 Colourways, Storage Variants Revealed as Company Teases Crease-Free Display's Components
  7. Canva’s New AI-Powered Magic Layers Feature Turns Images Into Editable Designs
  8. Tokenised Real-World Assets See 66 Percent Jump in 2026, DeFiLlama Data Shows
  9. The Society Season 2 OTT Release: Where to Watch Munawar Faruqui and Shreya Kalra’s Reality Survival Series?
  10. YouTube’s Likeness Detection Tool Expanded to Government Officials and Journalists
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.