Search

Cohere for AI Releases Open-Source Aya Vision Models for Computer Vision-Based Tasks

Cohere is also making its models accessible via WhatsApp for free.

Advertisement
Highlights
  • Cohere’s Aya Vision models can generate output in 23 languages
  • Aya Vision is available in 8B and 32B parameter sizes
  • The models are said to outperform Meta’s Llama-3.2 90B Vision
Cohere for AI Releases Open-Source Aya Vision Models for Computer Vision-Based Tasks

The open-source models are available for non-commercial purposes

Photo Credit: Cohere

Cohere For AI, the firm's open research division, released new state-of-the-art (SOTA) vision models on Tuesday. Dubbed Aya Vision, the artificial intelligence (AI) models are available in two parameter sizes. The company's latest frontier models address the inconsistent performance of existing large language models (LLMs) across different languages, especially for multimodal tasks. Aya Vision models can generate outputs in 23 languages and can perform both text-based and image-based tasks. However, it cannot generate images. Cohere has made the AI models available on open-source repositories as well as via WhatsApp.

Cohere Releases Aya Vision AI Models

In a blog post, the AI firm detailed the new vision models. Aya Vision is available in 8B and 32B parameter sizes. These models can generate text, translate text and images across 23 languages, analyse images and answer queries about them, as well as caption images. Both models can be accessed via Cohere's Hugging Face page and on Kaggle.

Additionally, general users can try out Cohere's models via a dedicated WhatsApp chat account that can be accessed here. The company says the Aya Vision models are useful for instances when people come across images or artworks they would like to learn more about.

Based on the company's internal testing, the Aya Vision 8B model outperforms Qwen2.5-VL 7B, Gemini Flash 1.5 8B, and Llama 3.2 11B Vision models on the AyaVisionBench and m-WildVision benchmarks. Notably, the AyaVisionBench benchmark was also developed by Cohere, and its details have been shared in the public domain.

Coming to the Aya Vision 32B model, the company claimed that it outperformed Llama 3.2 90B Vision and Qwen2-VL 72B models on the same benchmarks.

To achieve frontier performance, Cohere claimed that several algorithmic innovations were developed. The Aya Vision models were fed synthetic annotations, developers scaled up multilingual data through translation and rephrasing, and multiple multimodal models were merged in separate steps. The developers observed that in each step, the performance was significantly improved.

Notably, developers can access the open weights of the Aya Vision models from Kaggle and Hugging Face, however, these models are available with a Creative Commons Attribution Non Commercial 4.0 license. It allows for academic and research-based usage but prohibits commercial use cases.

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

 
Show Full Article
Please wait...
Advertisement

Related Stories

Popular Mobile Brands
  1. Poco F7 Launch Date, Price in India, Design and Key Features Leaked Online
  2. OnePlus Nord 5 Series, OnePlus Buds 4 to Launch in India on This Date
  3. Vivo Y400 Pro 5G India Launch Date Confirmed; Design Revealed
  4. Oppo K13x 5G India Launch Date, Price Range and Key Features Revealed
  5. Hisense U7Q Mini-LED TV Launched in India With These Features
  6. Oppo Reno 14 5G Series, Watch X2 Mini, Enco Buds 3, Pad SE to Launch Globally
  7. You Can Now Download Generated Canvas in ChatGPT
  8. Lenovo Legion Pro 7i Refreshed With Intel Core Ultra 9 CPU, RTX 5090 GPUs
  9. Realme Narzo 80 Lite 5G Launched in India With 6,000mAh Battery: See Price
  10. Xiaomi Pad 7S Pro Launch Date, Key Specifications Revealed Ahead of Launch
  1. The Witcher 4 Will Target 60 FPS on Consoles, but Series S Will Be 'Extremely Challenging' Says CD Projekt Red
  2. Oppo Reno 14 5G Series Global Launch Teased Alongside Watch X2 Mini, Enco Buds 3 and Pad SE
  3. Microsoft Begins Testing AI Agents in Windows 11, Brings Option to Share Recall Snapshots in Europe
  4. watchOS 26 to Bring Control Center Customisation Options with User-Defined Toggles
  5. Tecno Pova 7 5G Series India Launch Teased; Confirmed to Be Available on Flipkart
  6. Oppo K13 Turbo Pro Key Specifications Leaked; Could Be Equipped With Snapdragon 8s Gen 4 SoC
  7. Lenovo Legion Pro 7i (2025) With Intel Core Ultra 9 HX CPU, Up to Nvidia GeForce RTX 5090 GPU Launched
  8. Hisense U7Q Mini-LED TV With 144Hz Gaming Support, Built-in Subwoofer Launched in India
  9. OnePlus Nord 5, Nord CE 5, and Buds 4 India Launch Date Set for July 8; Key Features, Availability Revealed
  10. OpenAI Makes Canvas in ChatGPT Downloadable, Adds New Capabilities to Projects
Gadgets 360 is available in
Download Our Apps
App Store App Store
Available in Hindi
App Store
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.
Trending Products »
Latest Tech News »