
Alibaba’s Qwen Team Releases QVQ-72B Open Source Vision AI Model in Preview

The QVQ-72B AI model outperformed OpenAI's o1 on the MathVista benchmark.

Highlights
  • The QVQ-72B AI model combines vision and reasoning-based capabilities
  • Alibaba’s latest model is built on the Qwen2-VL-72B
  • It also scored 70.3 percent on the MMMU benchmark
The QVQ-72B AI model can be accessed from Hugging Face

Photo Credit: Unsplash/Markus Winkler

Alibaba's Qwen research team has released another open-source artificial intelligence (AI) model in preview. Dubbed QVQ-72B, it is a vision-based reasoning model that can analyse visual information from images and understand the context behind them. The tech giant has also shared benchmark scores of the AI model and highlighted that on one specific test, it was able to outperform OpenAI's o1 model. Notably, Alibaba has released several open-source AI models recently, including the QwQ-32B and Marco-o1 reasoning-focused large language models (LLMs).

Alibaba's Vision-Based QVQ-72B AI Model Launched

In a Hugging Face listing, Alibaba's Qwen team detailed the new open-source AI model. Calling it an experimental research model, the researchers highlighted that the QVQ-72B comes with enhanced visual reasoning capabilities. Interestingly, vision and reasoning are two separate branches of capability that the researchers have combined in this model.

Vision-based AI models are plentiful. They include an image encoder and can analyse the visual information in an image and the context behind it. Similarly, reasoning-focused models such as o1 and QwQ-32B come with test-time compute scaling capabilities, which let the model spend more processing time on a query at inference. This enables the model to break down the problem, solve it in a step-by-step manner, assess the output, and correct it against a verifier.
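The idea of spending more compute at inference and checking answers against a verifier can be illustrated with a toy loop. This is only a sketch under stated assumptions, not how QVQ-72B, QwQ-32B, or o1 work internally: the names `solve_with_budget` and `is_root` are hypothetical, the "model" is just a stream of candidate integers, and the verifier is a simple equation check.

```python
from itertools import islice
from typing import Callable, Iterable, Optional, Tuple


def solve_with_budget(
    candidates: Iterable[int],
    verify: Callable[[int], bool],
    budget: int,
) -> Tuple[Optional[int], int]:
    """Check candidate answers one by one until the verifier accepts one
    or the attempt budget (a stand-in for test-time compute) runs out."""
    attempts = 0
    for candidate in islice(candidates, budget):
        attempts += 1
        if verify(candidate):
            return candidate, attempts
    return None, attempts


def is_root(x: int) -> bool:
    # Hypothetical verifier: accepts integer roots of x^2 - 5x + 6.
    return x * x - 5 * x + 6 == 0


# A small budget stops before any candidate passes the verifier.
small = solve_with_budget(range(10), is_root, budget=2)
# A larger budget allows enough attempts to reach a verified answer.
large = solve_with_budget(range(10), is_root, budget=5)

print(small)
print(large)
```

With the small budget the loop gives up without an answer, while the larger budget reaches a candidate that passes the check. Real reasoning models generate and revise text rather than enumerate integers, so the analogy holds only at the level of "more attempts, checked against a verifier".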

With QVQ-72B's preview model, Alibaba has combined these two functionalities. It can analyse information from images and answer complex queries by using reasoning-focused structures. The team highlights that this combination has significantly improved the model's performance.

Sharing evals from internal testing, the researchers claimed that the QVQ-72B scored 71.4 percent on the MathVista (mini) benchmark, outperforming the o1 model (71.0 percent). It is also said to score 70.3 percent on the Massive Multi-discipline Multimodal Understanding (MMMU) benchmark.

Despite the improved performance, there are several limitations, as is the case with most experimental models. The Qwen team stated that the AI model occasionally mixes different languages or unexpectedly switches between them, an issue known as code-switching. Additionally, the model is prone to getting caught in recursive reasoning loops, affecting the final output.


Akash Dutta

Akash Dutta is a Senior Sub Editor at Gadgets 360. He is particularly interested in the social impact of techn... more
