Alibaba Researchers Unveil Marco-o1 AI Model As Another Reasoning-Focused Competitor to OpenAI’s o1

Alibaba’s Marco-o1 AI model is available to download and use on Hugging Face.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 2 December 2024 16:19 IST
Highlights
  • Marco-o1 is a distilled version of the Qwen2-7B-Instruct
  • Alibaba’s AI model is fine-tuned using chain-of-thought (CoT) method
  • Alibaba recently released QwQ-32B reasoning-focused AI model

The company says Marco-o1 is optimised for complex real-world problem-solving tasks

Photo Credit: Unsplash/Markus Spiske

Alibaba recently introduced a reasoning-focused artificial intelligence (AI) model dubbed Marco-o1. The model is similar to the QwQ-32B large language model, which is also optimised for tasks requiring advanced reasoning capabilities, however, one important distinction is that the Marco-o1 is a smaller model and is distilled from the Qwen2-7B-Instruct model. The Chinese tech giant claimed that several fine-tuning exercises have been used to make the new model reasoning-focused. Additionally, the researchers highlighted that it is optimised for complex real-world problem-solving tasks.

Alibaba Marco-o1 AI Model

The new AI model is detailed in a research paper published on arXiv, an online pre-print journal. Notably, the papers published in the online journal are not peer-reviewed. Additionally, Alibaba has also hosted the AI model on Hugging Face and has permitted downloading and using it for personal and commercial use cases under the Apache 2.0 licence.

Advertisement

However, it is not fully open-sourced as only the partial dataset has been made available. As such, users will not be able to replicate the model or break it down to analyse the architecture or components.

Coming to Marco-o1, it is fine-tuned from the Qwen2-7B-Instruct foundation model. In the paper, the researchers highlighted that the AI model is powered by chain-of-thought (CoT) fine-tuning, Monte Carlo Tree Search (MCTS), reflection mechanisms, and other reasoning strategies.

Advertisement

As a result, Alibaba's Marco-o1 can solve open-ended questions and find queries to responses “where clear standards are absent and rewards are challenging to quantify.” However, it should be understood that the advanced reasoning abilities have not come from any hardware or architectural advancement.

Instead, all reasoning models today use a technique called test-time compute that lets an AI model spend more processing time on a single query. This allows them to test out different theories to find the solution and fact-check themselves. As a result, these models are geared towards providing more accurate responses and completing complex tasks. One important area where Marco-o1 excels, as per the researchers, is understanding colloquial nuances and translating slang expressions.

Advertisement

One limitation of the AI model, as per the researchers, claimed that while Marco-o1 shows reasoning characteristics, “its performance still falls short of a fully realised” reasoning model.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Claude Is Doubling the Usage Limits for the Next Two Weeks: Details
  2. Samsung Galaxy A37, Galaxy A57 Spied in Leaked Hands-on Videos
  3. OnePlus Nord 6 Series India Launch Teased as New Model Surfaces Online
  4. Huawei Teases an Imminent Return to India With the Launch of This Tablet
  5. Best Colour Printers for Home Use in India From Top Brands
  1. Arc Raiders' AI Voice Lines Were Re-Recorded by Human Actors After Launch, Says Embark CEO
  2. Apple's iPhone 19e Said to Launch in 2028 With Upgraded LPTO OLED Display
  3. WLFI Governance Vote Passes Proposal Introducing Token Lock-Up Incentives
  4. Xiaomi Book Pro 14, Xiaomi Watch S5 China Launch Date Announced; Key Features Teased
  5. Realme C100 5G Listed on Retail Website With 6.8-Inch Display and 7,000mAh Battery
  6. Anthropic Doubles Claude’s Usage Limits for the Next Two Weeks: Details
  7. Australian Lawmakers Advance New Bill to Regulate Crypto Platforms
  8. Poco X8 Pro, Poco X8 Pro Max Camera Configuration and Display Features Revealed
  9. JBL Grip Portable Speaker With AI Sound Boost, Up to 12 Hours Battery Life Launched in India: Price, Features
  10. Samsung Begins Testing One UI 9 Beta for Galaxy S26 Ultra Ahead of Android 17 Release: Report
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.