Meta Llama 4 Scout and Maverick AI Models With MoE Architecture Released

Meta has also previewed Llama 4 Behemoth, the largest AI model in the family so far, with 288 billion active parameters.

Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 7 April 2025 12:13 IST
Highlights
  • The Llama 4 Scout is a 17 billion active parameter model with 16 experts
  • The Maverick model has 17 billion active parameters and 128 experts
  • Llama 4 Behemoth is said to outperform GPT-4.5 and Gemini 2.0 Pro

Both Llama 4 Scout and Maverick are available on Hugging Face and the Llama website

Photo Credit: Meta

Meta introduced the first artificial intelligence (AI) models in the Llama 4 family on Saturday. The Menlo Park-based tech giant released two models, Llama 4 Scout and Llama 4 Maverick, with native multimodal capabilities to the open community. The company says these are its first models built with a Mixture-of-Experts (MoE) architecture. Compared to their predecessors, they offer larger context windows and better compute efficiency. Meta also previewed Llama 4 Behemoth, the largest AI model in the family unveiled so far.

Meta Llama 4 AI Models Arrive With MoE Architecture

In a blog post, the tech giant detailed its new AI models. Like the previous Llama models, Llama 4 Scout and Llama 4 Maverick are open-source AI models and can be downloaded from Meta's Hugging Face listing or the dedicated Llama website. Starting today, users can also try the Llama 4 AI models in WhatsApp, Messenger, Instagram Direct, and on the Meta.AI website.

The Llama 4 Scout is a 17 billion active parameter model with 16 experts, whereas the Maverick model comes with 17 billion active parameters and 128 experts. Scout is said to be able to run on a single Nvidia H100 GPU. Additionally, the company claimed that the previewed Llama 4 Behemoth outperforms GPT-4.5, Claude Sonnet 3.7, and Gemini 2.0 Pro on several benchmarks. Meta said the Behemoth model, with 288 billion active parameters and 16 experts, was not released as it is still being trained.

The MoE architecture in Llama 4 AI models
Photo Credit: Meta


The Llama 4 models are built on an MoE architecture, in which each token activates only a fraction of the model's total parameters. This makes the models more compute-efficient in both training and inference. In the pre-training phase, Meta also used new techniques such as early fusion, which integrates text and vision tokens into a unified model backbone, and MetaP, which is used to set critical model hyper-parameters and initialisation scales.
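To illustrate the idea of activating only a fraction of the parameters, here is a minimal toy sketch of an MoE layer in plain Python. It is not Meta's implementation: the top-1 routing rule, the softmax router, the tiny 4-dimensional token, the random weights, and every function name (`moe_layer`, `make_expert`, `apply_expert`) are assumptions made for illustration. Only the count of 16 experts echoes the Scout figure quoted above.

```python
import math
import random


def softmax(scores):
    """Turn raw router scores into probabilities."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def make_expert(rng, dim):
    """Hypothetical toy expert: a single dim x dim linear map."""
    return [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(dim)]


def apply_expert(weights, token):
    """Multiply the expert's weight matrix by the token vector."""
    return [sum(w * x for w, x in zip(row, token)) for row in weights]


def moe_layer(token, router, experts, top_k=1):
    """Route one token to its top_k experts and mix their outputs.

    Only the chosen experts are evaluated, so the remaining
    experts' parameters stay inactive for this token.
    """
    scores = [sum(w * x for w, x in zip(row, token)) for row in router]
    probs = softmax(scores)
    ranked = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)
    chosen = ranked[:top_k]
    norm = sum(probs[i] for i in chosen)
    output = [0.0] * len(token)
    for i in chosen:
        gate = probs[i] / norm  # renormalised weight of this expert
        expert_out = apply_expert(experts[i], token)
        output = [o + gate * e for o, e in zip(output, expert_out)]
    return output, chosen


rng = random.Random(0)  # fixed seed so the sketch is repeatable
DIM, NUM_EXPERTS = 4, 16
experts = [make_expert(rng, DIM) for _ in range(NUM_EXPERTS)]
router = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in range(NUM_EXPERTS)]

token = [0.5, -1.0, 0.25, 2.0]
output, chosen = moe_layer(token, router, experts, top_k=1)
active_fraction = len(chosen) / NUM_EXPERTS

print("expert(s) used:", chosen)
print("fraction of experts active for this token:", active_fraction)
```

In this sketch, a token touches one of 16 expert weight matrices, so the compute per token is set by the active parameters rather than by the model's total size. That is the general reason MoE models are described by their active parameter count.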


For post-training, Meta chose to start the process with lightweight supervised fine-tuning (SFT), followed by online reinforcement learning (RL) and lightweight direct preference optimisation (DPO). The sequence was chosen to avoid over-constraining the model. The researchers also removed more than 50 percent of the data tagged as “easy” and performed SFT only on the remaining, “harder” set.

Based on internal testing, the company claimed that the Maverick model outperforms Gemini 2.0 Flash, DeepSeek v3.1, and GPT-4o on the MMMU (image reasoning), ChartQA (image understanding), GPQA Diamond (reasoning and knowledge), and MTOB (long context) benchmarks.


The Scout model, meanwhile, is said to outperform Gemma 3, Mistral 3.1, and Gemini 2.0 Flash-Lite on the MMMU, ChartQA, MMLU (reasoning and knowledge), GPQA Diamond, and MTOB benchmarks.

Meta has also taken steps to make the AI models safer in both the pre-training and post-training processes. In pre-training, the researchers used data filtering methods to keep harmful data out of the models' training data. In post-training, the researchers added open-source safety tools such as Llama Guard and Prompt Guard to protect the models from external attacks. The researchers have also stress-tested the models internally and have allowed red-teaming of the Llama 4 Scout and Maverick models.

Notably, the models are available to the open community under a permissive Llama 4 licence. It allows both academic and commercial usage of the models; however, companies with more than 700 million monthly active users must request a separate licence from Meta before using them.

 
