Meta Llama 4 Scout and Maverick AI Models With MoE Architecture Released

Meta has also previewed Llama 4 Behemoth, the largest AI model in the family so far, with 288 billion active parameters.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 7 April 2025 12:13 IST
Highlights
  • The Llama 4 Scout is a 17 billion active parameter model with 16 experts
  • The Maverick model has 17 billion active parameters and 128 experts
  • Llama 4 Behemoth is said to outperform GPT-4.5 and Gemini 2.0 Pro

Both Llama 4 Scout and Maverick are available on Hugging Face and the Llama website

Photo Credit: Meta

Meta introduced the first artificial intelligence (AI) models in the Llama 4 family on Saturday. The Menlo Park-based tech giant released two models — Llama 4 Scout and Llama 4 Maverick — with native multimodal capabilities to the open community. The company says these are the first open models built with Mixture-of-Experts (MoE) architecture. Compared to the predecessor, these come with higher context windows and better power efficiency. Alongside, Meta also previewed Llama 4 Behemoth, the largest AI model in the family unveiled so far.

Meta Llama 4 AI Models Arrive With MoE Architecture

In a blog post, the tech giant detailed its new AI models. Just like the previous Llama models, the Llama 4 Scout and Llama 4 Maverick are open-source AI models and can be downloaded via its Hugging Face listing or the dedicated Llama website. Starting today, users can also experience the Llama 4 AI models in WhatsApp, Messenger, Instagram Direct, and on the Meta.AI website.

Advertisement

The Llama 4 Scout is a 17 billion active parameter model with 16 experts, whereas the Maverick model comes with 17 billion active parameters and 128 experts. Scout is said to be able to run on a single Nvidia H100 GPU. Additionally, the company claimed that the previewed Llama 4 Behemoth outperforms GPT-4.5, Claude Sonnet 3.7, and Gemini 2.0 Pro on several benchmarks. Meta said the Behemoth model, with 288 billion active parameters and 16 experts, was not released as it is still being trained.

The MoE architecture in Llama 4 AI models
Photo Credit: Meta

Advertisement

 

Coming to the architecture, the Llama 4 models are built on an MoE architecture. The MoE architecture activates only a fraction of the total parameters based on the requirement of the initial prompt, which makes it more compute efficient for training and inference. In the pre-training phase, Meta also used new techniques such as early fusion to integrate text and vision tokens simultaneously, and MetaP to set critical model hyper-parameters and initialisation scales.

Advertisement

For post-training, Meta chose to start the process with lightweight supervised fine-tuning (SFT), followed by online reinforcement learning (RL) and lightweight direct preference optimisation (DPO). The sequence was chosen to not over-constrain the model. The researchers also performed SFT on only 50 percent of the “harder” dataset.

Based on internal testing, the company claimed that the Maverick model outperforms Gemini 2.0 Flash, DeepSeek v3.1, and GPT-4o on the MMMU (image reasoning), ChartQA (image understanding), GPQA Diamond (reasoning and knowledge), and MTOB (long context) benchmarks.

Advertisement

On the other hand, the Scout model is said to outperform Gemma 3, Mistral 3.1, and Gemini 2.0 on the MMMU, ChartQA, MMLU (reasoning and knowledge), GPQA Diamond, and MTOB benchmarks.

Meta has also taken steps to make the AI models safer in both the pre-training and post-training processes. In pre-training, the researchers used data filtering methods to ensure harmful data was not added to its knowledge base. In post-training, the researchers added open-source safety tools such as Llama Guard and Prompt Guard to protect the model from external attacks. Additionally, the researchers have also stress-tested the models internally and have allowed red-teaming of the Llama 4 Scout and Maverick models.

Notably, the models are available to the open community with a permissive Llama 4 licence. It allows both academic and commercial usage of the models, however, Meta no longer allows companies with more than 700 million monthly active users to access its AI models.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. iPhone 18 Pro Max Could Fit Existing iPhone 17 Pro Max Cases
  2. Microsoft Surface, Surface Pro Launched With Snapdragon X2 Chips: See Price
  3. Xbox Game Pass Is Adding EA Sports FC 26, Call of Duty: Vanguard and More
  4. The OnePlus 15R Is Now Available in a New 16GB RAM Variant at This Price
  5. OnePlus N6 Confirmed to Launch in India With an 8,000mAh Battery
  1. Scientists Discover Giant Planet Formation Around Supermassive Black Holes
  2. EA Sports FC 26, Call of Duty: Vanguard and More Coming to Xbox Game Pass This Month
  3. Vivo Y500 4G Global Launch Teased; Confirmed to Debut With 8,100mAh Battery
  4. WhatsApp Working on Voice Note Widget for Quick Access via Android Home Screen
  5. Honor X80 Pro Max Teased With 10,000 Nits Display Ahead of June 22 Launch
  6. Binance Defends EU Licence Compliance Following Reports of Possible Rejection
  7. OnePlus 15R Now Available in New 16GB RAM Variant in India With Higher Price Tag: Specifications, Features
  8. Google Extends Android's Parental Controls Beyond Pixel Phones With Android 17
  9. iPhone 18 Pro Max Dummies Hint at Case Compatibility With iPhone 17 Pro Max Despite Thicker Camera Bump
  10. Lenovo Yoga Pro 9n Design Renders, Key Specifications Leaked; Nvidia RTX Spark-Powered Laptop Could Launch Soon
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.