Meta Llama 4 Scout and Maverick AI Models With MoE Architecture Released

Meta has also previewed Llama 4 Behemoth, the largest AI model in the family so far, with 288 billion active parameters.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 7 April 2025 12:13 IST
Highlights
  • The Llama 4 Scout is a 17 billion active parameter model with 16 experts
  • The Maverick model has 17 billion active parameters and 128 experts
  • Llama 4 Behemoth is said to outperform GPT-4.5 and Gemini 2.0 Pro

Both Llama 4 Scout and Maverick are available on Hugging Face and the Llama website

Photo Credit: Meta

Meta introduced the first artificial intelligence (AI) models in the Llama 4 family on Saturday. The Menlo Park-based tech giant released two models — Llama 4 Scout and Llama 4 Maverick — with native multimodal capabilities to the open community. The company says these are the first open models built with Mixture-of-Experts (MoE) architecture. Compared to the predecessor, these come with higher context windows and better power efficiency. Alongside, Meta also previewed Llama 4 Behemoth, the largest AI model in the family unveiled so far.

Meta Llama 4 AI Models Arrive With MoE Architecture

In a blog post, the tech giant detailed its new AI models. Just like the previous Llama models, the Llama 4 Scout and Llama 4 Maverick are open-source AI models and can be downloaded via its Hugging Face listing or the dedicated Llama website. Starting today, users can also experience the Llama 4 AI models in WhatsApp, Messenger, Instagram Direct, and on the Meta.AI website.

The Llama 4 Scout is a 17 billion active parameter model with 16 experts, whereas the Maverick model comes with 17 billion active parameters and 128 experts. Scout is said to be able to run on a single Nvidia H100 GPU. Additionally, the company claimed that the previewed Llama 4 Behemoth outperforms GPT-4.5, Claude Sonnet 3.7, and Gemini 2.0 Pro on several benchmarks. Meta said the Behemoth model, with 288 billion active parameters and 16 experts, was not released as it is still being trained.

Advertisement

The MoE architecture in Llama 4 AI models
Photo Credit: Meta

Advertisement

 

Coming to the architecture, the Llama 4 models are built on an MoE architecture. The MoE architecture activates only a fraction of the total parameters based on the requirement of the initial prompt, which makes it more compute efficient for training and inference. In the pre-training phase, Meta also used new techniques such as early fusion to integrate text and vision tokens simultaneously, and MetaP to set critical model hyper-parameters and initialisation scales.

Advertisement

For post-training, Meta chose to start the process with lightweight supervised fine-tuning (SFT), followed by online reinforcement learning (RL) and lightweight direct preference optimisation (DPO). The sequence was chosen to not over-constrain the model. The researchers also performed SFT on only 50 percent of the “harder” dataset.

Based on internal testing, the company claimed that the Maverick model outperforms Gemini 2.0 Flash, DeepSeek v3.1, and GPT-4o on the MMMU (image reasoning), ChartQA (image understanding), GPQA Diamond (reasoning and knowledge), and MTOB (long context) benchmarks.

Advertisement

On the other hand, the Scout model is said to outperform Gemma 3, Mistral 3.1, and Gemini 2.0 on the MMMU, ChartQA, MMLU (reasoning and knowledge), GPQA Diamond, and MTOB benchmarks.

Meta has also taken steps to make the AI models safer in both the pre-training and post-training processes. In pre-training, the researchers used data filtering methods to ensure harmful data was not added to its knowledge base. In post-training, the researchers added open-source safety tools such as Llama Guard and Prompt Guard to protect the model from external attacks. Additionally, the researchers have also stress-tested the models internally and have allowed red-teaming of the Llama 4 Scout and Maverick models.

Notably, the models are available to the open community with a permissive Llama 4 licence. It allows both academic and commercial usage of the models, however, Meta no longer allows companies with more than 700 million monthly active users to access its AI models.

 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. iPhone 17 Price: US vs UAE vs India - Where Is It Cheapest to Buy?
  2. Pixel 9 for Under Rs. 36,000? Flipkart's Big Billion Days Deal Revealed
  3. Apple Launches iPhone 17 Pro, 17 Pro Max With These Massive Upgrades
  4. All the Key Differences Between iPhone 17 and iPhone 17 Pro
  5. Apple Watch Series 11, Ultra 3, SE Launched With These Health Features
  6. Who Is Abidur Chowdhury, the Designer Who Introduced the iPhone Air?
  7. Apple Discontinues These iPhone Models After iPhone 17 Launch
  8. iPhone 17 Price in India: See the Full Price List for All New Devices
  9. This Is When iOS 26, watchOS 26 Will Be Released to Eligible Devices
  10. iPhone 17 Models Support Faster Charging With Apple's New Dynamic Adapter
  1. Apple Introduces Memory Integrity Enforcement to Protect iPhone 17 Series from Sophisticated Malware Attacks
  2. Apple's iOS 26 RC Update Adds Icon Tinting Feature to Match Your iPhone or MagSafe Case
  3. iPhone 17 Models Support Faster Wired Charging With Apple’s New Dynamic Adapter
  4. Nothing OS 4.0 With Android 16 Confirmed to Launch Soon, Design Teased Ahead of Rollout
  5. Google AI Plus Subscription Plan Launched With Affordable Pricing, Access to Veo 3 Fast
  6. GTA 6 Delay Led to Celebrations at Sucker Punch, Ghost of Yotei Director Says
  7. OnePlus 15 Tipped to Launch in Three Colour Options Ahead of Anticipated Debut
  8. Apple's AirPods Pro 3 Has a Live Translation Feature That Will Come to the AirPods Pro 2, AirPods 4
  9. iPhone 17 vs iPhone 16: Here Is a Quick Comparison of Advertised Video Playback Times
  10. The New AirPods Lineup for 2025: AirPods Pro 3 Arrives, Pro 2 Departs
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.