Meta Llama 4 Scout and Maverick AI Models With MoE Architecture Released

Meta has also previewed Llama 4 Behemoth, the largest AI model in the family so far, with 288 billion active parameters.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 7 April 2025 12:13 IST
Highlights
  • The Llama 4 Scout is a 17 billion active parameter model with 16 experts
  • The Maverick model has 17 billion active parameters and 128 experts
  • Llama 4 Behemoth is said to outperform GPT-4.5 and Gemini 2.0 Pro

Both Llama 4 Scout and Maverick are available on Hugging Face and the Llama website

Photo Credit: Meta

Meta introduced the first artificial intelligence (AI) models in the Llama 4 family on Saturday. The Menlo Park-based tech giant released two models — Llama 4 Scout and Llama 4 Maverick — with native multimodal capabilities to the open community. The company says these are the first open models built with Mixture-of-Experts (MoE) architecture. Compared to the predecessor, these come with higher context windows and better power efficiency. Alongside, Meta also previewed Llama 4 Behemoth, the largest AI model in the family unveiled so far.

Meta Llama 4 AI Models Arrive With MoE Architecture

In a blog post, the tech giant detailed its new AI models. Just like the previous Llama models, the Llama 4 Scout and Llama 4 Maverick are open-source AI models and can be downloaded via its Hugging Face listing or the dedicated Llama website. Starting today, users can also experience the Llama 4 AI models in WhatsApp, Messenger, Instagram Direct, and on the Meta.AI website.

The Llama 4 Scout is a 17 billion active parameter model with 16 experts, whereas the Maverick model comes with 17 billion active parameters and 128 experts. Scout is said to be able to run on a single Nvidia H100 GPU. Additionally, the company claimed that the previewed Llama 4 Behemoth outperforms GPT-4.5, Claude Sonnet 3.7, and Gemini 2.0 Pro on several benchmarks. Meta said the Behemoth model, with 288 billion active parameters and 16 experts, was not released as it is still being trained.

Advertisement

The MoE architecture in Llama 4 AI models
Photo Credit: Meta

Advertisement

 

Coming to the architecture, the Llama 4 models are built on an MoE architecture. The MoE architecture activates only a fraction of the total parameters based on the requirement of the initial prompt, which makes it more compute efficient for training and inference. In the pre-training phase, Meta also used new techniques such as early fusion to integrate text and vision tokens simultaneously, and MetaP to set critical model hyper-parameters and initialisation scales.

Advertisement

For post-training, Meta chose to start the process with lightweight supervised fine-tuning (SFT), followed by online reinforcement learning (RL) and lightweight direct preference optimisation (DPO). The sequence was chosen to not over-constrain the model. The researchers also performed SFT on only 50 percent of the “harder” dataset.

Based on internal testing, the company claimed that the Maverick model outperforms Gemini 2.0 Flash, DeepSeek v3.1, and GPT-4o on the MMMU (image reasoning), ChartQA (image understanding), GPQA Diamond (reasoning and knowledge), and MTOB (long context) benchmarks.

Advertisement

On the other hand, the Scout model is said to outperform Gemma 3, Mistral 3.1, and Gemini 2.0 on the MMMU, ChartQA, MMLU (reasoning and knowledge), GPQA Diamond, and MTOB benchmarks.

Meta has also taken steps to make the AI models safer in both the pre-training and post-training processes. In pre-training, the researchers used data filtering methods to ensure harmful data was not added to its knowledge base. In post-training, the researchers added open-source safety tools such as Llama Guard and Prompt Guard to protect the model from external attacks. Additionally, the researchers have also stress-tested the models internally and have allowed red-teaming of the Llama 4 Scout and Maverick models.

Notably, the models are available to the open community with a permissive Llama 4 licence. It allows both academic and commercial usage of the models, however, Meta no longer allows companies with more than 700 million monthly active users to access its AI models.

 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. Oppo Find X9 Series With Hasselblad-Tuned Cameras Launched Globally
  2. iQOO 15 Confirmed to Launch in India on This Date
  3. iPhone 17 Review
  4. Massive Data Breach Leaves 183 Million Email Accounts Exposed: Details
  5. Oppo Find X9 Series Exchange Offers, Benefits Teased Ahead of India Debut
  6. Oppo Find X9 Series Launching Today: All You Need to Know
  7. Battlefield 6's Free-to-Play Battle Royale Mode Launches October 28
  1. Oppo Find X9 Pro Launched With 7,500mAh Battery, 200-Megapixel Telephoto Camera Alongside Find X9: Price, Features
  2. Cat Adventure Game Stray is Reportedly Coming to PS Plus Essential in November
  3. WhatsApp Might Soon Let You Set a Profile Cover Photo, Just Like Facebook and LinkedIn
  4. Coinbase Partners Citi to Boost Stablecoin Adoption Amidst Growing Institutional Interest
  5. Adobe Will Now Let You Edit YouTube Shorts on the Premiere App
  6. Ant Group Registers ‘Antcoin’ Trademark in Hong Kong as China Tightens Crypto Rules
  7. iPhone Air Production Reportedly Remains Unchanged Amidst Speculation of Manufacturing Cuts
  8. Samsung Reportedly Working on Pro Camera Presets With Quick Share Support With One UI 8.5 Update
  9. Adobe Introduces AI Assistant in Photoshop, New AI Audio and Video Tools in Firefly
  10. US Lawmaker Proposes Bill to Ban Elected US Officials From Trading Crypto
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.