Meta: Llama 4 Scout

by meta-llama

Context 1.0M tokens
Modalities Text, Image → Text
Max Output 1,048,576 tokens
Input Price $0.08 / million tokens
Output Price $0.30 / million tokens

Overview

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It accepts native multimodal input (text and image) and produces multilingual text and code output across 12 supported languages. Designed for assistant-style interaction and visual reasoning, Scout uses 16 routed experts per MoE layer, sending each token to a single routed expert alongside a shared expert, and supports a native context length of up to 10 million tokens (served here with a 1.0M-token context window). Its training corpus spans roughly 40 trillion tokens. Built for high efficiency and local or commercial deployment, Llama 4 Scout uses early fusion to integrate text and image modalities in a single backbone. It is instruction-tuned for multilingual chat, captioning, and image-understanding tasks. Released under the Llama 4 Community License, it has a knowledge cutoff of August 2024 and launched publicly on April 5, 2025.
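
To make the expert routing concrete, here is a minimal, illustrative sketch of top-1 MoE routing with an always-on shared expert, the pattern described above. Every name and dimension is invented for illustration; this is a toy of the technique, not Meta's implementation.

```python
# Illustrative toy sketch of MoE top-1 routing with a shared expert.
# All sizes and names are invented; this is not Llama 4's implementation.
import numpy as np

rng = np.random.default_rng(0)
d_model, n_experts = 64, 16          # Scout has 16 routed experts per MoE layer

# One weight matrix per routed expert, plus one always-on shared expert.
routed = [rng.standard_normal((d_model, d_model)) * 0.02 for _ in range(n_experts)]
shared = rng.standard_normal((d_model, d_model)) * 0.02
router = rng.standard_normal((d_model, n_experts)) * 0.02

def moe_layer(x: np.ndarray) -> np.ndarray:
    """Route each token to its top-1 expert; every token also uses the shared expert."""
    logits = x @ router                      # (tokens, n_experts)
    top1 = logits.argmax(axis=-1)            # chosen routed expert per token
    gate = np.exp(logits - logits.max(-1, keepdims=True))
    gate /= gate.sum(-1, keepdims=True)      # softmax gate values
    out = x @ shared                         # shared expert sees every token
    for t, e in enumerate(top1):
        out[t] += gate[t, e] * (x[t] @ routed[e])
    return out

tokens = rng.standard_normal((4, d_model))
print(moe_layer(tokens).shape)               # (4, 64): one of 16 routed experts ran per token
```

The efficiency claim falls out of the routing: each token touches the shared expert and exactly one of the 16 routed experts, so only a fraction of the total parameters (17B of 109B in Scout's case) is active per forward pass.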

Key Features

  • Multimodal capabilities (Text, Image → Text)
  • 1.0M-token context window
  • Up to 1,048,576 output tokens
  • API access available (see the request sketch after this list)
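
This page does not document the endpoint itself, so the sketch below assumes an OpenAI-compatible chat-completions API. The base URL, the API_KEY environment variable, and the model identifier meta-llama/llama-4-scout are assumptions for illustration, not confirmed values; substitute whatever your provider specifies. It shows one text-plus-image request matching the modalities listed above.

```python
# Hedged sketch of a multimodal request. The base URL and model id below are
# assumptions for illustration; substitute the values from your provider.
import os
import requests

BASE_URL = "https://api.example.com/v1"            # assumed OpenAI-compatible endpoint
MODEL_ID = "meta-llama/llama-4-scout"              # assumed model identifier

payload = {
    "model": MODEL_ID,
    "messages": [{
        "role": "user",
        "content": [
            {"type": "text", "text": "Describe this image in one sentence."},
            {"type": "image_url", "image_url": {"url": "https://example.com/cat.jpg"}},
        ],
    }],
    "max_tokens": 512,
}

resp = requests.post(
    f"{BASE_URL}/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['API_KEY']}"},
    json=payload,
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```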

Model Information

Developer: meta-llama
Release Date: April 5, 2025
Context Window: 1.0M tokens
Modalities: Text, Image → Text

Pricing

Input Tokens $0.08 / million tokens
Output Tokens $0.30 / million tokens
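
As a worked example of these rates: a hypothetical request consuming 200,000 input tokens and producing 10,000 output tokens would cost 0.2 × $0.08 + 0.01 × $0.30 = $0.019. A tiny helper using only the prices listed on this page:

```python
# Cost calculator for the rates listed above ($ per million tokens).
INPUT_RATE, OUTPUT_RATE = 0.08, 0.30

def cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the request cost in USD at this page's per-million-token rates."""
    return (input_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE) / 1_000_000

print(cost_usd(200_000, 10_000))  # 0.019
```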