Qwen: Qwen3 8B

by qwen

0 stars
Context 128K tokens
Modalities Text
Max Output 20,000
Input Price $0.04 / million tokens
Output Price $0.14 / million tokens

Overview

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.

Key Features

  • 128K tokens context window
  • Up to 20,000 output tokens
  • API access available

Model Information

Developer:

qwen

Release Date:

April 28, 2025

Context Window:

128K tokens

Modalities:

Text

Pricing

Input Tokens $0.04 / million tokens
Output Tokens $0.14 / million tokens
Get API Key

Discussion

No comments yet. Be the first to share your thoughts about this model!