Google: Gemma 3 4B
by google
0 stars
Context
131K tokens
Modalities
Text, Image → Text
Input Price
$0.04 / million tokens
Output Price
$0.08 / million tokens
Overview
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities, including structured outputs and function calling.
Key Features
- Multimodal capabilities (Text, Image → Text)
- 131K tokens context window
- API access available
Discussion
No comments yet. Be the first to share your thoughts about this model!
Join the discussion. You need to be logged in.