Google: Gemma 3n 4B

by google

Context 33K tokens
Modalities Text
Input Price $0.02 / million tokens
Output Price $0.04 / million tokens

Overview

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices such as phones, laptops, and tablets. It supports multimodal inputs (text, visual data, and audio), enabling tasks such as text generation, speech recognition, translation, and image analysis. Using Per-Layer Embedding (PLE) caching and the MatFormer architecture, Gemma 3n selectively loads and activates model parameters based on the task or device capabilities, which significantly reduces memory usage and computational load at runtime. The model was trained on over 140 languages and has a 32K-token context window. These properties make it well suited to privacy-focused, offline-capable applications and on-device AI solutions. [Read more in the blog post](https://developers.googleblog.com/en/introducing-gemma-3n/)

Key Features

  • 33K-token context window
  • API access available

Model Information

Developer: google
Release Date: May 20, 2025
Context Window: 33K tokens
Modalities: Text

Pricing

Input Tokens $0.02 / million tokens
Output Tokens $0.04 / million tokens
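
To make the per-token rates concrete, the following is a minimal sketch of a cost estimate that uses only the two prices listed above. The function name `estimate_cost` and the token counts in the usage lines are illustrative assumptions, not part of any provider API, and actual billing may round or meter tokens differently.

```python
# Prices in USD per million tokens, taken from the listing above.
INPUT_PRICE_PER_MILLION = 0.02
OUTPUT_PRICE_PER_MILLION = 0.04


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return an approximate cost in USD for one request or batch.

    Hypothetical helper: it only multiplies token counts by the listed rates.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_PRICE_PER_MILLION
        + output_tokens * OUTPUT_PRICE_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload (assumed numbers): 1M input tokens, 250K output tokens.
    cost = estimate_cost(1_000_000, 250_000)
    print(f"Estimated cost: ${cost:.4f}")
```

At these rates output tokens cost twice as much as input tokens, so workloads that generate long responses are dominated by the output price.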
