Google: Gemini 1.5 Flash
by google
Overview
Gemini 1.5 Flash is a foundation model that performs well at a variety of multimodal tasks such as visual understanding, classification, summarization, and creating content from image, audio and video. It's adept at processing visual and text inputs such as photographs, documents, infographics, and screenshots. Gemini 1.5 Flash is designed for high-volume, high-frequency tasks where cost and latency matter. On most common tasks, Flash achieves comparable quality to other Gemini Pro models at a significantly reduced cost. Flash is well-suited for applications like chat assistants and on-demand content generation where speed and scale matter. Usage of Gemini is subject to Google's [Gemini Terms of Use](https://ai.google.dev/terms). #multimodal
Key Features
- Multimodal capabilities (Text, Image → Text)
- 1.0M tokens context window
- Up to 8,192 output tokens
- API access available
Discussion
No comments yet. Be the first to share your thoughts about this model!
Join the discussion. You need to be logged in.