Category
Multimodal Models
The strongest multimodal models — those that can take image, audio, or video in alongside text.
2 models
← All categories
Top Multimodal models
| Rank | Model | Pricing |
|---|---|---|
| 29 | Gemini Pro 2 Google DeepMind | $1.50/1M input |
| 6 | GPT-5.5 OpenAI | $12.50/1M input |