Qwen: Qwen VL Plus
by qwen
0 stars
Context
8K tokens
Modalities
Text, Image → Text
Max Output
1,500
Input Price
$0.21 / million tokens
Output Price
$0.63 / million tokens
Overview
Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pixels and extreme aspect ratios for image input. It delivers significant performance across a broad range of visual tasks.
Key Features
- Multimodal capabilities (Text, Image → Text)
- 8K tokens context window
- Up to 1,500 output tokens
- API access available
Discussion
No comments yet. Be the first to share your thoughts about this model!
Join the discussion. You need to be logged in.