OpenAI: GPT-4o Audio
by openai
0 stars
Context
128K tokens
Modalities
Text, Audio → Text
Max Output
16,384
Input Price
$2.50 / million tokens
Output Price
$10.00 / million tokens
Overview
The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input audio tokens.
Key Features
- Multimodal capabilities (Text, Audio → Text)
- 128K tokens context window
- Audio processing capabilities
- Up to 16,384 output tokens
- API access available
Discussion
No comments yet. Be the first to share your thoughts about this model!
Join the discussion. You need to be logged in.