OpenAI: GPT-4o Audio

by openai

0 stars
Context 128K tokens
Modalities Text, Audio → Text
Max Output 16,384
Input Price $2.50 / million tokens
Output Price $10.00 / million tokens

Overview

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input audio tokens.

Key Features

  • Multimodal capabilities (Text, Audio → Text)
  • 128K tokens context window
  • Audio processing capabilities
  • Up to 16,384 output tokens
  • API access available

Model Information

Developer:

openai

Release Date:

August 15, 2025

Context Window:

128K tokens

Modalities:

Text, Audio → Text

Content Moderation:

Enabled

Pricing

Input Tokens $2.50 / million tokens
Output Tokens $10.00 / million tokens
Get API Key

Discussion

No comments yet. Be the first to share your thoughts about this model!