Episode Details
OpenAI's New Voice AI Models: Real-time Conversations, Translation, Transcription
Description
OpenAI has unveiled three new voice AI models for real-time use in developer apps: GPT-Realtime-two for complex conversations, GPT-Realtime-Translate for live speech translation, and GPT-Realtime-Whisper for instant transcription. All three are part of the Realtime API and are aimed at voice-driven tools. GPT-Realtime-two is priced at $0.32 per million input tokens and $0.64 per million output tokens; GPT-Realtime-Translate costs $0.034 per minute and GPT-Realtime-Whisper $0.017. Developers can test the models now in the Playground or integrate GPT-Realtime-two with Codex.
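To make the pricing concrete, here is a rough cost sketch using only the figures quoted in this description. The function and constant names are hypothetical and not part of any OpenAI SDK, and reading the GPT-Realtime-Whisper figure as a per-minute rate, like the GPT-Realtime-Translate one, is an assumption. Check the official pricing page before budgeting.

```python
# Rates as quoted in the episode description, in USD. Illustrative only.
REALTIME_INPUT_PER_MILLION_TOKENS = 0.32
REALTIME_OUTPUT_PER_MILLION_TOKENS = 0.64
TRANSLATE_PER_MINUTE = 0.034
WHISPER_PER_MINUTE = 0.017  # assumption: treated as a per-minute rate


def conversation_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of a token-billed conversation session."""
    return (
        input_tokens * REALTIME_INPUT_PER_MILLION_TOKENS
        + output_tokens * REALTIME_OUTPUT_PER_MILLION_TOKENS
    ) / 1_000_000


def per_minute_cost(minutes: float, rate_per_minute: float) -> float:
    """Estimate the cost of a minute-billed translation or transcription job."""
    return minutes * rate_per_minute


if __name__ == "__main__":
    print(f"Conversation: ${conversation_cost(500_000, 250_000):.4f}")
    print(f"30 min translation: ${per_minute_cost(30, TRANSLATE_PER_MINUTE):.4f}")
    print(f"30 min transcription: ${per_minute_cost(30, WHISPER_PER_MINUTE):.4f}")
```

Keeping the rates as named constants means a price change only touches one line, and the token-billed and minute-billed models stay in separate functions because they are metered differently.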
Support the show:
Get a discount at https://solipillow.com/discount/dnn.
Advertise on DNN:
advertise@thednn.ai
This is an automated, high-level news summary based on public reporting.
Report issues to feedback@thednn.ai.
View sources & latest updates:
https://sources.thednn.ai/06c7527ad2036e5d