OpenAI is adding three voice models to its Realtime API, giving developers tools for live reasoning, speech translation, and streaming transcription, the company said. The first model, GPT-Realtime-2, ...
OpenAI released a new generation of voice models in its API on Wednesday, giving developers tools to build apps that can reason through spoken requests, translate across +70 languages, and transcribe ...
OpenAI said Thursday that its API will now include a number of new voice intelligence features designed to help developers create apps that can talk, transcribe, and translate conversations with users ...
GPT-Realtime-2 brings GPT-5-class reasoning to live voice. A separate translation model covers 70+ input languages. A streaming Whisper variant handles transcription. The pricing is aggressive enough ...
What’s new: OpenAI released three voice AI models with real-time reasoning, translation, and transcription capabilities, aiming to make conversations more interactive and task-oriented. Who’s testing: ...
Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. Birgitta Böckeler, Distinguished Engineer at ...