OpenAI is adding three voice models to its Realtime API, giving developers tools for live reasoning, speech translation, and streaming transcription, the company said. The first model, GPT-Realtime-2, ...
OpenAI released a new generation of voice models in its API on Wednesday, giving developers tools to build apps that can reason through spoken requests, translate across +70 languages, and transcribe ...
OpenAI said Thursday that its API will now include a number of new voice intelligence features designed to help developers create apps that can talk, transcribe, and translate conversations with users ...
GPT-Realtime-2 brings GPT-5-class reasoning to live voice. A separate translation model covers 70+ input languages. A streaming Whisper variant handles transcription. The pricing is aggressive enough ...
What’s new: OpenAI released three voice AI models with real-time reasoning, translation, and transcription capabilities, aiming to make conversations more interactive and task-oriented. Who’s testing: ...
Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. Birgitta Böckeler, Distinguished Engineer at ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results