OpenAI launched three real-time audio models through its new Realtime API. The suite includes GPT-Realtime-2 for voice reasoning, GPT-Realtime-Translate for translation across more than 70 languages, and GPT-Realtime-Whisper for streaming transcription.
The release lets developers build voice applications that reason and respond in real time. The models handle complex tasks and longer conversations, positioning OpenAI to compete with voice services from major technology companies.
Early adopters such as Zillow report significant improvements in call success rates with the technology.