MSFT: OpenAI launches real-time audio...

OpenAI launched three real-time audio models through its new Realtime API. The suite includes GPT-Realtime-2 for voice reasoning, GPT-Realtime-Translate for over 70 languages, and GPT-Realtime-Whisper for streaming transcription.

The release enables developers to build sophisticated applications that reason and communicate in real time. These models handle complex tasks and longer conversations to compete with voice services from major technology companies.

Early adopters like Zillow report significant improvements in call success rates using the technology.

Related News

Anthropic and OpenAI capture 89% of revenue, as Anthropic overtakes OpenAI

Activision settles $250M lawsuit, over rushed Microsoft deal and harassment scandals

Microsoft Loses CMO Yusuf Mehdi, Amid AI Leadership Reshuffle

Microsoft Scraps Traditional Leadership Team, Accelerating AI Development

Microsoft and EY Invest $1B in AI, Scaling Copilot to 400,000 Staff