OpenAI announces new updates for AI voice agents, new speech-to-speech model

  • OpenAI, the maker of ChatGPT, announced several updates on Thursday for its artificial intelligence voice agents.
  • The Sam Altman-led company said it had updated its Realtime API, which will let developers and other companies create more reliable voice agents. “The API now supports remote MCP servers, image inputs, and phone calling through Session Initiation Protocol (SIP), making voice agents more capable through access to additional tools and context,” OpenAI said in a blog post.
  • The Realtime API was unveiled in public beta in October 2024, and “thousands” of developers have already used it, OpenAI said.
  • OpenAI also said it was unveiling a new speech-to-speech model, known as gpt-realtime. “It’s better at interpreting system messages and developer prompts—whether that’s reading disclaimer scripts word-for-word on a support call, repeating back alphanumerics, or switching seamlessly between languages mid-sentence,” OpenAI added. Included in the release are two new voices, Cedar and Marin.
  • OpenAI is financially backed by Microsoft (NASDAQ:MSFT).
