Hello Abdul Rehman,
Welcome to the Microsoft Q&A and thank you for posting your questions here.
I understand that gpt-realtime-2 on Azure OpenAI is intermittently stopping audio output mid-response, while the same application, prompt, and PCM16 24 kHz mono audio setup works correctly against OpenAI’s direct Realtime API.
In addition to @SRILAKSHMI C's response:
The most reliable fix is to run the Azure Realtime integration on the GA /openai/v1 Realtime endpoint, pass the Azure deployment name as the model value, and use WebRTC for live user-facing audio. Then reproduce the issue while logging all Realtime events. If Azure stops emitting response.audio.delta and either sends response.done as completed with missing audio or never sends response.audio.done / response.done while the connection remains open, open a Microsoft Azure support case with the trace and correlation IDs because that indicates an Azure service-side streaming defect rather than a normal client implementation issue.
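As a rough illustration of the checks described above, here is a minimal Python sketch. It assumes you have already captured the Realtime events for one response as a list of dictionaries (for example, by logging every JSON message from the connection). The helper names `realtime_ws_url` and `classify_trace` are hypothetical, the WebSocket URL shape is an assumption based on the /openai/v1 path and the deployment-name-as-model guidance, and the `response.output_audio.*` spellings are an assumed alternative accepted alongside the event names used in this answer. Please verify all of these against the API reference for your API version.

```python
from urllib.parse import urlencode

# Audio event names. The first name in each set is the one used in this answer;
# the second is an assumed alternative spelling, accepted so the check does not
# depend on which naming your API version emits.
AUDIO_DELTA = {"response.audio.delta", "response.output_audio.delta"}
AUDIO_DONE = {"response.audio.done", "response.output_audio.done"}


def realtime_ws_url(resource_host: str, deployment: str) -> str:
    """Build a WebSocket URL on the /openai/v1 Realtime path (assumed shape).

    The Azure deployment name is passed as the `model` query parameter.
    """
    return f"wss://{resource_host}/openai/v1/realtime?" + urlencode({"model": deployment})


def classify_trace(events: list[dict]) -> str:
    """Classify the logged events of a single response.

    Returns one of:
      "stalled"                 - response.done never arrived
      "completed_without_audio" - response.done says completed, but no audio
                                  deltas or no audio-done event were seen
      "ok"                      - completed with audio deltas and audio-done
      "ended:<status>"          - response.done arrived with another status
    """
    types = [e.get("type") for e in events]
    delta_count = sum(t in AUDIO_DELTA for t in types)
    saw_audio_done = any(t in AUDIO_DONE for t in types)

    done = next((e for e in events if e.get("type") == "response.done"), None)
    if done is None:
        return "stalled"

    status = done.get("response", {}).get("status")
    if status != "completed":
        return f"ended:{status}"
    if delta_count == 0 or not saw_audio_done:
        return "completed_without_audio"
    return "ok"
```

A "stalled" or "completed_without_audio" result on a connection that stayed open is the pattern worth attaching to the support case. Note that this sketch only looks at event types, so it cannot detect audio that was cut short when the audio-done event was still sent; for that you would also need to compare the received audio duration against the transcript.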
See the following resources for further reading and implementation steps:
- Use the GPT Realtime API for speech and audio with Azure OpenAI
- Use the GPT Realtime API via WebSockets
- Use the GPT Realtime API via WebRTC
- Azure OpenAI Realtime API reference
- Microsoft Foundry model catalog — gpt-realtime-2
I hope this is helpful. Please do not hesitate to let me know if you have any other questions or need further steps or clarification.
Please do not forget to close the thread by upvoting and accepting the answer if any part of it is helpful.