{"id":36688,"date":"2024-10-01T13:00:00","date_gmt":"2024-10-01T20:00:00","guid":{"rendered":"https:\/\/azure.microsoft.com\/en-us\/blog\/?p=36688"},"modified":"2025-12-26T09:04:37","modified_gmt":"2025-12-26T17:04:37","slug":"announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities","status":"publish","type":"post","link":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/","title":{"rendered":"Announcing new products and features for Azure OpenAI Service including GPT-4o-Realtime-Preview with audio and speech capabilities"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">We are thrilled to announce the public preview of GPT-4o-Realtime-Preview for audio and speech, a major enhancement to <a href=\"https:\/\/azure.microsoft.com\/en-us\/products\/ai-services\/openai-service?msockid=36d156f74ff96d723016422b4e966cdb\">Microsoft Azure OpenAI Service<\/a> that adds advanced voice capabilities and expands GPT-4o&#8217;s multimodal offerings. This milestone further solidifies Azure&#8217;s leadership in AI, especially in the realm of speech technology. 
Azure\u2019s legacy in this space has been long-established through its speech service, which historically integrated speech-to-text, text-to-speech, neural voices, and real-time translation across core Microsoft products like Teams, Office 365, and Edge.<\/p>\n\n\n\n<aside class=\"cta-block cta-block--align-left cta-block--has-image wp-block-msx-cta\" data-bi-an=\"CTA Block\">\n\t<div class=\"cta-block__content\">\n\t\t\t\t\t<div class=\"cta-block__image-container\">\n\t\t\t\t<img loading=\"lazy\" decoding=\"async\" width=\"850\" height=\"478\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/09\/img-3.png\" class=\"cta-block__image\" alt=\"\" srcset=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/09\/img-3.webp 850w, https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/09\/img-3-300x169.webp 300w, https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/09\/img-3-768x432.webp 768w\" sizes=\"auto, (max-width: 850px) 100vw, 850px\" \/>\t\t\t<\/div>\n\t\t\n\t\t<div class=\"cta-block__body\">\n\t\t\t<h2 class=\"cta-block__headline\">Azure OpenAI Service<\/h2>\n\t\t\t<p class=\"cta-block__text\">Build your own copilot and generative AI applications.<\/p>\n\t\t\t\t\t\t\t<div class=\"cta-block__actions\">\n\t\t\t\t\t<a\n\t\t\t\t\t\thref=\"https:\/\/azure.microsoft.com\/en-us\/products\/ai-services\/openai-service\"\n\t\t\t\t\t\tclass=\"btn cta-block__link btn-link\"\n\t\t\t\t\t\t\t\t\t\t\t>\n\t\t\t\t\t\tFind your AI solution\t\t\t\t\t<\/a>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t<\/div>\n<\/aside>\n\n\n\n<p class=\"wp-block-paragraph\">Now, GPT-4o-Realtime-Preview pushes the boundaries even further by integrating language generation with seamless voice interaction, giving developers the tools they need to craft more natural and conversational AI experiences. 
From creating virtual assistants to powering real-time customer support, this new model opens a vast array of possibilities for voice-driven applications. The new model is also integrated with Copilot, as part of the newly announced <a href=\"https:\/\/blogs.microsoft.com\/blog\/2024\/10\/01\/an-ai-companion-for-everyone\/\" target=\"_blank\" rel=\"noreferrer noopener\">Copilot Voice product<\/a>.<\/p>\n\n\n\n<div class=\"wp-block-buttons is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-a89b3969 wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link has-text-align-center wp-element-button\" href=\"https:\/\/youtu.be\/n4R1LWvqa1k\" target=\"_blank\" rel=\"noreferrer noopener\">Watch a demo of the speech and audio capabilities on GPT-4o Realtime<\/a><\/div>\n<\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"building-on-recent-azure-openai-announcements\">Building on recent Azure OpenAI announcements&nbsp;<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">This announcement continues a <a href=\"https:\/\/learn.microsoft.com\/en-us\/azure\/ai-services\/openai\/whats-new\" target=\"_blank\" rel=\"noreferrer noopener\">series of significant updates<\/a> within Azure OpenAI Service, including:&nbsp;<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/introducing-o1-openais-new-reasoning-model-series-for-developers-and-enterprises-on-azure\/\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>o1 series<\/strong><\/a>: A new lineup of models designed for advanced reasoning over complex data. 
We are happy to make the API available to our developers on Azure today after a two-week preview in the Azure AI Studio Playground.&nbsp;<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/enterprise-trust-in-azure-openai-service-strengthened-with-data-zones\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>Data zones<\/strong><\/a>: Enabling regional data residency to support customer privacy and compliance.&nbsp;<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/techcommunity.microsoft.com\/t5\/ai-azure-ai-services-blog\/announcing-global-provisioned-managed-deployments-for-scaling\/ba-p\/4249224\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>Expanded provisioned deployments<\/strong><\/a>: Extending availability to a global SKU for customers needing dedicated capacity.&nbsp;<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-fine-tuning-for-customization-and-support-for-new-models-in-azure-ai\/\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>General availability of fine-tuning<\/strong><\/a>: Allowing GPT-4o and GPT-4o mini models to be tailored for specialized use cases.&nbsp;<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/blogs.microsoft.com\/blog\/2024\/09\/24\/microsoft-trustworthy-ai-unlocking-human-potential-starts-with-trust\/\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>Trustworthy AI<\/strong><\/a>: New tooling, including evaluations in Azure AI Studio to support proactive risk assessments, and watermarking on images generated by DALL-E.&nbsp;<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Prompt caching<\/strong> (coming soon): Cheaper and faster inference through caching on GPT-4o and o1 
models.&nbsp;<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">This continuous evolution demonstrates Azure\u2019s commitment to providing the most comprehensive, secure, and versatile AI tools to customers worldwide. <a href=\"https:\/\/aka.ms\/aoai-newsfeed\" target=\"_blank\" rel=\"noreferrer noopener\">Bookmark our newsfeed<\/a> to track all future announcements.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"what-s-new-in-gpt-4o-realtime-preview\">What\u2019s new in GPT-4o-Realtime-Preview?&nbsp;<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>GPT-4o-Realtime API<\/strong>: With this release, GPT-4o evolves to support audio input and output, enabling real-time, natural voice-based interactions that go beyond traditional text-based AI conversations. This multimodal capability empowers developers to build innovative voice applications with ease.&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Azure AI Studio Early Access playground<\/strong>: For developers eager to explore, this dedicated space allows early experimentation with GPT-4o-Realtime API for Audio capabilities. 
The studio provides an environment to test, fine-tune, and optimize voice interactions before launching them into production environments.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"performance-that-speaks-for-itself\">Performance that speaks for itself&nbsp;<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Early customers using GPT-4o-Realtime API for Audio shared remarkable results, confirming its performance and impact:&nbsp;<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Faster responses<\/strong>: GPT-4o-Realtime API for Audio provides voice responses significantly faster than many traditional text-to-speech engines, leading to reduced latency and smoother interactions.&nbsp;<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Natural conversations<\/strong>: The model minimizes the robotic tone often associated with AI-generated speech, making conversations sound more engaging.&nbsp;<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Multilingual support<\/strong>: The API supports a wide range of languages, allowing for natural, multilingual conversations that can be applied to global-facing applications.&nbsp;<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"applications-of-gpt-4o-realtime-preview-in-azure-openai-service\">Applications of GPT-4o-Realtime-Preview in Azure OpenAI Service&nbsp;<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">The potential of GPT-4o-Realtime-Preview spans across various industries, transforming how businesses operate and how users interact with technology:&nbsp;<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Customer service<\/strong>: Voice-based chatbots and virtual assistants can now handle customer inquiries more naturally and efficiently, reducing wait times and improving overall satisfaction.&nbsp;<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li 
class=\"wp-block-list-item\"><strong>Content creation<\/strong>: Media producers can revolutionize their workflows by leveraging speech generation for use in video games, podcasts, and film studios.&nbsp;<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Real-time translation<\/strong>: Industries such as healthcare and legal services can benefit from real-time audio translation, breaking down language barriers and fostering better communication in critical contexts.&nbsp;<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"use-cases-driving-innovation\">Use cases driving innovation&nbsp;<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">The versatility of GPT-4o-Realtime-Preview is already transforming operations across a variety of sectors. Here are a few early adopters and how they\u2019re benefiting from this technology:&nbsp;<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong><a href=\"https:\/\/www.bosch.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">Bosch<\/a><\/strong> <strong>(Germany)<\/strong>: Integrating GPT-4o-Realtime API for Audio for virtual reality training in automotive settings, allowing consumers and technicians to receive voice-guided instructions.<\/li>\n<\/ul>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-large-font-size wp-block-paragraph\"><em>\u201cAOAI is an ideal interface for our\u202fHeyBosch &#8211; Virtual Sales Executive Solution\u202fas it is a conversation first solution. We can easily integrate AOAI to our existing solution \u2013 Thanks for the reference samples. The response time from the virtual agent has improved substantially as we now have a single interface coupling both (speech and LLM). 
This helps in keeping latency minimal. This integration shows the art of possibility of creating compelling user experiences combining GenAI, 3D tech and real time speech processing capabilities.\u201d<\/em>\u2014<em>Vamsidhar Sunkari, Senior Expert, Bosch Global Software Technologies Pvt Ltd.<\/em>&nbsp;<\/p>\n<\/blockquote>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong><a href=\"https:\/\/www.lyrebirdhealth.com\/us\" target=\"_blank\" rel=\"noreferrer noopener\">Lyrebird Health<\/a> (Australia)<\/strong>: Using GPT-4o-Realtime-Preview as a medical copilot, summarizing patient information and automating follow-up tasks in real time.<\/li>\n<\/ul>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-large-font-size wp-block-paragraph\">&#8220;<em>Lyrebird Health is excited to bring audio capabilities to the provider\/patient relationship. The new GPT-4o-realtime-preview model will allow us to experiment and launch new experiences for our customers and end users. This will help us on our mission to provide the best people technology on the planet.&#8221;\u2014Kai Van Lieshout, Co-founder and CEO of Lyrebird Health<\/em><\/p>\n<\/blockquote>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Azure AI Search<\/strong>: VoiceRAG leverages Azure OpenAI&#8217;s GPT-4o real-time audio model and Azure AI Search to create an advanced voice-based generative AI application with Retrieval-Augmented Generation (RAG). The system integrates real-time audio streaming and function calling to perform knowledge base searches, ensuring responses are well-grounded without compromising latency. By securely handling model configurations and retrieval processes on the backend, VoiceRAG provides a natural, conversational interface that includes citations seamlessly displayed in the user experience. 
Take a deep dive into the VoiceRAG experience in a <a href=\"https:\/\/aka.ms\/voicerag\" target=\"_blank\" rel=\"noreferrer noopener\">dedicated blog on Microsoft Tech Community<\/a>.<\/li>\n<\/ul>\n\n\n\n<div class=\"wp-block-buttons is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-a89b3969 wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link has-text-align-center wp-element-button\" href=\"https:\/\/youtu.be\/vXJka8xZ9Ko\" target=\"_blank\" rel=\"noreferrer noopener\">See RAG leverage GPT-4o Realtime audio in this demo<\/a><\/div>\n<\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"our-commitment-to-trustworthy-ai\">Our commitment to Trustworthy AI&nbsp;<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/blogs.microsoft.com\/blog\/2024\/09\/24\/microsoft-trustworthy-ai-unlocking-human-potential-starts-with-trust\/\" target=\"_blank\" rel=\"noreferrer noopener\">Azure remains steadfast in its commitment to responsible AI<\/a>, with safety and privacy as default priorities. The Realtime API utilizes multiple layers of safety measures, including automated monitoring and human review, to prevent misuse.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The Realtime API has undergone rigorous evaluations guided by our commitments to Responsible AI. 
Check out the <a href=\"https:\/\/www.microsoft.com\/en-us\/corporate-responsibility\/responsible-ai-transparency-report?msockid=3b8626837d9d6e763c69323e7c286f90\" target=\"_blank\" rel=\"noreferrer noopener\">2024 Responsible AI Transparency Report<\/a>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Azure OpenAI Service provides built-in Content Safety features at no extra cost, and Azure AI Studio offers tools to assess the safety of your AI applications, ensuring a secure and responsible AI experience.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"what-s-next-with-gpt-4o-realtime-api-for-audio\">What\u2019s next with GPT-4o-Realtime API for Audio?<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">As we continue to innovate and expand the capabilities of GPT-4o-Realtime API for Audio, we are excited to see how developers and businesses will leverage this cutting-edge technology to create voice-driven applications that push the boundaries of what\u2019s possible.&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Whether you\u2019re looking to integrate voice capabilities into your customer service operations or explore the possibilities of multilingual interactions, GPT-4o-Realtime API for Audio provides the flexibility and power to transform your AI solutions. 
Starting today, you can explore these new capabilities in the <a href=\"https:\/\/ai.azure.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">Azure OpenAI Studio<\/a>, experiment with them in the Early Access Playground, or directly integrate the Realtime API, now in public preview, into your applications.&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Be sure to review our documentation for the latest updates, dive into the available use cases, and start building with GPT-4o-Realtime API for Audio to bring your business to the next level of AI innovation.&nbsp;<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Stay tuned for upcoming customer stories, detailed use case demos, and more as we continue to roll out updates in the weeks ahead!&nbsp;<\/p>\n\n\n\n<div class=\"wp-block-buttons is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-a89b3969 wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link wp-element-button\" href=\"https:\/\/azure.microsoft.com\/en-us\/products\/ai-services\/openai-service\">Build custom generative AI apps with Azure OpenAI Service<\/a><\/div>\n<\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n","protected":false},"excerpt":{"rendered":"<p>We are thrilled to announce the public preview of GPT-4o-Realtime-Preview for audio and speech, a major enhancement to Microsoft Azure OpenAI Service that adds advanced voice capabilities and expands GPT-4o&#8217;s multimodal 
offerings.<\/p>\n","protected":false},"author":39,"featured_media":36697,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"ms_queue_id":[],"ep_exclude_from_search":false,"_classifai_error":"","_classifai_text_to_speech_error":"","_alt_title":"","footnotes":"","msx_community_cta_settings":[]},"categories":[1454],"tags":[2671],"audience":[3072],"content-type":[1465],"product":[1803,2758,1795],"tech-community":[],"topic":[],"coauthors":[3091],"class_list":["post-36688","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-machine-learning","tag-ai","audience-ai-professionals","content-type-announcements","product-azure-ai","product-azure-ai-studio","product-azure-openai","review-flag-1680286581-295","review-flag-gener-1680286584-335","review-flag-integ-1680286579-214","review-flag-lever-1680286579-649","review-flag-new-1680286579-546","review-flag-publi-1680286584-566"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Announcing new products and features for Azure OpenAI Service including GPT-4o-Realtime-Preview with audio and speech capabilities | Microsoft Azure Blog<\/title>\n<meta name=\"description\" content=\"Announcing GPT-4o Audio API, an enhancement to the Azure OpenAI Service that adds advanced voice capabilities and expands GPT-4o&#039;s multimodal offerings.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Announcing new products and features for 
Azure OpenAI Service including GPT-4o-Realtime-Preview with audio and speech capabilities | Microsoft Azure Blog\" \/>\n<meta property=\"og:description\" content=\"Announcing GPT-4o Audio API, an enhancement to the Azure OpenAI Service that adds advanced voice capabilities and expands GPT-4o&#039;s multimodal offerings.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/\" \/>\n<meta property=\"og:site_name\" content=\"Microsoft Azure Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/microsoftazure\" \/>\n<meta property=\"article:published_time\" content=\"2024-10-01T20:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-12-26T17:04:37+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/10\/Azure_Hero_Wave_Magenta_MagentaGrad.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1260\" \/>\n\t<meta property=\"og:image:height\" content=\"708\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Steve Sweetman\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@azure\" \/>\n<meta name=\"twitter:site\" content=\"@azure\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Steve Sweetman\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/\"},\"author\":[{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/steve-sweetman\/\",\"@type\":\"Person\",\"@name\":\"Steve Sweetman\"}],\"headline\":\"Announcing new products and features for Azure OpenAI Service including GPT-4o-Realtime-Preview with audio and speech capabilities\",\"datePublished\":\"2024-10-01T20:00:00+00:00\",\"dateModified\":\"2025-12-26T17:04:37+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/\"},\"wordCount\":1250,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/10\/Azure_Hero_Wave_Magenta_MagentaGrad.webp\",\"keywords\":[\"AI\"],\"articleSection\":[\"AI + machine 
learning\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/\",\"name\":\"Announcing new products and features for Azure OpenAI Service including GPT-4o-Realtime-Preview with audio and speech capabilities | Microsoft Azure Blog\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/10\/Azure_Hero_Wave_Magenta_MagentaGrad.webp\",\"datePublished\":\"2024-10-01T20:00:00+00:00\",\"dateModified\":\"2025-12-26T17:04:37+00:00\",\"description\":\"Announcing GPT-4o Audio API, an enhancement to the Azure OpenAI Service that adds advanced voice capabilities and expands GPT-4o's multimodal 
offerings.\",\"breadcrumb\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#primaryimage\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/10\/Azure_Hero_Wave_Magenta_MagentaGrad.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/10\/Azure_Hero_Wave_Magenta_MagentaGrad.webp\",\"width\":1260,\"height\":708,\"caption\":\"background pattern\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blog home\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"AI + machine learning\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/ai-machine-learning\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Announcing new products and features for Azure OpenAI Service including GPT-4o-Realtime-Preview with audio and speech capabilities\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"name\":\"Microsoft Azure 
Blog\",\"description\":\"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.\",\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\",\"name\":\"Microsoft Azure Blog\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"width\":512,\"height\":512,\"caption\":\"Microsoft Azure Blog\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/microsoftazure\",\"https:\/\/x.com\/azure\",\"https:\/\/www.instagram.com\/microsoftdeveloper\/\",\"https:\/\/www.linkedin.com\/company\/16188386\",\"https:\/\/www.youtube.com\/user\/windowsazure\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/dddfb06db704f28e44dc633b15e0d6ae\",\"name\":\"Brianna 
McGovern\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/secure.gravatar.com\/avatar\/471211b4d059ccb73aa3fda768b31973fb946424996c0376f7f0be3cb919d469?s=96&d=mm&r=g5fc6a76f72449f78acaf535ec3e0c54f\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/471211b4d059ccb73aa3fda768b31973fb946424996c0376f7f0be3cb919d469?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/471211b4d059ccb73aa3fda768b31973fb946424996c0376f7f0be3cb919d469?s=96&d=mm&r=g\",\"caption\":\"Brianna McGovern\"},\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/briannamcgovern\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Announcing new products and features for Azure OpenAI Service including GPT-4o-Realtime-Preview with audio and speech capabilities | Microsoft Azure Blog","description":"Announcing GPT-4o Audio API, an enhancement to the Azure OpenAI Service that adds advanced voice capabilities and expands GPT-4o's multimodal offerings.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/","og_locale":"en_US","og_type":"article","og_title":"Announcing new products and features for Azure OpenAI Service including GPT-4o-Realtime-Preview with audio and speech capabilities | Microsoft Azure Blog","og_description":"Announcing GPT-4o Audio API, an enhancement to the Azure OpenAI Service that adds advanced voice capabilities and expands GPT-4o's multimodal offerings.","og_url":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/","og_site_name":"Microsoft Azure 
Blog","article_publisher":"https:\/\/www.facebook.com\/microsoftazure","article_published_time":"2024-10-01T20:00:00+00:00","article_modified_time":"2025-12-26T17:04:37+00:00","og_image":[{"width":1260,"height":708,"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/10\/Azure_Hero_Wave_Magenta_MagentaGrad.png","type":"image\/png"}],"author":"Steve Sweetman","twitter_card":"summary_large_image","twitter_creator":"@azure","twitter_site":"@azure","twitter_misc":{"Written by":"Steve Sweetman","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#article","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/"},"author":[{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/steve-sweetman\/","@type":"Person","@name":"Steve Sweetman"}],"headline":"Announcing new products and features for Azure OpenAI Service including GPT-4o-Realtime-Preview with audio and speech 
capabilities","datePublished":"2024-10-01T20:00:00+00:00","dateModified":"2025-12-26T17:04:37+00:00","mainEntityOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/"},"wordCount":1250,"commentCount":0,"publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/10\/Azure_Hero_Wave_Magenta_MagentaGrad.webp","keywords":["AI"],"articleSection":["AI + machine learning"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/","name":"Announcing new products and features for Azure OpenAI Service including GPT-4o-Realtime-Preview with audio and speech capabilities | Microsoft Azure 
Blog","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#primaryimage"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#primaryimage"},"thumbnailUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/10\/Azure_Hero_Wave_Magenta_MagentaGrad.webp","datePublished":"2024-10-01T20:00:00+00:00","dateModified":"2025-12-26T17:04:37+00:00","description":"Announcing GPT-4o Audio API, an enhancement to the Azure OpenAI Service that adds advanced voice capabilities and expands GPT-4o's multimodal offerings.","breadcrumb":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#primaryimage","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/10\/Azure_Hero_Wave_Magenta_MagentaGrad.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/10\/Azure_Hero_Wave_Magenta_MagentaGrad.webp","width":1260,"height":708,"caption":"background 
pattern"},{"@type":"BreadcrumbList","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/announcing-new-products-and-features-for-azure-openai-service-including-gpt-4o-realtime-preview-with-audio-and-speech-capabilities\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog home","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/"},{"@type":"ListItem","position":2,"name":"AI + machine learning","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/ai-machine-learning\/"},{"@type":"ListItem","position":3,"name":"Announcing new products and features for Azure OpenAI Service including GPT-4o-Realtime-Preview with audio and speech capabilities"}]},{"@type":"WebSite","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","name":"Microsoft Azure Blog","description":"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.","publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization","name":"Microsoft Azure Blog","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","width":512,"height":512,"caption":"Microsoft Azure 
Blog"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/microsoftazure","https:\/\/x.com\/azure","https:\/\/www.instagram.com\/microsoftdeveloper\/","https:\/\/www.linkedin.com\/company\/16188386","https:\/\/www.youtube.com\/user\/windowsazure"]},{"@type":"Person","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/dddfb06db704f28e44dc633b15e0d6ae","name":"Brianna McGovern","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/471211b4d059ccb73aa3fda768b31973fb946424996c0376f7f0be3cb919d469?s=96&d=mm&r=g5fc6a76f72449f78acaf535ec3e0c54f","url":"https:\/\/secure.gravatar.com\/avatar\/471211b4d059ccb73aa3fda768b31973fb946424996c0376f7f0be3cb919d469?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/471211b4d059ccb73aa3fda768b31973fb946424996c0376f7f0be3cb919d469?s=96&d=mm&r=g","caption":"Brianna McGovern"},"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/briannamcgovern\/"}]}},"msxcm_display_generated_audio":false,"msxcm_animated_featured_image":null,"distributor_meta":false,"distributor_terms":false,"distributor_media":false,"distributor_original_site_name":"Microsoft Azure 
Blog","distributor_original_site_url":"https:\/\/azure.microsoft.com\/en-us\/blog","push-errors":false,"_links":{"self":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/36688","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/users\/39"}],"replies":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/comments?post=36688"}],"version-history":[{"count":3,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/36688\/revisions"}],"predecessor-version":[{"id":49110,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/36688\/revisions\/49110"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/media\/36697"}],"wp:attachment":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/media?parent=36688"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/categories?post=36688"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tags?post=36688"},{"taxonomy":"audience","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/audience?post=36688"},{"taxonomy":"content-type","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/content-type?post=36688"},{"taxonomy":"product","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/product?post=36688"},{"taxonomy":"tech-community","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tech-community?post=36688"},{"taxonomy":"topic","embe
ddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/topic?post=36688"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/coauthors?post=36688"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}