{"id":7565,"date":"2022-12-05T00:00:00","date_gmt":"2022-12-05T00:00:00","guid":{"rendered":"https:\/\/azure.microsoft.com\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech"},"modified":"2025-06-09T23:47:44","modified_gmt":"2025-06-10T06:47:44","slug":"improve-speechtotext-accuracy-with-azure-custom-speech","status":"publish","type":"post","link":"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/","title":{"rendered":"Improve speech-to-text accuracy with Azure Custom Speech"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">With <a href=\"https:\/\/azure.microsoft.com\/services\/cognitive-services\/speech-services\">Microsoft Azure Cognitive Services for Speech<\/a>, customers can build voice-enabled apps confidently and quickly in more than 140 languages. We make it easy for customers to transcribe speech to text (STT) with high accuracy, produce natural-sounding text-to-speech (TTS) voices, and translate spoken audio. In the past few years, we are inspired by the ways customers seek our customization features to fine-tune speech recognition to their use cases.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">As our speech technology continues to change and evolve, we want to introduce four custom speech-to-text capabilities and their respective customer use cases. With these features, you can evaluate and improve the speech-to-text accuracy for your applications and products. A custom speech model is trained on top of a base model. With a custom model, you can improve recognition of domain-specific vocabulary by providing text data to train the model. You can also improve recognition based on the specific audio conditions of the application by providing audio data with reference transcriptions.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"custom-speech-data-types-and-use-cases\">Custom Speech data types and use cases<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Our Custom Speech features will let you customize Microsoft&#8217;s speech-to-text engine. You will be able to customize the language model by tailoring it to the vocabulary of the application and customize the acoustic model to adapt to the speaking style of your users. By uploading text and\/or audio data through Custom Speech, you&#8217;ll be able to create these custom models, combine them with Microsoft&#8217;s state-of-the-art speech models, and deploy them to a custom speech-to-text endpoint that can be accessed from any device.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Phrase<\/strong> <strong>list<\/strong>: A real-time accuracy enhancement feature that does not need model training. For example, in a meeting or podcast scenario, you can add a list of participant names, products, and uncommon jargon using phrase list to boost their recognition.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Plain<\/strong> <strong>text<\/strong>: Our simplest custom speech model can be made using just text data. Customers in the media industry use this in use cases such as commentary of sports events. Because each sporting event\u2019s vocabulary differs significantly from others, building a custom model specific to a sport increases accuracy by biasing to the vocabulary of the event.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Structured<\/strong> <strong>text<\/strong>: This is text data that boosts patterns of sentences in speech. These patterns could be utterances that differ only by individual words or phrases, for example, \u201cMay I speak with <em>name<\/em>\u201d where <em>name<\/em> is a list of possible names of individuals. The pattern can link to this list of entities (<em>name<\/em> in this case), and you can also provide their unique pronunciations.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Audio<\/strong>: You can train a custom speech model using audio data, with or without human-labeled transcripts. With human-labeled transcripts, you can improve recognition accuracy on speaking styles, accents, or specific background noises. For American English, you can now train without needing a labeled transcript to improve acoustic aspects such as slight accents, speaking styles, and background noises.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"research-milestones\">Research milestones<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/www.microsoft.com\/research\/group\/speech-research-team\" target=\"_blank\" rel=\"noopener\">Microsoft\u2019s speech and dialog research group<\/a> achieved a milestone in reaching human parity in 2016 on the Switchboard conversational speech recognition task, meaning we had created technology that recognized words in a conversation as well as professional human transcribers. After further experimentation, we then followed up with a 5.1 percent word error rate, exceeding human parity in 2017. A <a href=\"https:\/\/arxiv.org\/abs\/1708.06073\" target=\"_blank\" rel=\"noopener\">technical report<\/a> published outlines the details of our system. Today, Custom Speech helps enterprises and developers improve upon the milestones achieved by Microsoft Research.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"customer-inspiration\">Customer inspiration<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Peloton<\/strong>: In the past, Peloton provided subtitles only for its on-demand classes. But that meant that the signature live experience so valued by members was not accessible to those who are deaf or hard of hearing. While the decision to introduce live subtitles was clear, executing on that vision proved a bit murkier. A primary challenge was determining how automated speech recognition software could facilitate Peloton\u2019s specific vocabulary, including the numerical phrases used for class countdowns and to set resistance and cadence levels. Latency was another issue\u2014subtitles wouldn\u2019t be very useful, after all, if they lagged behind what instructors were saying. <a href=\"https:\/\/news.microsoft.com\/transform\/using-microsoft-azure-and-its-ai-capabilities-peloton-develops-live-subtitles-for-members-who-are-deaf-or-hard-of-hearing\/\">Peloton chose Azure Cognitive Services<\/a> because it was cost-effective and allowed Peloton to customize its own machine learning model for converting speech to text\u2014and was significantly faster than other solutions on the market. Microsoft also provided a team of engineers that worked alongside Peloton throughout the development process.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"speech-services-and-responsible-ai\">Speech Services and Responsible AI<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">We\u202fare excited about the future\u202fof\u202fAzure Speech with human-like, diverse, and delightful quality under the high-level architecture of the <a href=\"https:\/\/www.microsoft.com\/research\/blog\/a-holistic-representation-toward-integrative-ai\" target=\"_blank\" rel=\"noopener\">XYZ-code<\/a> AI framework. Our technology advancements are also guided by <a href=\"https:\/\/www.microsoft.com\/ai\/responsible-ai\" target=\"_blank\" rel=\"noopener\">Microsoft\u2019s Responsible AI process<\/a>, and our principles of fairness, inclusiveness, reliability and safety, transparency, privacy and security, and accountability. We put these ethical standards into practice through the Office of Responsible AI (ORA)\u2014which sets our rules and governance processes, the AI Ethics and Effects in Engineering and Research (Aether) Committee\u2014which advises our leadership on the challenges and opportunities presented by AI innovations, and Responsible AI Strategy in Engineering (RAISE)\u2014a team that enables the implementation of Microsoft Responsible AI rules across engineering groups.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"get-started-with-azure-cognitive-services-for-speech\">Get started with Azure Cognitive Services for Speech<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">You can use <a href=\"https:\/\/speech.microsoft.com\">Speech Studio<\/a> to test how custom speech features would help improve recognition for your audio. In addition, start building new customer experiences with <a href=\"https:\/\/azure.microsoft.com\/services\/cognitive-services\/text-to-speech\">Azure Neural TTS<\/a> and <a href=\"https:\/\/azure.microsoft.com\/services\/cognitive-services\/speech-to-text\">STT<\/a>. In addition, the <a href=\"https:\/\/speech.microsoft.com\/customvoice\">Custom Neural Voice<\/a> capability enables organizations to create a unique brand voice in multiple languages and styles.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Resources<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/speech.microsoft.com\/portal\">Try out Speech services in the Studio<\/a>.<\/li>\n\n\n\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/learn.microsoft.com\/azure\/cognitive-services\/speech-service\/custom-speech-overview\" target=\"_blank\" rel=\"noopener\">Get started with Custom Speech<\/a>.<\/li>\n\n\n\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/docs.microsoft.com\/azure\/cognitive-services\/speech-service\/get-started-speech-to-text\" target=\"_blank\" rel=\"noopener\">Get started with speech to text<\/a>.<\/li>\n\n\n\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/docs.microsoft.com\/azure\/cognitive-services\/speech-service\/get-started-speech-translation\" target=\"_blank\" rel=\"noopener\">Get started with text to speech<\/a>.<\/li>\n\n\n\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/docs.microsoft.com\/azure\/cognitive-services\/speech-service\/how-to-custom-voice\" target=\"_blank\" rel=\"noopener\">Get started with Custom Neural Voice<\/a>.<\/li>\n\n\n\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/docs.microsoft.com\/azure\/cognitive-services\/speech-service\/get-started-speech-translation\" target=\"_blank\" rel=\"noopener\">Get started with speech translation<\/a>.<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>With Microsoft Azure Cognitive Services for Speech, customers can build voice-enabled apps confidently and quickly in more than 140 languages. We make it easy for customers to transcribe speech to text (STT) with high accuracy, produce natural-sounding text-to-speech (TTS) voices, and translate spoken audio.<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"ms_queue_id":[],"ep_exclude_from_search":false,"_classifai_error":"","_classifai_text_to_speech_error":"","_alt_title":"","footnotes":"","msx_community_cta_settings":[]},"categories":[1454],"tags":[],"audience":[3057,3055,3056],"content-type":[1481],"product":[3164],"tech-community":[],"topic":[],"coauthors":[1708],"class_list":["post-7565","post","type-post","status-publish","format-standard","hentry","category-ai-machine-learning","audience-data-professionals","audience-developers","audience-it-implementors","content-type-thought-leadership","product-microsoft-foundry","review-flag-1680286581-295","review-flag-1-1680286581-825","review-flag-5-1680286581-950","review-flag-and-o-1680286581-349","review-flag-machi-1680286585-314","review-flag-microsofts","review-flag-new-1680286579-546","review-flag-percent"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Improve speech-to-text accuracy with Azure Custom Speech | Microsoft Azure Blog<\/title>\n<meta name=\"description\" content=\"With Microsoft Azure Cognitive Services for Speech, customers can build voice-enabled apps confidently and quickly in more than 140 languages. We make it easy for customers to transcribe speech to text (STT) with high accuracy, produce natural-sounding text-to-speech (TTS) voices, and translate spoken audio.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Improve speech-to-text accuracy with Azure Custom Speech | Microsoft Azure Blog\" \/>\n<meta property=\"og:description\" content=\"With Microsoft Azure Cognitive Services for Speech, customers can build voice-enabled apps confidently and quickly in more than 140 languages. We make it easy for customers to transcribe speech to text (STT) with high accuracy, produce natural-sounding text-to-speech (TTS) voices, and translate spoken audio.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/\" \/>\n<meta property=\"og:site_name\" content=\"Microsoft Azure Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/microsoftazure\" \/>\n<meta property=\"article:published_time\" content=\"2022-12-05T00:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-06-10T06:47:44+00:00\" \/>\n<meta name=\"author\" content=\"Andy Beatman\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@azure\" \/>\n<meta name=\"twitter:site\" content=\"@azure\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Andy Beatman\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/\"},\"author\":[{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/andy-beatman\/\",\"@type\":\"Person\",\"@name\":\"Andy Beatman\"}],\"headline\":\"Improve speech-to-text accuracy with Azure Custom Speech\",\"datePublished\":\"2022-12-05T00:00:00+00:00\",\"dateModified\":\"2025-06-10T06:47:44+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/\"},\"wordCount\":932,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"articleSection\":[\"AI + machine learning\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/\",\"name\":\"Improve speech-to-text accuracy with Azure Custom Speech | Microsoft Azure Blog\",\"isPartOf\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\"},\"datePublished\":\"2022-12-05T00:00:00+00:00\",\"dateModified\":\"2025-06-10T06:47:44+00:00\",\"description\":\"With Microsoft Azure Cognitive Services for Speech, customers can build voice-enabled apps confidently and quickly in more than 140 languages. We make it easy for customers to transcribe speech to text (STT) with high accuracy, produce natural-sounding text-to-speech (TTS) voices, and translate spoken audio.\",\"breadcrumb\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Blog home\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"AI + machine learning\",\"item\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/ai-machine-learning\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Improve speech-to-text accuracy with Azure Custom Speech\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#website\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"name\":\"Microsoft Azure Blog\",\"description\":\"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.\",\"publisher\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization\",\"name\":\"Microsoft Azure Blog\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"contentUrl\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp\",\"width\":512,\"height\":512,\"caption\":\"Microsoft Azure Blog\"},\"image\":{\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/microsoftazure\",\"https:\/\/x.com\/azure\",\"https:\/\/www.instagram.com\/microsoftdeveloper\/\",\"https:\/\/www.linkedin.com\/company\/16188386\",\"https:\/\/www.youtube.com\/user\/windowsazure\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/c702e5edd662b328b49b7e1180cab117\",\"name\":\"shakir\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g7664e653ea371ce16eaf75e9fa8952c4\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g\",\"caption\":\"shakir\"},\"sameAs\":[\"https:\/\/azure.microsoft.com\"],\"url\":\"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/shakir\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Improve speech-to-text accuracy with Azure Custom Speech | Microsoft Azure Blog","description":"With Microsoft Azure Cognitive Services for Speech, customers can build voice-enabled apps confidently and quickly in more than 140 languages. We make it easy for customers to transcribe speech to text (STT) with high accuracy, produce natural-sounding text-to-speech (TTS) voices, and translate spoken audio.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/","og_locale":"en_US","og_type":"article","og_title":"Improve speech-to-text accuracy with Azure Custom Speech | Microsoft Azure Blog","og_description":"With Microsoft Azure Cognitive Services for Speech, customers can build voice-enabled apps confidently and quickly in more than 140 languages. We make it easy for customers to transcribe speech to text (STT) with high accuracy, produce natural-sounding text-to-speech (TTS) voices, and translate spoken audio.","og_url":"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/","og_site_name":"Microsoft Azure Blog","article_publisher":"https:\/\/www.facebook.com\/microsoftazure","article_published_time":"2022-12-05T00:00:00+00:00","article_modified_time":"2025-06-10T06:47:44+00:00","author":"Andy Beatman","twitter_card":"summary_large_image","twitter_creator":"@azure","twitter_site":"@azure","twitter_misc":{"Written by":"Andy Beatman","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/#article","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/"},"author":[{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/andy-beatman\/","@type":"Person","@name":"Andy Beatman"}],"headline":"Improve speech-to-text accuracy with Azure Custom Speech","datePublished":"2022-12-05T00:00:00+00:00","dateModified":"2025-06-10T06:47:44+00:00","mainEntityOfPage":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/"},"wordCount":932,"commentCount":0,"publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"articleSection":["AI + machine learning"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/","name":"Improve speech-to-text accuracy with Azure Custom Speech | Microsoft Azure Blog","isPartOf":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website"},"datePublished":"2022-12-05T00:00:00+00:00","dateModified":"2025-06-10T06:47:44+00:00","description":"With Microsoft Azure Cognitive Services for Speech, customers can build voice-enabled apps confidently and quickly in more than 140 languages. We make it easy for customers to transcribe speech to text (STT) with high accuracy, produce natural-sounding text-to-speech (TTS) voices, and translate spoken audio.","breadcrumb":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/improve-speechtotext-accuracy-with-azure-custom-speech\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog home","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/"},{"@type":"ListItem","position":2,"name":"AI + machine learning","item":"https:\/\/azure.microsoft.com\/en-us\/blog\/category\/ai-machine-learning\/"},{"@type":"ListItem","position":3,"name":"Improve speech-to-text accuracy with Azure Custom Speech"}]},{"@type":"WebSite","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#website","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","name":"Microsoft Azure Blog","description":"Get the latest Azure news, updates, and announcements from the Azure blog. From product updates to hot topics, hear from the Azure experts.","publisher":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/azure.microsoft.com\/en-us\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#organization","name":"Microsoft Azure Blog","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","contentUrl":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/06\/microsoft_logo.webp","width":512,"height":512,"caption":"Microsoft Azure Blog"},"image":{"@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/microsoftazure","https:\/\/x.com\/azure","https:\/\/www.instagram.com\/microsoftdeveloper\/","https:\/\/www.linkedin.com\/company\/16188386","https:\/\/www.youtube.com\/user\/windowsazure"]},{"@type":"Person","@id":"https:\/\/azure.microsoft.com\/en-us\/blog\/#\/schema\/person\/c702e5edd662b328b49b7e1180cab117","name":"shakir","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g7664e653ea371ce16eaf75e9fa8952c4","url":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/9342c7c05bb16548741bc5cd3a3e3b7ee0c8e746844ad2cc582db5beb5514c6f?s=96&d=mm&r=g","caption":"shakir"},"sameAs":["https:\/\/azure.microsoft.com"],"url":"https:\/\/azure.microsoft.com\/en-us\/blog\/author\/shakir\/"}]}},"msxcm_display_generated_audio":false,"msxcm_animated_featured_image":null,"distributor_meta":false,"distributor_terms":false,"distributor_media":false,"distributor_original_site_name":"Microsoft Azure Blog","distributor_original_site_url":"https:\/\/azure.microsoft.com\/en-us\/blog","push-errors":false,"_links":{"self":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/7565","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/comments?post=7565"}],"version-history":[{"count":1,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/7565\/revisions"}],"predecessor-version":[{"id":41118,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/posts\/7565\/revisions\/41118"}],"wp:attachment":[{"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/media?parent=7565"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/categories?post=7565"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tags?post=7565"},{"taxonomy":"audience","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/audience?post=7565"},{"taxonomy":"content-type","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/content-type?post=7565"},{"taxonomy":"product","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/product?post=7565"},{"taxonomy":"tech-community","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/tech-community?post=7565"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/topic?post=7565"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-json\/wp\/v2\/coauthors?post=7565"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}