Try Cognitive Services for free
Sign-in to create your API
You're almost ready to start building with your 7-day free evaluation.
Login using your preferred account
Custom speech service: Speech Transcription with Custom Model
Overcome speech recognition barriers such as speaking style, vocabulary, and background noise. Our speech recognition technologies combine multiple APIs to produce the text output. Customers can customise the APIs to their needs and available data.
See it in action
Create custom language models tailored to users’ speaking styles
Do not let varied vocabularies and speaking styles block understanding. Customise the language model of your app’s speech recognition by tailoring it to your industry expressions, technical, geography or market terms and even speaker style.
Adapt to user environment with custom acoustic models
Make sure your app’s speech recognition can function in all environments. With custom acoustic models, you can account for background noise and match your users’ expected environments.
Use robust speech models from Microsoft
Enable powerful, personalised speech recognition by building your own customised speech recognition models on top of Microsoft’s existing state-of-the-art models.
Want to build this?
Explore a speech scenario
Speech services combined with Language Understanding enables apps and users to interact naturally. Use Speech to Text to capture a user’s question, Language Understanding to parse intent and formulate an appropriate reply and Text to Speech to synthesise the text into a spoken response. Create conversational interfaces for various scenarios like banking, travel and entertainment.
Together, the Azure Bot Service and Language Understanding service enable developers to create conversational interfaces for various scenarios like banking, travel and entertainment. For example, a hotel’s concierge can use a bot to enhance traditional e-mail and phone call interactions by validating a customer via Azure Active Directory and using Cognitive Services to better contextually process customer requests using text and voice. The Speech recognition service can be added to support voice commands.
- 1 Customer uses your mobile app
- 2 Using Azure AD B2C, the user authenticates
- 3 Using the custom Application Bot, user requests information
- 4 Cognitive Services helps process the natural language request
- 5 Response is reviewed by customer who can refine the question using natural conversation
- 6 Once the user is happy with the results, the Application Bot updates the customer’s reservation
- 7 Application insights gathers runtime telemetry to help development with Bot performance and usage
Explore the Cognitive Services APIs
An easy-to-use, ad-free, commercial-grade search tool that lets you deliver the results you want
Use the Speech Devices SDK to build an ambient device and create a custom wake wordLearn More