Convert spoken audio to text. The API can recognise audio from the microphone in real time, from another real-time audio source, or from a file. In all cases real-time streaming is available: partial recognition results are returned while the audio is still being sent to the server.
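The streaming behaviour described above can be illustrated with a small simulation. This is only a sketch, not the service's actual client API: `fake_streaming_recogniser` and `audio_chunks` are hypothetical names, and each "audio chunk" is simply the bytes of one word so that the example runs without a microphone or a network connection.

```python
from typing import Iterable, Iterator, Tuple


def fake_streaming_recogniser(chunks: Iterable[bytes]) -> Iterator[Tuple[str, str]]:
    """Hypothetical stand-in for a streaming speech service.

    A real service decodes audio; here every chunk is assumed to carry
    exactly one word, encoded as UTF-8 bytes.
    """
    words = []
    for chunk in chunks:
        words.append(chunk.decode("utf-8"))
        # A partial result is emitted as soon as each chunk has been received.
        yield ("partial", " ".join(words))
    # Once the audio ends, the complete transcript is emitted as the final result.
    yield ("final", " ".join(words))


def audio_chunks() -> Iterator[bytes]:
    """Simulated real-time audio source (assumption: one word per chunk)."""
    for word in ["turn", "on", "the", "lights"]:
        yield word.encode("utf-8")


results = list(fake_streaming_recogniser(audio_chunks()))
for kind, text in results:
    print(f"{kind}: {text}")
```

The caller receives a growing partial transcript after every chunk and a single final transcript once the audio ends, which is the pattern an app would use to update its display while the user is still speaking.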
The Speech-to-text API enables you to build smart apps that are triggered by voice. To see how it works, select your target language, then click on the microphone and start speaking. Or simply click on one of the sample speech phrases to see how speech recognition works. When you use this demo, you consent to providing your voice input data to Microsoft for service improvement purposes.
See it in action
Text to Speech
Convert text to spoken audio. When applications need to “talk” back to their users, this API can be used to convert text that is generated by the app into audio that can be played back to the user.
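This page does not say what input format the service expects. As a hedged illustration only, the sketch below assumes the app-generated text is wrapped in SSML (the W3C Speech Synthesis Markup Language) before being sent for synthesis. `build_ssml` is a hypothetical helper, and no request to the service is made.

```python
import xml.etree.ElementTree as ET


def build_ssml(text: str, lang: str = "en-GB") -> str:
    """Wrap app-generated text in a minimal SSML <speak> element.

    Assumption: the target service accepts SSML; check its documentation
    for the attributes and voices it actually requires.
    """
    speak = ET.Element("speak", {"version": "1.0", "xml:lang": lang})
    speak.text = text  # ElementTree escapes &, < and > when serialising
    return ET.tostring(speak, encoding="unicode")


message = "Tom & Jerry arrive at <10:30>"
ssml = build_ssml(message)
print(ssml)
```

Building the document with an XML library rather than string concatenation means that characters such as `&` and `<` in the generated text are escaped, so the markup remains well-formed.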
The Text-to-speech API enables you to build smart apps that can speak. You can test it now – simply choose your target language, add your sentences, then click on the play button to see how speech synthesis works. When you use this demo, you consent to providing your voice input data to Microsoft for service improvement purposes.
See it in action
“Microsoft Cognitive Services gives us a huge range of opportunities. It’s a perfect match for us now, and in the future when we want to add more features to our app.”
Jaan Apajalahti: CEO | Blucup
“Using the Cognitive Services APIs, it took us three months to develop a test pair of glasses that can translate text and images into speech, identify emotions and describe scenery. If we had been working full-time, we could have done it in two weeks.”
Benoit Chirouter: R&D Director | Pivothead
Explore the Cognitive Services APIs
An easy-to-use, advert-free, commercial-grade search tool that lets you deliver the results you want