Bing Speech API
Convert speech to text and back again, and understand the speaker's intent
Convert spoken audio to text in real time, whether the audio comes from a file or live from a microphone or another audio source. Real-time streaming is also available: partial recognition results are returned while the audio is still being sent to the server.
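As a rough illustration of the streaming side, the sketch below splits an audio source into fixed-size chunks that a client could send one at a time. The function name `iter_audio_chunks` and the 4096-byte chunk size are assumptions made for this example, not part of the API; the actual upload call is deliberately left out.

```python
import io

def iter_audio_chunks(stream, chunk_size=4096):
    """Yield successive byte chunks from a binary stream until it is exhausted."""
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        yield chunk

# Stand-in for an audio file or microphone buffer (hypothetical data).
audio = io.BytesIO(b"\x00" * 10000)
chunk_sizes = [len(chunk) for chunk in iter_audio_chunks(audio)]
```

A generator like this lets the client begin sending audio before the whole recording is available, which is what makes partial results possible on the server side.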
Speech intent recognition
Convert spoken audio to intent that drives actions. Using Language Understanding Intelligent Service (LUIS) models, speech intent recognition lets your application convert spoken audio to text and also parse the speaker's intent to trigger actions within the app, such as “set an alarm”.
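One way an app might turn a recognized intent into an action is a simple dispatch table. In the sketch below, the result dictionary (keys `intent`, `score`, `entities`), the intent name `SetAlarm`, and the 0.5 confidence threshold are all hypothetical, chosen only to illustrate the idea; the shape of a real response may differ.

```python
def set_alarm(entities):
    # Hypothetical action: a real app would schedule an alarm here.
    return "Alarm set for " + entities.get("time", "an unspecified time")

def fallback(entities):
    # Used when the intent is unknown or the confidence is too low.
    return "Sorry, I did not understand that."

# Maps intent names (assumed) to the functions that carry them out.
HANDLERS = {"SetAlarm": set_alarm}

def dispatch(result, min_score=0.5):
    """Run the handler for the recognized intent, or the fallback."""
    entities = result.get("entities", {})
    if result.get("score", 0.0) < min_score:
        return fallback(entities)
    handler = HANDLERS.get(result.get("intent"), fallback)
    return handler(entities)

# Hypothetical recognition result for the utterance "set an alarm for 7 am".
message = dispatch({"intent": "SetAlarm", "score": 0.92, "entities": {"time": "7 am"}})
```

Keeping the handlers in a table means new intents can be supported by adding an entry rather than editing the dispatch logic.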
Text to speech conversion
Convert text to spoken audio. When applications need to “talk” back to their users, this API converts text generated by the app into audio that can be played back to the user.
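Text-to-speech services commonly accept input as SSML, the W3C Speech Synthesis Markup Language. Assuming that is the case here, the sketch below wraps app-generated text in a minimal SSML document. The function name `build_ssml` and the voice name `example-voice` are placeholders, not real identifiers from the API.

```python
import xml.etree.ElementTree as ET

SSML_NS = "http://www.w3.org/2001/10/synthesis"

def build_ssml(text, lang="en-US", voice_name="example-voice"):
    """Return a minimal SSML document that asks for `text` to be spoken."""
    speak = ET.Element(
        "speak",
        {"version": "1.0", "xmlns": SSML_NS, "xml:lang": lang},
    )
    # voice_name is a placeholder; a real service defines its own voice names.
    voice = ET.SubElement(speak, "voice", {"name": voice_name})
    voice.text = text  # special characters such as & and < are escaped on output
    return ET.tostring(speak, encoding="unicode")

ssml = build_ssml("Your alarm is set for 7 & a half")
```

Building the document with an XML library rather than string concatenation keeps user-generated text from breaking the markup.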
Have a look at the other Cognitive Services APIs
Teach your apps to understand commands from your users
Web Language Model API (Preview)
Use the power of predictive language models trained on web-scale data
Bing Speech API (Preview)
Convert speech to text and back again, and understand user intent
Custom Speech Service (Preview)
Overcome speech recognition barriers like speaking style, background noise and vocabulary