Bing Speech API
Convert speech to text and back again, and understand its intent
Convert spoken audio to text in real time, whether the audio comes from a file, a live microphone, or another audio source. Real-time streaming is also available: as audio is sent to the server, partial recognition results are returned.
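To make the idea of partial results concrete, here is a minimal sketch in Python. It does not call the Bing Speech API; `stream_recognize` is a hypothetical stand-in that pretends each incoming chunk of audio has already been decoded to one word, and it only illustrates how a client might see partial hypotheses arrive before the final result.

```python
from typing import Iterable, Iterator, Tuple


def stream_recognize(chunks: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Hypothetical recognizer: yields a growing partial transcript per chunk.

    Assumption: each "chunk" is a word that was already decoded, which stands
    in for real audio data sent to a speech service.
    """
    words = []
    for chunk in chunks:
        words.append(chunk)
        # A partial result is available as soon as a chunk has been processed.
        yield ("partial", " ".join(words))
    # Once the stream ends, the complete transcript is reported as final.
    yield ("final", " ".join(words))


events = list(stream_recognize(["set", "an", "alarm"]))
for kind, text in events:
    print(f"{kind}: {text}")
```

An app could show the partial text while the user is still speaking and act only on the final result.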
Speech intent recognition
Convert spoken audio to intents that drive actions. Using Language Understanding Intelligent Service models, speech intent recognition lets your application not only convert spoken audio to text but also easily parse the speaker's intent and turn it into actions within the app, such as “set an alarm.”
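As a rough illustration of turning a recognized intent into an action, the Python sketch below routes an intent result to a handler. The result dictionary (`intent`, `score`, `entities`), the `SetAlarm` intent name, and the `min_score` threshold are all assumptions made for this example, not the service's actual response format.

```python
from typing import Callable, Dict


def set_alarm(entities: Dict[str, str]) -> str:
    # Hypothetical action; a real app would schedule the alarm here.
    return "Alarm set for " + entities.get("time", "an unspecified time")


# Map of intent names (assumed) to the app functions that carry them out.
HANDLERS: Dict[str, Callable[[Dict[str, str]], str]] = {
    "SetAlarm": set_alarm,
}


def dispatch(result: dict, min_score: float = 0.5) -> str:
    """Run the handler for the recognized intent, if confident enough."""
    handler = HANDLERS.get(result.get("intent"))
    if handler is None or result.get("score", 0.0) < min_score:
        return "Sorry, I didn't understand that."
    return handler(result.get("entities", {}))


reply = dispatch({"intent": "SetAlarm", "score": 0.92, "entities": {"time": "7 am"}})
print(reply)
```

Keeping the handlers in a lookup table means a new intent can be supported by adding one entry, and low-confidence results fall back to a safe reply.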
Text to speech conversion
Convert text to spoken audio. When applications need to “talk” back to their users, this API converts text generated by the app into audio that can be played back to the user.
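Text-to-speech systems commonly accept input as SSML (Speech Synthesis Markup Language, a W3C standard). The Python sketch below wraps app-generated text in a minimal SSML document and escapes characters that would otherwise break the XML. Whether this API expects exactly this payload is an assumption; the page does not describe the request format, and `build_ssml` is a hypothetical helper.

```python
from xml.sax.saxutils import escape, quoteattr

# Namespace defined by the W3C SSML 1.0 recommendation.
SSML_NS = "http://www.w3.org/2001/10/synthesis"


def build_ssml(text: str, lang: str = "en-US") -> str:
    """Wrap plain text in a minimal SSML <speak> element.

    Assumption: the target speech service accepts standard SSML 1.0.
    """
    return (
        f'<speak version="1.0" xmlns="{SSML_NS}" xml:lang={quoteattr(lang)}>'
        f"{escape(text)}"
        "</speak>"
    )


ssml = build_ssml("Alarm set for 7 & 8 am")
print(ssml)
```

Escaping matters because app-generated text can contain characters such as `&` or `<`, which are not valid inside XML content as-is.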
Check out the other Cognitive Services APIs
Teach your apps to understand commands from your users
Web Language Model API PREVIEW
Use the power of predictive language models trained on web-scale data
Bing Speech API PREVIEW
Convert speech to text and back again to understand user intent
Custom Speech Service PREVIEW
Overcome speech recognition barriers like speaking style, background noise, and vocabulary