Speech to Text – Converts spoken audio to text for intuitive interaction
Easily add real-time speech-to-text capabilities to your applications for scenarios like voice commands, conversation transcription and call center log analysis.
Tailor your speech recognition models to adapt to users’ speaking styles, expressions and unique vocabularies and to accommodate background noises, accents and voice patterns.Learn More
Text to Speech – Give natural voice to your apps
Build smart apps and services which speak to users naturally with the Text to Speech service. Convert text to audio in near real time, tailor to change the speed of speech, pitch, volume and more.
Give your application a one-of-a-kind, recognisable brand voice using custom voice models. Simply record and upload training data and the service will create a unique voice font tuned to your recording.Learn More
Give your app real-time speech translation capabilities in any of the supported languages and receive either a text or speech translation back. Speech Translation models are based on leading-edge speech recognition and neural machine translation (NMT) technologies. They are optimised to understand the way people speak in real life and generate translations of exceptional quality.Learn More
Business scenarios built on Speech Services
Easily transcribe every call and optimise results through batch transcription and custom speech services enhanced for call center scenarios. Index call transcriptions for full-text search or apply text analytics to detect sentiment, language and key phrases for insights.Learn More
"We are impressed with the initial transcription accuracy of Custom Speech and Speaker Recognition. We are now working to optimise for a live environment which would be breakthrough for British Telecom Sport versus the current manual process."