Build smart apps and services that speak to users naturally with the Text to Speech service. Convert text to audio in near real time, and tailor the output by adjusting speaking rate, pitch, volume and more.
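As an illustration of adjusting rate, pitch and volume, here is a minimal sketch that builds a request body in SSML, the W3C Speech Synthesis Markup Language, whose prosody element carries those three settings. The text above does not say how the service accepts these settings, so treating SSML as the input format is an assumption, and build_ssml is a hypothetical helper name, not part of any SDK.

```python
from xml.sax.saxutils import escape, quoteattr

# Namespace defined by the W3C SSML 1.0 recommendation.
SSML_NS = "http://www.w3.org/2001/10/synthesis"


def build_ssml(text, rate="medium", pitch="medium", volume="medium", lang="en-GB"):
    """Wrap text in an SSML <prosody> element (hypothetical helper).

    rate, pitch and volume take SSML 1.0 labels such as "slow", "high"
    or "loud". Whether a given service honours every label is an assumption.
    """
    return (
        f'<speak version="1.0" xmlns="{SSML_NS}" xml:lang={quoteattr(lang)}>'
        f"<prosody rate={quoteattr(rate)} pitch={quoteattr(pitch)} "
        f"volume={quoteattr(volume)}>"
        f"{escape(text)}"  # escape &, < and > so the markup stays well-formed
        "</prosody></speak>"
    )


if __name__ == "__main__":
    print(build_ssml("Welcome back. Your order has shipped.", rate="slow", volume="loud"))
```

The resulting string would be sent as the text input of a synthesis request. Keeping the markup in one helper means the voice settings can be changed without touching the text itself.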
Give your application a one-of-a-kind, recognisable brand voice using custom voice models. Simply record and upload training data, and the service will create a unique voice font tuned to your recording.
Give your app real-time speech translation capabilities in any of the supported languages, and receive the translation back as either text or speech. Speech translation models are based on leading-edge speech recognition and neural machine translation (NMT) technologies. They’re optimised to understand the way people speak in real life and generate translations of exceptional quality.
Easily transcribe every call and optimise results through batch transcription and custom speech services enhanced for call centre scenarios. Index call transcriptions for full-text search, or apply text analytics to detect sentiment, language and key phrases for insights.
"We are impressed with the initial transcription accuracy of Custom Speech and Speaker Recognition. We are now working to optimise for a live environment which would be breakthrough for British Telecom Sport versus the current manual process."
Kevin Blyth, British Telecom Research and Innovation