Speech to Text – Converts spoken audio to text for intuitive interaction
Easily add real-time speech-to-text conversion to your applications for use cases such as voice commands, live transcription or call centre log analysis.
Tailor your speech recognition models to adapt to users’ speaking styles, expressions or unique vocabulary, and to accommodate specific background noises, accents and voice patterns depending on your scenario.
Build smart apps and services that speak to users naturally with the Text to Speech service. Convert text to audio in near real time, and adjust the speaking rate, pitch, volume and more.
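Speed, pitch and volume adjustments of this kind are commonly expressed in SSML (Speech Synthesis Markup Language, a W3C standard) through its `prosody` element. The sketch below only builds such a document with the Python standard library; it does not call any speech service. The helper name `build_ssml` and the voice name `example-voice` are hypothetical, and whether a given service accepts exactly this markup is an assumption to check against its documentation.

```python
from xml.etree import ElementTree as ET

# Namespace defined by the W3C SSML specification.
SSML_NS = "http://www.w3.org/2001/10/synthesis"


def build_ssml(text, voice, rate="medium", pitch="medium",
               volume="medium", lang="en-GB"):
    """Return an SSML string that wraps `text` in a <prosody> element.

    `voice` is a placeholder; real voice names depend on the service.
    `rate`, `pitch` and `volume` take SSML prosody values such as
    "slow", "high" or "loud".
    """
    speak = ET.Element("speak", {
        "version": "1.0",
        "xmlns": SSML_NS,
        "xml:lang": lang,
    })
    voice_el = ET.SubElement(speak, "voice", {"name": voice})
    prosody = ET.SubElement(voice_el, "prosody", {
        "rate": rate,
        "pitch": pitch,
        "volume": volume,
    })
    # ElementTree escapes characters such as "&" and "<" on serialisation.
    prosody.text = text
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical voice name, used for illustration only.
    print(build_ssml("Welcome back.", "example-voice",
                     rate="slow", pitch="high", volume="loud"))
```

Building the markup with an XML library rather than string formatting keeps user-supplied text safely escaped, which matters when the spoken text comes from application data.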
Give your application a one-of-a-kind, recognisable brand voice using custom voice models. Simply record and upload training data, and the service will create a unique voice font tuned to your recording.
Give your app real-time speech translation capabilities in any of the supported languages and receive either a text or speech translation back. Speech translation models are based on leading-edge speech recognition and neural machine translation (NMT) technologies. They’re optimised to understand the way people speak in real life and generate translations of exceptional quality.
"We are impressed with the initial transcription accuracy of Custom Speech and Speaker Recognition. We are now working to optimise for a live environment which would be breakthrough for British Telecom Sport versus the current manual process."
Kevin Blyth, British Telecom Research and Innovation