Speech to Text
A Speech service feature that accurately converts spoken audio to text
Try Cognitive Services for free
Sign-in to Continue
You're almost ready to start building with your 7-day free evaluation.
Sign-in with your preferred account to get started
Make spoken audio actionable
Quickly and accurately transcribe audio to text in more than 40 languages and variants. Customize models to enhance accuracy for domain-specific terminology. Get more value from spoken audio by enabling search or analytics on transcribed text or facilitating action—all in your preferred programming language.
Get accurate transcriptions with state-of-the-art speech recognition.
Add specific words to your base vocabulary or build your own models.
Run Speech to Text anywhere—in the cloud or at the edge in containers.
Access the same robust technology that powers speech recognition across Microsoft products.
To try out the demo with your own voice using a microphone, please change to a different browser with WebRTC support, for example a recent version of Microsoft Edge, Firefox or Chrome.Microphone access was rejected.
Your speech data will not be stored
Customize speech models to your needs
Tailor your speech models to understand organization- and industry-specific terminology. Overcome speech recognition barriers such as background noise, accents, or unique vocabulary. Customize your models by uploading audio data and transcripts. Automatically generate custom models using Office 365 data to optimize speech recognition accuracy for your organization.
Deploy anywhere, from the cloud to the edge
Run Speech to Text wherever your data resides. Build speech applications that are optimized for both robust cloud capabilities and edge locality using containers (preview). Speech containers support both standard and custom speech.
Comprehensive privacy and security
- The Speech service, part of Azure Cognitive Services, is certified by SOC, FedRAMP, PCI DSS, HIPAA, HITECH, and ISO.
- Your data remains yours. Your audio input and transcription data aren’t logged during audio processing.
- View and delete your custom speech data and models at any time. Your data is encrypted while it’s in storage.
- Backed by Azure infrastructure, the Speech service offers enterprise-grade security, availability, compliance, and manageability.
Flexible pricing gives you the power and control you need
Pay only for what you use, with no upfront costs. With Speech to Text, you pay as you go based on the number of hours of audio you transcribe.
Built with Speech to Text
KPMG streamlines call transcription
KPMG uses Speech to Text to transcribe and catalog thousands of hours of calls, reducing compliance costs for its clients by as much as 80 percent.
Motorola helps first responders access vital data using voice
Motorola Solutions is helping police officers and other emergency first responders gain faster access to important information with a voice-powered virtual assistant.
Universal Electronics delivers voice-enabled smart home experiences
Universal Electronics is helping brands deliver voice-enabled navigation and control capabilities that work across everyday devices found in the home—offering a truly unique consumer experience.
Hochtief documents construction defects using voice
Hochtief is helping project managers identify and document construction defects at project sites with a voice-enabled virtual assistant.
NTT DATA accelerates decision-making with meeting insights
NTT DATA is unlocking insights from speech data with real-time meeting transcription. With Custom Speech, they are able to customize speech recognition models to understand organization-specific terms.
Insight powers conversational banking experiences
Insight Enterprises is helping banks bring digital speed and convenience to their branches with a conversational-AI powered banking solution. Speech to Text converts what customers say into data that can be processed and analyzed so that customers can get timely, relevant responses.