Text to Speech
A Speech service feature that converts text to lifelike speech
Try Cognitive Services for free
Sign-in to Continue
You're almost ready to start building with your 7-day free evaluation.
Sign-in with your preferred account to get started
Bring your apps to life with natural-sounding voices
Build apps and services that speak naturally, choosing from more than 100 voices in over 40 languages. Differentiate your brand with a customized voice, and access voices with different speaking styles and emotional tones to fit your use case—all in your preferred programming language.
Enable fluid, natural-sounding speech that matches the patterns and intonation of human voices.
Create a unique voice that reflects your brand’s identity.
Fine-grained audio controls
Tune voice output for your scenarios by easily adjusting rate, pitch, pronunciation, pauses, and more.
Run Text to Speech anywhere—in the cloud or at the edge in containers.
Access a wide variety of voices for every scenario
Engage global audiences by using more than 100 voices and over 40 languages and variants. Bring your scenarios to life with highly expressive and humanlike voices. Neural Text to Speech supports several speaking styles, including chat, newscast, and customer service, and emotions like cheerfulness and empathy.
Build a custom voice for your brand
Differentiate your brand with a unique custom voice. Develop a highly realistic voice for more natural conversational interfaces using the custom neural voice capability (preview), starting with 30 minutes of audio.
|Sample Text||Voice Sample|
Want to start building your own voice model?
Deploy anywhere, from the cloud to the edge
Run Text to Speech wherever your data resides. Build speech applications that are optimized for both robust cloud capabilities and edge locality using containers (preview). Speech containers support both standard and custom voice.
Comprehensive privacy and security
- The Speech service, part of Azure Cognitive Services, is certified by SOC, FedRAMP, PCI DSS, HIPAA, HITECH, and ISO.
- Your data remains yours. Your text data isn’t stored during data processing or audio generation.
- View and delete your custom voice data and models at any time. Your data is encrypted while it’s in storage.
- Backed by Azure infrastructure, the Speech service offers enterprise-grade security, availability, compliance, and manageability.
Flexible pricing gives you the power and control you need
Pay only for what you use, with no upfront costs. With Text to Speech, you pay as you go based on the number of characters you convert to audio.
Guidelines for building responsible synthetic voices
Learn about responsible deployment
Synthetic voices must be designed to earn the trust of others. Learn the principles of building synthetic voices that create confidence in your company and services.
Obtain consent from voice talent
Help voice talent understand how neural text-to-speech (TTS) works and get information on recommended use cases.
Documentation and resources
Explore code samples
See customization resources
Built with Text to Speech
Motorola helps first responders access vital data
Motorola Solutions is helping police officers and other emergency first responders gain access to important information more quickly with a voice-powered virtual assistant.
BBC innovates how it delivers trusted content
Using Azure Cognitive Services and Azure Bot Service, the BBC created an end-to-end, customized digital voice assistant that captures its brand identity and helps it establish a new conversational relationship with its broad audiences.
Universal Electronics powers connected smart homes
Universal Electronics is helping manufacturers deliver voice-enabled navigation and control capabilities that work across smart home devices.
Cheetah Mobile expands international translation
Cheetah Mobile, a mobile internet company with app users in more than 200 countries and regions, is using Text to Speech to expand accessibility of its translation device and app to international markets.