Text to Speech
A Speech service feature that converts text to lifelike speech
Try Cognitive Services for free
Sign-in to Continue
You are almost ready to start building with your 7-day free evaluation.
Sign-in with your preferred account to get started
Bring your apps to life with natural-sounding voices
Build apps and services that speak naturally, choosing from more than 100 voices in over 40 languages. Differentiate your brand with a customised voice and access voices with different speaking styles and emotional tones to fit your use case—all in your preferred programming language.
Enable fluid, natural-sounding speech that matches the patterns and intonation of human voices.
Create a unique voice that reflects your brand’s identity.
Fine-grained audio controls
Tune voice output for your scenarios by easily adjusting rate, pitch, pronunciation, pauses and more.
Run Text to Speech anywhere—in the cloud or at the edge in containers.
Access a wide variety of voices for every scenario
Engage global audiences by using more than 100 voices and over 40 languages and variants. Bring your scenarios to life with highly expressive and humanlike voices. Neural Text to Speech supports several speaking styles, including chat, newscast and customer service and emotions like cheerfulness and empathy.
Build a custom voice for your brand
Differentiate your brand with a unique custom voice. Develop a highly realistic voice for more natural conversational interfaces using the custom neural voice capability (preview), starting with 30 minutes of audio.
|Sample Text||Voice Sample|
Want to start building your own voice model?
Deploy anywhere, from the cloud to the edge
Run Text to Speech wherever your data resides. Build speech applications that are optimised for both robust cloud capabilities and edge locality using containers (preview). Speech containers support both standard and custom voice.
Comprehensive privacy and security
- The Speech service, part of Azure Cognitive Services, is certified by SOC, FedRAMP, PCI DSS, HIPAA, HITECH and ISO.
- Your data remains yours. Your text data is not stored during data processing or audio generation.
- View and delete your custom voice data and models at any time. Your data is encrypted while it is in storage.
- Backed by Azure infrastructure, the Speech service offers enterprise-grade security, availability, compliance and manageability.
Flexible pricing gives you the power and control you need
Pay only for what you use, with no upfront costs. With Text to Speech, you pay as you go based on the number of characters you convert to audio.
Guidelines for building responsible synthetic voices
Learn about responsible deployment
Synthetic voices must be designed to earn the trust of others. Learn the principles of building synthetic voices that create confidence in your company and services.
Obtain consent from voice talent
Help voice talent understand how neural text-to-speech (TTS) works and get information on recommended use cases.