Skip Navigation

Text to speech

A Speech service feature that converts text to lifelike speech

Bring your apps to life with natural-sounding voices

Build apps and services that speak naturally. Differentiate your brand with a customized, realistic voice generator, and access voices with different speaking styles and emotional tones to fit your use case—from text readers and talkers to customer support chatbots.

Lifelike synthesized speech

Enable fluid, natural-sounding text to speech that matches the intonation and emotion of human voices.

Customizable text-talker voices

Create a unique AI voice generator that reflects your brand's identity.

Fine-grained text-to-talk audio controls

Tune voice output for your scenarios by easily adjusting rate, pitch, pronunciation, pauses, and more.

Flexible deployment

Run Text to Speech anywhere—in the cloud, on-premises, or at the edge in containers.

Access a wide variety of voices for every scenario

Engage global audiences by using 400 neural voices across 140 languages and variants. Bring your scenarios like text readers and voice-enabled assistants to life with highly expressive and human-like voices. Neural Text to Speech supports several speaking styles including newscast, customer service, shouting, whispering, and emotions like cheerful and sad.

Try Text to Speech with this demo app, built on our JavaScript SDK

Note: Your data will not be stored.

Learn how to build this

Note: Your data will not be stored.

Learn how to build this

Tailor your speech output

Fine-tune synthesized speech audio to fit your scenario. Define lexicons and control speech parameters such as pronunciation, pitch, rate, pauses, and intonation with Speech Synthesis Markup Language (SSML) or with the audio content creation tool.

Deploy Text to Speech anywhere, from the cloud to the edge

Run Text to Speech wherever your data resides. Build lifelike speech synthesis into applications optimized for both robust cloud capabilities and edge locality using containers.

Build a custom voice for your brand

Differentiate your brand with a unique custom voice. Develop a highly realistic voice for more natural conversational interfaces using the Custom Neural Voice capability, starting with 30 minutes of audio. Here are a few examples of organizations that are doing AI voice generation today:

Swisscom improves customer experiences with multilingual voice assistant

Swisscom used Speech service to create a natural sounding custom text-to-speech voice assistant with voice personas that are unique to Swisscom across English, French, German, and Italian.

Read the story

AT&T delights customers with immersive experiences

AT&T is showcasing the power of its 5G network with an immersive experience that allows its customers to talk directly to Bugs Bunny*.

*LOONEY TUNES and all related characters and elements © & ™ Warner Bros. Entertainment Inc. (s21)

Watch the video

Progressive brings Flo directly to its customers

Progressive used custom neural voice to build a natural-sounding, virtual version of Flo to help customers with everything from getting a free car insurance quote to general insurance questions.

Read the story

Fuel App Innovation with Cloud AI Services

Learn five key ways your organization can get started with AI to realize value quickly.

Comprehensive privacy and security

  • The Speech service, part of Azure Cognitive Services, is certified by SOC, FedRAMP, PCI DSS, HIPAA, HITECH, and ISO.
  • Your data remains yours. Your text data isn't stored during data processing or audio voice generation.
  • View and delete your custom voice data and synthesized speech models at any time. Your data is encrypted while it’s in storage.
  • Backed by Azure infrastructure, the Speech service offers enterprise-grade security, availability, compliance, and manageability.

Flexible pricing gives you the power and control you need

Pay only for what you use, with no upfront costs. With Text to Speech, you pay as you go based on the number of characters you convert to audio.

Guidelines for building responsible synthetic voices

Learn about responsible deployment

Synthetic voices must be designed to earn the trust of others. Learn the principles of building synthesized voices that create confidence in your company and services.

Obtain consent from voice talent

Help voice talent understand how neural text-to-speech (TTS) works and get information on recommended use cases.

Be transparent

Transparency is foundational to responsible use of computer voice generators and synthetic voices. Help ensure that users understand when they’re hearing a synthetic voice and that voice talent is aware of how their voice will be used. Learn more with our disclosure design guidelines.

Documentation and resources

Explore code samples

Check out the sample code

See customization resources

Customize your speech solution with Speech studio. No code required.

Built with Text to Speech

BBC innovates how it delivers trusted content

The BBC used Azure Cognitive Services and Azure Bot Service to create an end-to-end, customized digital voice assistant that captures its brand identity and establishes a conversational relationship with its broad audience.

BBC

Swisscom improves customer experiences with multi-lingual voice assistant

Swisscom used Speech service to create a natural sounding custom voice assistant with voice personas that are unique to Swisscom across English, French, German and Italian.

Swisscom

Motorola helps first responders access vital data

Motorola Solutions is helping police officers and other emergency first responders gain access to important information more quickly with a voice-powered virtual assistant.

Motorola Solutions

Universal Electronics powers connected smart homes

Universal Electronics is helping manufacturers deliver voice-enabled navigation and control capabilities that work across smart home devices.

Universal Electronics

Cheetah Mobile expands international translation

Cheetah Mobile, a mobile internet company with app users in more than 200 countries and regions, is using Text to Speech to expand accessibility of its translation device and app to international markets.

Cheetah Mobile

Ready when you are—let's set up your Azure free account

Can we help you?