Skip Navigation

Bing Speech

Convert audio to text, understand intent, and convert text back to speech for natural responsiveness

Text to Speech

Convert text to spoken audio. When applications need to “talk” back to their users, this API can be used to convert text that is generated by the app into audio that can be played back to the user.

The Text-To-Speech API enables you to build smart apps that can speak. You can test it now, simply choose your target language, add your sentences then click on the play button to see how speech synthesis works. When you use this demo you consent to providing your voice input data to Microsoft for service improvement purposes.

See it in action

500 characters left

Want to build this?

Explore the Cognitive Services APIs

Computer Vision

Distill actionable information from images


Detect, identify, analyze, organize, and tag faces in photos

Video Indexer

Unlock video insights

Content Moderator

Automated image, text, and video moderation

Custom Vision PREVIEW

Easily customize your own state-of-the-art computer vision models for your unique use case

Text Analytics

Easily evaluate sentiment and topics to understand what users want

Translator Text

Easily conduct machine translation with a simple REST API call

Bing Spell Check

Detect and correct spelling mistakes in your app

Content Moderator

Automated image, text, and video moderation

Language Understanding

Teach your apps to understand commands from your users

Speaker Recognition PREVIEW

Use speech to identify and verify individual speakers

Speech Services

Unified speech services for speech-to-text, text-to-speech and speech translation

QnA Maker

Distill information into conversational, easy-to-navigate answers

Ready to supercharge your app?