
Speech translation

Easily integrate real-time speech translation into your app

Enable multilingual communication

Translate audio from more than 30 languages and customize your translations for your organization’s specific terms—all in your preferred programming language.


Benefit from fast, reliable speech translation powered by neural machine translation technology.

Customizable translations

Tailor models to recognize domain-specific terminology and unique speaking styles.

Normalized text

Deliver readable translations with an engine trained to normalize speech output.

Built-in security

Your data stays yours—your speech input is not logged during processing.


Add high-quality translations to your apps

Generate speech-to-speech and speech-to-text translations with a single API call. Speech Translation captures the context of full sentences to provide accurate, fluent translations and improve communication between speakers of different languages.
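
To make the single-call idea concrete, here is a minimal sketch using the Speech SDK for Python (the `azure-cognitiveservices-speech` package). The function name `translate_once`, the default languages, and the use of the default microphone are assumptions chosen for illustration; the key and region must come from your own Speech resource.

```python
def translate_once(key, region, source_lang="en-US", targets=("de", "fr")):
    """Recognize one utterance from the microphone and return its translations.

    Illustrative sketch: the function name, default languages, and microphone
    input are assumptions, not part of the product description above.
    """
    # Imported here so the sketch can be loaded without the SDK installed.
    import azure.cognitiveservices.speech as speechsdk

    config = speechsdk.translation.SpeechTranslationConfig(
        subscription=key, region=region
    )
    config.speech_recognition_language = source_lang
    for lang in targets:
        config.add_target_language(lang)

    audio = speechsdk.audio.AudioConfig(use_default_microphone=True)
    recognizer = speechsdk.translation.TranslationRecognizer(
        translation_config=config, audio_config=audio
    )

    # One call performs recognition and translation into every target language.
    result = recognizer.recognize_once()
    if result.reason == speechsdk.ResultReason.TranslatedSpeech:
        return dict(result.translations)  # e.g. {"de": "...", "fr": "..."}
    return {}
```

Calling `translate_once("<your-key>", "<your-region>")` would listen for a single utterance and return a dictionary keyed by target language. An empty dictionary means no speech was translated.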

Tailor translations to reflect domain-specific terminology

Customize speech recognition and translation for terminology specific to your business or industry. Train and deploy a custom translation system—without requiring machine learning expertise.

Normalize text for better translations

Speech Translation can remove verbal fillers ("um," "uh," and coughs) and repeated words, add proper punctuation and capitalization, and exclude profanities for more readable translations.
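
As a rough illustration of what this kind of normalization involves, the toy sketch below removes fillers, repeated words, and blocklisted words, then adds capitalization and a final period. It is not how the service works internally (the service uses a trained engine), and the word lists and the `normalize` function are hypothetical.

```python
import re

FILLERS = {"um", "uh", "er", "hmm"}  # assumed filler list, for illustration only
BLOCKLIST = {"darn"}                 # placeholder standing in for a profanity list


def normalize(transcript: str) -> str:
    """Drop fillers, blocklisted words, and immediate repeats, then punctuate."""
    words = re.findall(r"[A-Za-z']+", transcript)
    cleaned = []
    for word in words:
        lowered = word.lower()
        if lowered in FILLERS or lowered in BLOCKLIST:
            continue  # skip verbal fillers and blocklisted words
        if cleaned and cleaned[-1].lower() == lowered:
            continue  # skip a word that repeats the previous kept word
        cleaned.append(word)
    if not cleaned:
        return ""
    sentence = " ".join(cleaned)
    # Capitalize the first letter and close the sentence with a period.
    return sentence[0].upper() + sentence[1:] + "."


print(normalize("um I I want uh to to book a a flight"))
# prints "I want to book a flight."
```

A rule-based filter like this cannot tell an intentional repetition ("very very good") from a stutter, which is one reason a trained model is the better fit for real speech.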


Comprehensive privacy and security

  • The Speech service, part of Azure Cognitive Services, is certified by SOC, FedRAMP, PCI, HIPAA, HITECH, and ISO.
  • You control your data. Your audio input and translation data are not logged during audio processing.
  • View or delete any of your custom translator data and models at any time. Your data is encrypted while it’s in storage.
  • Backed by Azure infrastructure, the Speech service offers enterprise-grade security, availability, compliance, and manageability.

Flexible pricing gives you the power and control you need

  • Pay only for what you use, with no upfront costs.
  • With Speech Translation, you pay as you go, based on hours of audio translated.

Documentation and resources

Get started

Read our documentation

Take the Microsoft Learn course

Explore code samples

Check out our sample code

See customization resources

Customize your speech solution with Speech Studio. No code required.

Built with Speech Translation

Cheetah Mobile’s global app connects users

Leading mobile internet company Cheetah Mobile uses Speech Translation to bring its mobile app to international markets with high-quality, low-latency translations.


Qatar research institute uses AI for global impact

The Qatar Computing Research Institute uses Speech Translation for video captioning across multiple languages, providing decision-makers with actionable data for disaster management while saving time and costs.


Zencity improves quality of life with AI solutions

Data and analytics startup Zencity uses Speech Translation to analyze data from a variety of sources—social media, customer conversations, and more—to help governments make data-driven decisions to provide better services for their residents.


Get started with Speech