Speech translation

A Speech service feature that translates speech in real time

Enable multilingual communication

Translate audio from more than 30 languages and customise your translations for your organisation’s specific terms – all in your preferred programming language.

Production-ready

Benefit from fast, reliable translations powered by neural machine translation technology.

Customisable translations

Tailor models to recognise domain-specific terminology and unique speaking styles.

Normalised text

Deliver readable translations with an engine trained to normalise speech output.

Built-in security

Your data remains yours – your speech input is not logged during processing.

Try Speech Translation with this demo app, built on our JavaScript SDK

* These target languages can be synthesised with Text to Speech as part of your call to the Speech service.

Your speech data will not be stored

Add high-quality translations to your apps

Generate speech-to-speech and speech-to-text translations with a single API call. Speech Translation captures the context of full sentences to provide accurate, fluent translations and improve communication between speakers of different languages.

Tailor translations to reflect domain-specific terminology

Customise speech recognition and translation for terminology specific to your business or industry. Train and deploy a custom translation system – without requiring machine learning expertise.

Normalise text for better translations

Speech Translation can remove verbal fillers (“um”, “uh” and coughs) and repeated words, add proper punctuation and capitalisation, and exclude profanities for more readable translations.

Comprehensive privacy and security

  • The Speech service, part of Azure Cognitive Services, is certified by SOC, FedRamp, PCI, HIPAA, HITECH and ISO.
  • You control your data. Your audio input and translation data are not logged during audio processing.
  • View or delete any of your custom translator data and models at any time. Your data is encrypted while it’s in storage.
  • Backed by Azure infrastructure, the Speech service offers enterprise-grade security, availability, compliance and manageability.

Flexible pricing gives you the power and control you need

  • Only pay for what you use, with no upfront costs.
  • With Speech Translation, you pay as you go, based on hours of audio translated.

Documentation and resources

Getting started

Read our documentation

Take the Microsoft Learn course

Explore code samples

Take a look at our sample code

See customisation resources

Customise your speech solution with Speech Studio. No code required.

Built with Speech Translation

Cheetah Mobile’s global app connects users

Leading mobile Internet company, Cheetah Mobile, uses Speech Translation to bring their mobile app to international markets with high-quality, low-latency translations.

Cheetah Mobile

Qatar research institute uses AI for global impact

The Qatar Computing Research Institute uses Speech Translation for video captioning across multiple languages, providing decision-makers with actionable data for disaster management while saving time and costs.

Qatar Computing Research Institute

Zencity improves quality of life with AI solutions

Data and analytics startup Zencity uses Speech Translation to analyse data from a variety of sources – social media, customer conversations and more – to help governments make data-driven decisions to provide better services for their residents.

Zencity

Get started with Speech