Speaker Recognition

Identify individual speakers or use speech as a means of authentication with Speaker Recognition

Speaker Verification

Use your voice for verification. The API can be used to power applications with an intelligent verification tool. If the speaker claims to be of a certain identity use voice to verify this claim.

To see how is works, select a pass phrase from the given list of phrases. Use that phrase and record three audio samples to register your voice with the service, this step is called "enrollment". After your enrollment is completed, you can start the verification step using a different voice recording or phrase to test the service.

See it in action

"i am going to make him an offer he cannot refuse"

Read the phrase above three times to enroll your voice.

1
2
3

Want to build this?

Speaker Identification

Identify who is speaking. The API can be used to determine the identity of an unknown speaker. Input audio of the unknown speaker is paired against a group of selected speakers, and in the case there is a match found, the speaker’s identity is returned.

We have selected 5 different US presidents and enrolled them to the service using one of the speeches they gave. To see how the demo works, select a speech for one of the presidents by clicking on the sample audios below, or upload one of your own, to test how to automatically identify which president is speaking.

See it in action

President Barack Obama
President George W Bush
President William J Clinton
President George H W Bush
President Ronald Reagan
President Jimmy Carter

Want to build this?

Explore the Cognitive Services APIs

Computer Vision

Distill actionable information from images

Face

Detect, identify, analyze, organize, and tag faces in photos

Video Indexer

Unlock video insights

Content Moderator

Automated image, text, and video moderation

Custom Vision

Easily customize your own state-of-the-art computer vision models for your unique use case

Text Analytics

Easily evaluate sentiment and topics to understand what users want

Translator Text

Easily conduct machine translation with a simple REST API call

Bing Spell Check

Detect and correct spelling mistakes in your app

QnA Maker

Distill information into conversational, easy-to-navigate answers

Content Moderator

Automated image, text, and video moderation

Language Understanding

Teach your apps to understand commands from your users

Speech to Text

The Speech to Text API is part of Azure Cognitive Services Speech Services

Speaker Recognition PREVIEW

Use speech to identify and verify individual speakers

Text to Speech

Convert text to speech to create more natural, accessible interfaces

Speech Translation

Easily integrate real-time speech translation to your app

Anomaly Detector PREVIEW

Easily add anomaly detection capabilities to your apps.

Ready to supercharge your app?