Skip navigation

Speaker Recognition

Identify individual speakers or use speech as a means of authentication with Speaker Recognition

Speaker verification

Use your voice for verification. The API can be used to power applications with an intelligent verification tool. If the speaker claims to be of a certain identity, use voice to verify this claim.

To see how this works, select a pass phrase from the given list of phrases. Use this phrase and record three audio samples to register your voice with the service. This step is called “enrolment”. After you have enrolled, you can start the verification step using a different voice recording or phrase to test the service.

See it in action

"i am going to make him an offer he cannot refuse"

Read the phrase above three times to enrol your voice.

1
2
3

Want to build this?

Speaker identification

Identify who is speaking. The API can be used to determine the identity of an unknown speaker. Input audio of the unknown speaker is paired against a group of selected speakers, and if a match is found, the speaker’s identity is returned.

We have selected five different US presidents and enrolled them to the service using one of the speeches they gave. To see how the demo works, select a speech for one of the presidents by clicking on the sample audios below, or upload one of your own, to test how to automatically identify which president is speaking.

See it in action

President Barack Obama
President George W. Bush
President William J. Clinton
President George H. W. Bush
President Ronald Reagan
President Jimmy Carter

Want to build this?

Explore the Cognitive Services APIs

Computer Vision

Distill actionable information from images

Face

Detect, identify, analyse, organise and tag faces in photos

Video Indexer

Unlock video insights

Content moderator

Automated image, text and video moderation

Custom Vision PREVIEW

Easily customise your own state-of-the-art computer vision models for your unique use case

Text Analytics

Easily evaluate sentiment and topics to understand what users want

Translator Text

Easily conduct machine translation with a simple REST API call

Bing Spell Check

Detecting and correcting spelling mistakes in your app

Content moderator

Automated image, text and video moderation

Language Understanding

Teach your apps to understand commands from your users

Speech services

Unified speech services for speech-to-text, text-to-speech and speech translation

Speaker Recognition PREVIEW

Use speech to identify and verify individual speakers

QnA Maker

Distill information into conversational, easy-to-navigate answers

Ready to supercharge your app?