Speaker Recognition

Identify individual speakers, or use speech as a means of authentication, with Speaker Recognition.

Speaker verification

Use your voice for verification. The API can be used to power applications with an intelligent verification tool. If a speaker claims a certain identity, their voice can be used to verify that claim.

To see how this works, select a pass phrase from the given list of phrases. Use this phrase and record three audio samples to register your voice with the service. This step is called “enrolment”. After you have enrolled, you can start the verification step using a different voice recording or phrase to test the service.
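The enrol-then-verify flow described above can be sketched in a few lines. This is a conceptual illustration only, not the Speaker Recognition API: the functions `enrol` and `verify`, the three-number "voice feature" vectors, and the 0.8 threshold are all hypothetical stand-ins chosen for the example. A real service would derive its features from the audio itself.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def enrol(samples):
    """Average the enrolment samples into a single voice profile.

    Hypothetical helper: requires three samples, mirroring the three
    recordings the demo asks for.
    """
    if len(samples) < 3:
        raise ValueError("at least three enrolment samples are required")
    length = len(samples[0])
    return [sum(s[i] for s in samples) / len(samples) for i in range(length)]


def verify(profile, sample, threshold=0.8):
    """Accept the identity claim if the sample is close enough to the profile."""
    return cosine_similarity(profile, sample) >= threshold


# Toy feature vectors standing in for three recordings of the pass phrase.
enrolment_samples = [[0.9, 0.1, 0.3], [1.0, 0.2, 0.2], [0.8, 0.1, 0.4]]
profile = enrol(enrolment_samples)

accepted = verify(profile, [0.95, 0.15, 0.3])  # similar to the enrolled voice
rejected = verify(profile, [0.1, 0.9, 0.2])    # a clearly different voice
print(accepted, rejected)
```

In this sketch the threshold sets the trade-off between wrongly accepting an impostor and wrongly rejecting the genuine speaker.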

See it in action

"i am going to make him an offer he cannot refuse"

Read the phrase above three times to enrol your voice.


Want to build this?

Speaker identification

Identify who is speaking. The API can be used to determine the identity of an unknown speaker. Input audio of the unknown speaker is compared against a group of selected speakers, and if a match is found, the speaker’s identity is returned.

We have selected six US presidents and enrolled them in the service using one of the speeches each of them gave. To see how the demo works, select a speech by one of the presidents by clicking on the audio samples below, or upload one of your own, to test how the service automatically identifies which president is speaking.
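Identification can be pictured as comparing one unknown sample against every enrolled profile and returning the best match, or nothing if no profile is close enough. The sketch below illustrates that idea only, and is not the Speaker Recognition API: the `identify` function, the speaker names, the toy feature vectors, and the 0.8 threshold are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def identify(profiles, sample, threshold=0.8):
    """Return the name of the best-matching enrolled speaker, or None.

    `profiles` maps a speaker name to that speaker's enrolled voice profile.
    """
    best_name, best_score = None, -1.0
    for name, profile in profiles.items():
        score = cosine_similarity(profile, sample)
        if score > best_score:
            best_name, best_score = name, score
    return best_name if best_score >= threshold else None


# Toy profiles standing in for a group of enrolled speakers.
profiles = {
    "speaker_a": [0.9, 0.1, 0.3],
    "speaker_b": [0.1, 0.9, 0.2],
    "speaker_c": [0.3, 0.3, 0.9],
}

match = identify(profiles, [0.85, 0.2, 0.25])     # closest to speaker_a
no_match = identify(profiles, [-0.5, -0.5, -0.5])  # unlike every profile
print(match, no_match)
```

Returning `None` when the best score falls below the threshold corresponds to the case where the unknown speaker is not in the selected group.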

See it in action

President Barack Obama
President George W. Bush
President William J. Clinton
President George H. W. Bush
President Ronald Reagan
President Jimmy Carter

Want to build this?

Explore the Cognitive Services APIs

Computer Vision

Distill actionable information from images

Face

Detect, identify, analyse, organise and tag faces in photos

Ink Recogniser

An AI service that recognises digital ink content, such as handwriting, shapes and ink document layout

Video Indexer

Unlock video insights

Custom Vision

Easily customise your own state-of-the-art computer vision models for your unique use case

Form Recogniser

The AI-powered document extraction service that understands your forms

Text Analytics

Easily evaluate sentiment and topics to understand what users want

Translator Text

Easily conduct machine translation with a simple REST API call

QnA Maker

Distill information into conversational, easy-to-navigate answers

Language Understanding

Teach your apps to understand commands from your users

Immersive Reader

Empower users of all ages and abilities to read and comprehend text

Speech services

Unified speech services for speech-to-text, text-to-speech and speech translation

Speaker Recognition

Use speech to identify and verify individual speakers

Speech translation

Easily integrate real-time speech translation into your app

Speech-to-Text

The Speech to Text API is part of Azure Cognitive Services Speech Services

Text to Speech

Convert text to speech to create more natural, accessible interfaces

Content Moderator

Automated image, text and video moderation

Anomaly Detector

Easily add anomaly detection capabilities to your apps

Personaliser

An AI service that delivers a personalised user experience

Ready to supercharge your app?