Speaker Recognition API

Identify individual speakers or use speech as a means of authentication with the Speaker Recognition API

Speaker identification

Identify who is speaking. The API can be used to determine the identity of an unknown speaker. Input audio of the unknown speaker is paired against a group of selected speakers, and if a match is found, the speaker’s identity is returned.

We have selected five different US presidents and enrolled them to the service using one of the speeches they gave. To see how the demo works, select a speech for one of the presidents by clicking on the sample audios below, or upload one of your own, to test how to automatically identify which president is speaking.

See it in action
Play Audio White Stop Audio White Enrolment speech
Enrolment speech
Enrolment speech
Enrolment speech
Enrolment speech
Enrolment speech

Want to build this?

Have a look at the other Cognitive Services APIs

Computer Vision API

Distil actionable information from images

Content Moderator

Automated image, text and video moderation

Video API PREVIEW

Intelligent video processing

Video Indexer PREVIEW

Unlock video insights

Face API

Detect, analyse, organise and tag faces in photos

Emotion API PREVIEW

Personalise user experiences with emotion recognition

Custom Vision Service PREVIEW

Easily customise your own state-of-the-art computer vision models for your unique use case.

Language Understanding Intelligent Service PREVIEW

Teach your apps to understand commands from your users

Bing Spell Check API

Detecting and correcting spelling mistakes in your app

Web Language Model API PREVIEW

Use the power of predictive language models trained on web-scale data

Translator Speech API

Easily conduct real-time speech translation with a simple REST API call

Text Analytics API PREVIEW

Easily evaluate sentiment and topics to understand what users want

Translator Text API

Easily conduct automatic text translation with a simple REST API call

Linguistic Analysis API PREVIEW

Simplify complex language concepts and parse text with the Linguistic Analysis API.

Custom Speech Service PREVIEW

Overcome speech recognition barriers like speaking style, background noise and vocabulary

Bing Speech API

Convert speech to text and back again to understand user intent

Speaker Recognition API PREVIEW

Use speech to identify and authenticate individual speakers

Recommendations API PREVIEW

Predict and recommend items that your customers want

Knowledge Exploration Service PREVIEW

Enable interactive search experiences over structured data via natural language inputs

Entity Linking Intelligence Service API PREVIEW

Power your app’s data links with named entity recognition and disambiguation.

Academic Knowledge API

Tap into the wealth of academic content in the Microsoft Academic Graph

QnA Maker API PREVIEW

Distil information into conversational, easy-to-navigate answers.

Custom Decision Service PREVIEW

A cloud-based, contextual decision-making API that sharpens with experience

Project Prague

Gesture-based controls

Nanjing Project

Isochrones calculations

Project Johannesburg

Route logistics

Project Cuzco

Event associated with Wikipedia Entries

Project Abu Dhabi

Distance Matrix

Project Wollongong

Location insights

Ready to supercharge your app?