Speaker Recognition API

Identify individual speakers or use speech as a means of authentication with the Speaker Recognition API.

Speaker Verification

Use your voice for verification. The API can power applications with an intelligent verification tool: if a speaker claims a certain identity, their voice is used to verify that claim.

To see how it works, select a pass phrase from the given list of phrases. Use that phrase to record three audio samples and register your voice with the service; this step is called "enrolment". After enrolment is complete, you can start the verification step, using a different voice recording or phrase to test the service.
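The enrol-then-verify flow described above can be sketched as a small in-memory model. This is an illustration only, not the Speaker Recognition API: the `VerificationProfile` class, the use of plain feature vectors in place of audio, the cosine-similarity scoring and the 0.9 threshold are all assumptions made for the example.

```python
import math

# Pass phrase shown in the demo; three recordings complete the enrolment.
PASS_PHRASE = "i am going to make him an offer he cannot refuse"
REQUIRED_ENROLMENTS = 3


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class VerificationProfile:
    """Hypothetical stand-in for an enrolled voice profile (not a real API type)."""

    def __init__(self, phrase):
        self.phrase = phrase
        self.samples = []  # one feature vector per enrolment recording

    @property
    def enrolled(self):
        return len(self.samples) >= REQUIRED_ENROLMENTS

    def enrol(self, phrase, features):
        """Store one recording; return how many enrolments are still needed."""
        if phrase != self.phrase:
            raise ValueError("enrolment must use the selected pass phrase")
        self.samples.append(list(features))
        return max(REQUIRED_ENROLMENTS - len(self.samples), 0)

    def verify(self, features, threshold=0.9):
        """Accept the claim if the new recording is close to the enrolled average."""
        if not self.enrolled:
            raise RuntimeError("enrolment is not complete")
        count = len(self.samples)
        centroid = [sum(column) / count for column in zip(*self.samples)]
        return cosine(centroid, features) >= threshold
```

In this sketch, calling `enrol` three times with the pass phrase completes the profile, after which `verify` accepts a recording only if it resembles the enrolled samples closely enough. How the real service represents and scores voices is not stated on this page.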

See it in action

"i am going to make him an offer he cannot refuse"

Read the phrase above three times to enrol your voice.


By uploading data for this demo, you agree that Microsoft may store it and use it to improve Microsoft services, including this API. To help protect your privacy, we take steps to de-identify your data and keep it secure. We shall not publish your data or let other people use it.

Want to build this?

Speaker Identification

Identify who is speaking. The API can be used to determine the identity of an unknown speaker. Input audio of the unknown speaker is compared against a group of selected speakers and, if a match is found, the speaker’s identity is returned.

We have selected five different US presidents and enrolled them in the service using one of the speeches each gave. To see how the demo works, select a speech by one of the presidents by clicking on the audio samples below, or upload one of your own, to test how the service automatically identifies which president is speaking.
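The identification step, in which an unknown speaker is compared against a group of enrolled speakers and an identity is returned only if a match is found, can be sketched as follows. This is a toy model under stated assumptions, not the service's actual method: the `identify` function, the placeholder speaker names, the three-number feature vectors, the cosine-similarity scoring and the 0.8 threshold are all hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


# Hypothetical group of five enrolled speakers, each reduced to one feature vector.
ENROLLED = {
    "speaker_1": [1.0, 0.0, 0.0],
    "speaker_2": [0.0, 1.0, 0.0],
    "speaker_3": [0.0, 0.0, 1.0],
    "speaker_4": [0.7, 0.7, 0.0],
    "speaker_5": [0.0, 0.7, 0.7],
}


def identify(unknown, enrolled, threshold=0.8):
    """Return the best-matching speaker name, or None if no score reaches the threshold."""
    best_name, best_score = None, threshold
    for name, features in enrolled.items():
        score = cosine(unknown, features)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name
```

Returning `None` when no candidate clears the threshold mirrors the "if a match is found" wording: an unknown voice that resembles nobody in the group yields no identity rather than the least-bad guess.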

See it in action

Enrolment speech

Want to build this?

Explore the Cognitive Services APIs

Computer Vision API

Distill actionable information from images

Face API

Detect, identify, analyze, organise, and tag faces in photos

Content Moderator

Cost-effective moderation of text, image and video content

Emotion API PREVIEW

Personalise experiences with emotion recognition

Video API PREVIEW

Intelligent video processing produces stable video output, detects motion, creates intelligent thumbnails and detects and tracks faces

Custom Vision Service PREVIEW

A customisable web service that learns to recognise specific content in imagery

Video Indexer PREVIEW

Search, edit, analyse and learn from your videos

Language Understanding Intelligent Service PREVIEW

Teach your apps to understand commands from your users

Text Analytics API PREVIEW

Easily evaluate sentiment and topics to understand what users want

Bing Spell Check API

Help users correct spelling errors, recognise the difference between names, brand names and slang, and understand homophones as they type

Translator Text API

Easily conduct real-time text translation with a simple REST API call

Web Language Model API PREVIEW

Use the power of predictive language models trained on web-scale data

Linguistic Analysis API PREVIEW

Simplify complex language concepts and parse text with the Linguistic Analysis API

Translator Speech API

Conduct real-time speech translations

Speaker Recognition API PREVIEW

Use speech to identify and authenticate individual speakers

Bing Speech API

Convert speech to text and back again to understand user intent

Custom Speech Service PREVIEW

Eliminate speech recognition barriers like speaking style, background noise and vocabulary

Recommendations API PREVIEW

Predict and recommend items your customers want

Academic Knowledge API PREVIEW

Tap into academic content in the Microsoft Academic Graph

Knowledge Exploration Service PREVIEW

Enable interactive search experiences over structured data via natural language inputs

QnA Maker API PREVIEW

Distill information into conversational, easy-to-navigate answers

Entity Linking Intelligence Service API PREVIEW

Power your app's data links with named entity recognition and disambiguation

Custom Decision Service PREVIEW

A cloud-based, contextual decision-making API that sharpens with experience

Project Prague

Gesture-based controls

Project Cuzco

Events associated with Wikipedia entries

Project Nanjing

Isochrone calculations

Project Abu Dhabi

Distance Matrix

Project Johannesburg

Route logistics

Project Wollongong

Location insights

Ready to supercharge your app?