Identify who is speaking. The API can be used to determine the identity of an unknown speaker. Input audio of the unknown speaker is paired against a group of selected speakers, and if a match is found, the speaker’s identity is returned.
We have selected five different US presidents and enrolled them to the service using one of the speeches they gave. To see how the demo works, select a speech for one of the presidents by clicking on the sample audios below, or upload one of your own, to test how to automatically identify which president is speaking.
See it in action
Want to build this?
Have a look at the other Cognitive Services APIs
Distil actionable information from images
Automated image, text and video moderation
Video API PREVIEW
Intelligent video processing
Video Indexer PREVIEW
Unlock video insights
Teach your apps to understand commands from your users
Detecting and correcting spelling mistakes in your app
Web Language Model API PREVIEW
Use the power of predictive language models trained on web-scale data
Easily conduct real-time speech translation with a simple REST API call
Give your app intelligent autosuggest options for searches
Search for news and get comprehensive results
Get enhanced search details from billions of web documents
Recommendations API PREVIEW
Predict and recommend items that your customers want
Knowledge Exploration Service PREVIEW
Enable interactive search experiences over structured data via natural language inputs
Power your app’s data links with named entity recognition and disambiguation.