Custom Speech Service

Overcome speech recognition barriers such as speaking style, vocabulary and background noise.

Create custom language models

Customise the language model of the speech recogniser by tailoring it to the vocabulary of the application and the speaking style of your users.

Create custom acoustic models

Customise the acoustic model of the speech recogniser to better match the expected environment and user population of your application.

Deploy your custom models

Deploy your models to create a speech recognition endpoint that’s customised to your application.

Access your endpoint from any device

Send requests to your custom endpoint using RESTful API or the cognitive services speech client library.

Explore the Cognitive Services APIs

Computer Vision API

Distill actionable information from images

Face API

Detect, identify, analyse, organise and tag faces in photos

Content moderator

Cost-effective moderation of text, image and video content

Emotion API PREVIEW

Personalise experiences with emotion recognition

Video API PREVIEW

Intelligent video processing produces a stable video output, detects motion, creates intelligent thumbnails and detects and tracks faces

Custom Vision Service PREVIEW

A customisable web service that learns to recognise specific content in imagery

Video Indexer PREVIEW

Search, edit, analyse and learn from your videos

Language Understanding Intelligent Service PREVIEW

Teach your apps to understand commands from your users

Text Analytics API PREVIEW

Easily evaluate sentiment and topics to understand what users want

Bing Spell Check API

Help users correct spelling errors, recognise the difference among names, brand names and slang, as well as understand homophones as they’re typing

Translator Text API

Easily conduct real-time text translation with a simple REST API call

Web Language Model API PREVIEW

Use the power of predictive language models trained on web-scale data

Linguistic Analysis API PREVIEW

Simplify complex language concepts and parse text with the Linguistic Analysis API

Translator Speech API

Conduct real-time speech translations

Speaker Recognition API PREVIEW

Use speech to identify and authenticate individual speakers

Bing Speech API

Convert speech to text and back again to understand user intent

Custom Speech Service PREVIEW

Eliminate speech recognition barriers, such as speaking style, background noise and vocabulary

Recommendations API PREVIEW

Predict and recommend items your customers want

Academic Knowledge API PREVIEW

Tap into academic content in the Microsoft Academic Graph

Knowledge Exploration Service PREVIEW

Enable interactive search experiences over structured data via natural language inputs.

QnA Maker API PREVIEW

Distil information into conversational, easy-to-navigate answers.

Entity Linking Intelligence Service API PREVIEW

Power your app’s data links with named entity recognition and disambiguation

Custom Decision Service PREVIEW

A cloud-based, contextual decision-making API that sharpens with experience

Project Prague

Gesture-based controls

Project Cuzco

Event associated with Wikipedia entries

Project Nanjing

Isochrones calculations

Project Abu Dhabi

Distance matrix

Project Johannesburg

Route logistics

Project Wollongong

Location insights

Ready to supercharge your app?