Computer Vision API

Distil actionable information from images

Extract rich information from images to categorise and process visual data – and protect your users from unwanted content.

Analyse an image

Get information about visual content found in an image. Use tagging, descriptions and domain-specific models to identify content and label it with confidence. Apply the adult/racy settings to enable automated restriction of adult content. Identify image types and colour schemes in pictures.
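
As a rough sketch of how an app might act on an analysis result, the Python below keeps only the tags above a confidence threshold and checks the adult/racy flags before showing an image. The response shape (`tags`, `confidence`, `adult`, `isAdultContent`, `isRacyContent`), the helper names and the 0.8 threshold are assumptions made for illustration and are not stated on this page; check the API reference for the actual field names.

```python
# Hypothetical analysis result; the field names are assumed, not confirmed by this page.
SAMPLE_RESULT = {
    "tags": [
        {"name": "grass", "confidence": 0.99},
        {"name": "dog", "confidence": 0.93},
        {"name": "frisbee", "confidence": 0.41},
    ],
    "adult": {"isAdultContent": False, "isRacyContent": True},
}


def confident_tags(result, threshold=0.8):
    """Return the names of tags whose confidence is at or above the threshold."""
    return [
        tag["name"]
        for tag in result.get("tags", [])
        if tag.get("confidence", 0.0) >= threshold
    ]


def should_restrict(result, block_racy=False):
    """Decide whether to hide an image based on the (assumed) adult/racy flags."""
    adult = result.get("adult", {})
    if adult.get("isAdultContent", False):
        return True
    # Racy content is only restricted when the caller opts in.
    return bool(block_racy and adult.get("isRacyContent", False))
```

An app with a stricter audience could call `should_restrict(result, block_racy=True)`, while a more permissive one could restrict only on the adult flag.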

Generate a thumbnail

Generate a high-quality, storage-efficient thumbnail from any input image. Use thumbnail generation to modify images to best suit your needs for size, shape and style. Apply smart cropping to generate thumbnails whose aspect ratio differs from that of your original image, yet which preserve the region of interest.
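
A thumbnail request might be assembled along the following lines. This is a minimal sketch, not the documented call: the endpoint URL (region, version and path), the `width`, `height` and `smartCropping` query parameters, and the `Ocp-Apim-Subscription-Key` header are all assumptions that this page does not confirm. The function only builds the request and does not send it.

```python
import urllib.parse
import urllib.request

# Assumed endpoint; the region, version and path are not stated on this page.
THUMBNAIL_URL = "https://westus.api.cognitive.microsoft.com/vision/v1.0/generateThumbnail"


def build_thumbnail_request(image_bytes, subscription_key, width, height,
                            smart_cropping=True):
    """Build (but do not send) a POST request for a width x height thumbnail."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    # Assumed parameter names; smart cropping keeps the region of interest
    # when the target aspect ratio differs from the source image.
    query = urllib.parse.urlencode({
        "width": width,
        "height": height,
        "smartCropping": "true" if smart_cropping else "false",
    })
    return urllib.request.Request(
        f"{THUMBNAIL_URL}?{query}",
        data=image_bytes,
        headers={
            "Content-Type": "application/octet-stream",
            "Ocp-Apim-Subscription-Key": subscription_key,  # assumed header name
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send it; with a valid key, the response body would be expected to hold the thumbnail bytes.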

Read text in images

With optical character recognition (OCR), detect text in an image and extract the recognised words into a machine-readable character stream. Analyse images to detect embedded text, generate character streams and enable searching. Take photos of text instead of copying it by hand to save time and effort.
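
To illustrate the "character stream" and "searching" steps, the sketch below flattens an OCR result into plain text and runs a simple phrase search over it. The nested `regions`, `lines`, `words` and `text` fields are an assumed response shape, and both helper functions are hypothetical names introduced here.

```python
# Hypothetical OCR result; the regions -> lines -> words nesting is an assumption.
SAMPLE_OCR = {
    "regions": [
        {
            "lines": [
                {"words": [{"text": "OPENING"}, {"text": "HOURS"}]},
                {"words": [{"text": "9"}, {"text": "to"}, {"text": "5"}]},
            ]
        }
    ]
}


def ocr_to_text(result):
    """Join recognised words into plain text, one output line per detected line."""
    lines = []
    for region in result.get("regions", []):
        for line in region.get("lines", []):
            words = [word["text"] for word in line.get("words", [])]
            if words:
                lines.append(" ".join(words))
    return "\n".join(lines)


def contains_phrase(result, phrase):
    """Case-insensitive search over the extracted text."""
    return phrase.lower() in ocr_to_text(result).lower()
```

Once the text is extracted this way, it can be stored alongside the image and indexed like any other string.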

Recognise celebrities

The celebrity model is one example of a domain-specific model. Our new celebrity recognition model recognises 200,000 celebrities from business, politics, sport and entertainment around the world. Domain-specific models are a continuously evolving feature of the Computer Vision API.

Have a look at the other Cognitive Services APIs


Allow your apps to process natural language, evaluate sentiment and topics, and learn how to recognise what users want.

Language Understanding Intelligent Service PREVIEW

Teach your apps to understand commands from your users

Text Analytics API PREVIEW

Easily evaluate sentiment and topics to understand what users want

Web Language Model API PREVIEW

Use the power of predictive language models trained on web-scale data

Bing Spell Check API

Detect and correct spelling mistakes in your app

Translator Text API

Easily conduct automatic text translation with a simple REST API call


State-of-the-art image processing algorithms help you to moderate content automatically and build more personalised apps by returning smart insights about faces, images and emotions.


Detect, analyse, organise and tag faces in photos


Personalise user experiences with emotion recognition

Computer Vision API PREVIEW

Distil actionable information from images

Content Moderator PREVIEW

Automated image, text and video moderation


Process spoken language in your applications.

Bing Speech API

Convert speech to text and back again to understand user intent

Speaker Recognition API PREVIEW

Use speech to identify and authenticate individual speakers

Translator Speech API

Easily conduct real-time speech translation with a simple REST API call

Custom Speech Service PREVIEW

Overcome speech recognition barriers like speaking style, background noise and vocabulary


Make your apps, web pages and other experiences smarter and more engaging with the Bing Search APIs.

Bing Search APIs

Web, image, video and news search APIs for your app

Bing Autosuggest API

Give your app intelligent autosuggest options for searches


Map complex information and data in order to solve tasks such as intelligent recommendations and semantic search.

Recommendations API PREVIEW

Predict and recommend items that your customers want

Academic Knowledge API PREVIEW

Tap into the wealth of academic content in the Microsoft Academic Graph

Bing Speech API is licensed separately and is governed by the following Terms of Use.

Try Cognitive Services with a free Azure account