Skip to main content

Azure AI Vision pricing

Distill actionable information from images

This state-of-the-art, cloud-based API provides developers with access to advanced algorithms that allow you to extract rich information from images and video in order to categorise and process visual data. Capabilities include image analytics, tagging, recognition celebrities, text extraction and spatial analysis.

Explore pricing options

Apply filters to customise pricing options to your needs.

Prices are estimates only and are not intended as actual price quotes. Actual pricing may vary depending on the type of agreement entered with Microsoft, date of purchase, and the currency exchange rate. Prices are calculated based on US dollars and converted using London closing spot rates that are captured in the two business days prior to the last business day of the previous month end. If the two business days prior to the end of the month autumn on a bank holiday in major markets, the rate setting day is generally the day immediately preceding the two business days. This rate applies to all transactions during the forthcoming month. Sign in to the Azure pricing calculator to see pricing based on your current programme/offer with Microsoft. Contact an Azure sales specialist for more information on pricing or to request a price quote. See frequently asked questions about Azure pricing.

Instance Features Price
Free - Web/Container
20 transactions per minute
S1 - Web/Container
30 transactions per second for Read operations
15 transactions per second for non-Read operations
Image Type
People Detection*
Smart Crops
S2 - Web/Container OCR
Detect, objects
S3 - Web/Container Describe
Dense Captions*
Standard Spatial Analysis on Edge $- per hour
Customised Object Detection (preview) Training: $- per hour
Inferencing: $- per 1,000 transactions
Customised Image Classification (preview) Training: $- per hour
Inferencing: $- per 1,000 transactions
Shelf Image Composition (preview) $- per 1,000 transactions
Shelf Planogram Compliance (preview) $- per 1,000 transactions
Shelf Product Recognition (preview) $- per 1,000 transactions
Shelf Product Recognition - Customised (preview) $- per 1,000 products (bounding boxes)
Image Retrieval*
Background Removal*
Free during preview.

Customers are charged per transaction not per API call. Learn more about what transactions are below.

* Products in Preview

Commitment tiers

Instance Features Price per month Overage
Azure – S1 Read $- per 500,000 transactions $- per 1,000 transactions
$- per 2,000,000 transactions $- per 1,000 transactions
$- per 8,000,000 transactions $- per 1,000 transactions
Connected container – S1 Read $- per 500,000 transactions $- per 1,000 transactions
$- per 2,000,000 transactions $- per 1,000 transactions
$- per 8,000,000 transactions $- per 1,000 transactions

Disconnected container

Instance Category Features Price per year Max usage per year Project usage per month
Disconnected container Computer Vision Read $- 24M transactions 2M transactions
$- 96M transactions 8M transactions

Azure pricing and purchasing options

Connect with us directly

Get a walkthrough of Azure pricing. Understand pricing for your cloud solution, learn about cost optimisation and request a customised proposal.

Talk to a sales specialist

See ways to purchase

Purchase Azure services through the Azure website, a Microsoft representative or an Azure partner.

Explore your options

Additional resources

Azure AI Vision

Learn more about Azure AI Vision features and capabilities.

Pricing calculator

Estimate your expected monthly costs for using any combination of Azure products.


Review technical tutorials, videos, and more Azure AI Vision resources.

  • Please refer to the documentation for more detailed descriptions of these Computer Vision features.

    • Adult – Detects adult/racy content to enable automated restriction in images.
    • Analyse – Calls multiple features at once. Specify which features you want to run and the API will run all of these together. Each feature included in “Analyse” will be counted as a separate transaction.
    • Celebrity – Recognises 200,000 celebrities from business, politics, sports and entertainment around the world.
    • Colour – Extracts colours from an image. The colours are analysed in three different contexts: foreground, background and whole. The colours are grouped into 12 dominant accent colours.
    • Face – Analyses human faces within an image.
    • GetThumbnail – Generates a high-quality thumbnail after an image is uploaded. It analyses the objects within the image, then crops the image to fit the requirements of the region of interest (ROI).
    • Image type – Indicates whether an image is black and white or colour, as well as use the same method to indicate whether an image is a line drawing or not. Indicates whether an image is clipart or not, and the quality.
    • OCR & Read – Both features apply optical character recognition (OCR) technology for detecting text in an image, which can be extracted for multiple purposes. The Read feature delivers highest accuracy for printed and handwritten text extraction. Please refer to Documentation for supported languages for OCR and Read.
    • Spatial analysis – Understand how people move through a physical space in near-real time.
    • Tag – Returns tags based on more than 2,000 recognisable objects, living beings, scenery and actions. In cases where tags may be ambiguous or not commonly known, the API response provides “hints” to clarify the meaning of the tag.
  • Each feature you select is counted as a transaction. There are a few special cases to note:

    1. Analyse allows you to select multiple features at once. For instance, an Analyse call specifying the Tag, Face and Adult features would count as three transactions.
    2. Read allows you to upload multipage PDF documents. Each page is counted as a feature. For instance, a 200-page document would count as 200 transactions.
    3. All GET calls to see the results of the async Read and Recognise text features are counted as transactions but are free of charge.
  • Each operation that you call (either individually or through “Analyse”) will be counted as a transaction. The total bill will be based on the number of transactions for each type of operation within a monthly billing period.

    As a specific example, let’s say you make the following calls in a certain monthly billing period:

    • 1,500,000 Analyse operations, each calling both Tag and Describe operations
    • 500,000 OCR operations
    • 4,000,000 Recognise text operations

    Your total bill will be constructed as follows:

    Operations Resource Calculations Subtotal
    1,500,000 Tag and 1,500,000 Face operations: S1 transactions First 1,000,000 transactions: $-/1,000 * 1,000,000 = $-
    Remaining 2,000,000 transactions: $-/1,000 * 2,000,000 = $-
    500,000 OCR operations: S2 transactions $-/1,000 * 500,000 = $- $-
    1,500,000 Describe and 4,000,000 Recognise Text operations: S3 transactions $-/1,000 * 5,500,000 = $- $-
    Total $- $-

Talk to a sales specialist for a walk-through of Azure pricing. Understand pricing for your cloud solution.

Get free cloud services and a $200 credit to explore Azure for 30 days.

Added to estimate. Press 'v' to view on calculator
Can we help you?