Cognitive Services Pricing—Computer Vision API

Use intelligence APIs to enable vision, language, and search capabilities.

This cloud-based API gives developers access to advanced algorithms that extract rich information from images in order to categorize and process visual data. Capabilities include image analytics, tagging, celebrity recognition, text extraction, and smart thumbnail generation.

Pricing details

The pricing below for Brand goes into effect on January 29, 2019.

Instance: Free - Web/Container
Transactions Per Second (TPS)**: 20 per minute

Instance: S1 - Web/Container
Transactions Per Second (TPS)**: 10 TPS

  Features: Tag; Face; GetThumbnail; Color; Image Type; GetAreaOfInterest
  Price:
    • 0-1M transactions: $- per 1,000 transactions
    • 1M-5M transactions: $- per 1,000 transactions
    • 5M-10M transactions: $- per 1,000 transactions
    • 10M-100M transactions: $- per 1,000 transactions
    • 100M+ transactions: $- per 1,000 transactions

  Features: OCR; Adult; Celebrity; Landmark; Detect, Objects; Brand
  Price:
    • 0-1M transactions: $- per 1,000 transactions
    • 1M-5M transactions: $- per 1,000 transactions
    • 5M-10M transactions: $- per 1,000 transactions
    • 10M-100M transactions: $- per 1,000 transactions
    • 100M+ transactions: $- per 1,000 transactions

  Features: Describe+; Recognize Text *; Read
  Price: $- per 1,000 transactions

Customers are charged per transaction, not per API call. See below for what counts as a transaction.

* Products in Preview

+ Non-English languages are in Preview

** TPS only applies to web endpoint
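
The volume tiers in the table can be read as graduated pricing, where each band of transactions is billed at its own rate. This is how the worked example in the FAQ splits 3,000,000 transactions into a first 1,000,000 and a remaining 2,000,000. The Python sketch below illustrates that reading. The tier boundaries come from the table; the per-1,000 rates are hypothetical placeholders chosen only to make the arithmetic visible (the actual prices are not shown on this page), and the names TIERS and graduated_cost are illustrative, not part of any Azure SDK.

```python
from decimal import Decimal

# (upper bound of tier in transactions, rate per 1,000 transactions).
# Bounds follow the pricing table; the rates are HYPOTHETICAL placeholders.
# An upper bound of None marks the open-ended "100M+" tier.
TIERS = [
    (1_000_000, Decimal("1.00")),
    (5_000_000, Decimal("0.80")),
    (10_000_000, Decimal("0.65")),
    (100_000_000, Decimal("0.60")),
    (None, Decimal("0.40")),
]


def graduated_cost(transactions, tiers=TIERS):
    """Bill each band of transactions at the rate of the tier it falls in."""
    total = Decimal("0")
    lower = 0  # lower bound of the current tier
    for upper, rate_per_1000 in tiers:
        if transactions <= lower:
            break  # nothing left to bill in this or any higher tier
        capped = transactions if upper is None else min(transactions, upper)
        in_tier = capped - lower
        total += Decimal(in_tier) / 1000 * rate_per_1000
        if upper is None:
            break
        lower = upper
    return total


if __name__ == "__main__":
    # First 1,000,000 at the first rate, remaining 2,000,000 at the second.
    print(graduated_cost(3_000_000))
```

With these placeholder rates, 3,000,000 transactions would cost 1,000 for the first band plus 1,600 for the second, or 2,600 in total. Decimal is used instead of float so that currency arithmetic does not pick up binary rounding error.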

Support & SLA

  • Free billing and subscription management support are included.
  • We guarantee that Cognitive Services running in the standard tier will be available at least 99.9 percent of the time. No SLA is provided for the free trial. Read the SLA.

FAQ

  • Please refer to the documentation for more detailed descriptions of these operations.

    • Adult: Detects adult/racy content to enable automated restriction in images.
    • Analyze: Calls multiple features at once. Specify which features you want to run and the API will run all of them together. Each feature included in “Analyze” will be counted as a separate transaction.
    • Celebrity: Recognizes 200,000 celebrities from business, politics, sports, and entertainment around the world.
    • Color: Extracts colors from an image. The colors are analyzed in three different contexts: foreground, background, and whole. The colors are grouped into 12 dominant accent colors.
    • Face: Analyzes human faces within an image.
    • GetThumbnail: Generates a high-quality thumbnail after an image is uploaded. It analyzes the objects within the image, then crops the image to fit the requirements of the region of interest (ROI).
    • Image Type: Indicates whether an image is black and white or color, and whether it is a line drawing. It also indicates whether an image is clip art, and its quality.
    • OCR & Read: Both features apply optical character recognition (OCR) to detect text in an image so that it can be extracted for other uses. The Read feature delivers the highest accuracy for printed and handwritten text extraction. Refer to the documentation for the languages that OCR and Read support.
    • Tag: Returns tags based on more than 2,000 recognizable objects, living beings, scenery, and actions. In cases where tags may be ambiguous or not common knowledge, the API response provides “hints” to clarify the meaning of the tag.
  • Each feature you select is counted as a transaction. There are a few special cases to note:

    1. Analyze allows you to select multiple features at once. For instance, an Analyze call specifying the Tag, Face, and Adult features would count as three transactions.
    2. Read allows you to upload multipage PDF documents. Each page is counted as a separate feature, and therefore as one transaction. For instance, a 200-page document would count as 200 transactions.
    3. All GET calls to see the results of the async Read and Recognize Text features are counted as transactions but are free of charge.
  • Each operation that you call (either individually or through “Analyze”) will be counted as a transaction. The total bill will be based on the number of transactions for each type of operation within a monthly billing period.

    As a specific example, let’s say you make the following calls in a certain monthly billing period:

    • 1,500,000 Analyze operations, each calling the Tag, Face, and Describe operations
    • 500,000 OCR operations
    • 4,000,000 Recognize Text operations

    Your total bill will be constructed as follows:

    Operations: 1,500,000 Tag and 1,500,000 Face operations
    Resource: S1 transactions
    Calculations:
      First 1,000,000 transactions: $-/1000 * 1,000,000 = $-
      Remaining 2,000,000 transactions: $-/1000 * 2,000,000 = $-
    Subtotal: $-

    Operations: 500,000 OCR operations
    Resource: S2 transactions
    Calculations: $-/1000 * 500,000 = $-
    Subtotal: $-

    Operations: 1,500,000 Describe and 4,000,000 Recognize Text operations
    Resource: S3 transactions
    Calculations: $-/1000 * 5,500,000 = $-
    Subtotal: $-

    Total: $-
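
The counting rules above can be sketched in Python as follows. This is an illustration only: the function name, the shape of the usage records, and the PRICE_GROUP mapping are assumptions made for the example, with the S1/S2/S3 labels borrowed from the Resource column of the calculation. It also assumes each Analyze call selects Tag, Face, and Describe, which is what the subtotals in the calculation imply. GET calls that fetch async Read or Recognize Text results are free of charge, so they are left out of the count.

```python
from collections import Counter

# Price group of each feature used in this example, following the
# groupings in the pricing table. Only a subset of features is listed.
PRICE_GROUP = {
    "Tag": "S1",
    "Face": "S1",
    "OCR": "S2",
    "Describe": "S3",
    "Recognize Text": "S3",
    "Read": "S3",
}


def count_transactions(usage):
    """Count billable transactions per feature and per price group.

    usage is an iterable of (features, calls, pages_per_call) records:
    every feature selected in a call is one transaction, and for
    multipage Read documents every page is one transaction.
    """
    per_feature = Counter()
    for features, calls, pages_per_call in usage:
        for feature in features:
            per_feature[feature] += calls * pages_per_call

    per_group = Counter()
    for feature, count in per_feature.items():
        per_group[PRICE_GROUP[feature]] += count
    return per_feature, per_group


# The monthly usage from the example above.
usage = [
    (["Tag", "Face", "Describe"], 1_500_000, 1),  # Analyze calls
    (["OCR"], 500_000, 1),
    (["Recognize Text"], 4_000_000, 1),
]
per_feature, per_group = count_transactions(usage)

if __name__ == "__main__":
    for group in sorted(per_group):
        print(group, per_group[group])
```

The per-group totals come out to 3,000,000 S1, 500,000 S2, and 5,500,000 S3 transactions, matching the quantities in the calculation. A single Read call on a 200-page PDF would be recorded as (["Read"], 1, 200) and counted as 200 transactions.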

Resources

Estimate your monthly costs for Azure services

Review Azure pricing frequently asked questions

Learn more about Cognitive Services

Review technical tutorials, videos, and more resources
