Azure AI Vision pricing

Distill actionable information from images and videos

This cloud-based API gives developers access to advanced algorithms that extract rich information from images and videos so that visual data can be categorized and processed. Capabilities include image tagging, people detection, text extraction (OCR), and spatial analysis.

Explore pricing options

Apply filters to customize pricing options to your needs.

Prices are estimates only and are not intended as actual price quotes. Actual pricing may vary depending on the type of agreement entered with Microsoft, date of purchase, and the currency exchange rate. Prices are calculated based on US dollars and converted using London closing spot rates that are captured in the two business days prior to the last business day of the previous month end. If the two business days prior to the end of the month fall on a bank holiday in major markets, the rate setting day is generally the day immediately preceding the two business days. This rate applies to all transactions during the upcoming month. Sign in to the Azure pricing calculator to see pricing based on your current program/offer with Microsoft. Contact an Azure sales specialist for more information on pricing or to request a price quote. See frequently asked questions about Azure pricing.

US government entities are eligible to purchase Azure Government services from a licensing solution provider with no upfront financial commitment, or directly through a pay-as-you-go online subscription.

Learn more

Important—The price in R$ is merely a reference; this is an international transaction and the final price is subject to exchange rates and the inclusion of IOF taxes. An eNF will not be issued.

Image Analysis

Instance	Features		Price
Free (F0) - Web/Container	All		5,000 free transactions per month; N/A in selected region; 20 transactions per minute
Standard (S1) - Web/Container	Background Removal (preview)		Free
	Group 1	Tag, Face, GetThumbnail, Color, Image Type, GetAreaOfInterest, People Detection (preview), Smart Crops, OCR, Adult, Celebrity, Landmark, Object Detection, Brand	0-1M transactions: $- per 1,000 transactions; 1-10M transactions: $- per 1,000 transactions; 10-100M transactions: $- per 1,000 transactions; 100M+ transactions: $- per 1,000 transactions
	Group 2	Describe, Read, Caption, Dense Captions	0-1M transactions: $- per 1,000 transactions; 1M+ transactions: $- per 1,000 transactions
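
The tiered rows above can be read as graduated pricing, where each slice of monthly usage is charged at the rate of the tier it falls into (the billing example in the FAQ splits usage the same way, into a first 1,000,000 transactions and a remainder). The following is a minimal sketch under that reading. The rates are hypothetical placeholders, because the table shows "$-" for every price, and the function name `graduated_cost` is invented for illustration.

```python
from decimal import Decimal

# Hypothetical rates per 1,000 transactions; the source table shows "$-".
# Each entry is (upper bound of the tier in transactions, rate per 1,000).
# None means the tier has no upper bound.
GROUP_1_TIERS = [
    (1_000_000, Decimal("1.00")),
    (10_000_000, Decimal("0.80")),
    (100_000_000, Decimal("0.60")),
    (None, Decimal("0.40")),
]


def graduated_cost(transactions, tiers):
    """Charge each slice of usage at the rate of the tier it falls in."""
    cost = Decimal("0")
    lower = 0  # transactions already priced by earlier tiers
    for upper, rate_per_1k in tiers:
        if transactions <= lower:
            break
        top = transactions if upper is None else min(transactions, upper)
        cost += Decimal(top - lower) / 1000 * rate_per_1k
        lower = top
    return cost
```

With these placeholder rates, 3,000,000 transactions would be priced as 1,000,000 at the first rate plus 2,000,000 at the second.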

Image Analysis Model Customization

Instance	Features	Price
Free (F0) - Web/Container	All	5,000 free inferencing transactions per month; N/A in selected region; 20 transactions per minute
Standard (S1) - Web/Container	Custom Object Detection (preview), Custom Image Classification (preview)	Training: $-/hour; Inferencing: $-/1K transactions
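
The free tier rows list a limit of 20 transactions per minute. One way a client might stay under such a limit is a sliding-window check before each call. This is only a sketch: the class name `MinuteRateLimiter` and its methods are hypothetical, and the source does not say how the service itself measures the minute.

```python
import time
from collections import deque


class MinuteRateLimiter:
    """Client-side sliding-window limiter (illustrative, not a service API)."""

    def __init__(self, max_calls=20, window_seconds=60.0, clock=time.monotonic):
        self.max_calls = max_calls
        self.window_seconds = window_seconds
        self.clock = clock          # injectable so the limiter can be tested
        self._sent = deque()        # timestamps of calls inside the window

    def seconds_until_allowed(self):
        """Return 0.0 if a call may be sent now, else the wait in seconds."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self._sent and now - self._sent[0] >= self.window_seconds:
            self._sent.popleft()
        if len(self._sent) < self.max_calls:
            return 0.0
        return self.window_seconds - (now - self._sent[0])

    def record_call(self):
        """Remember that a call was just sent."""
        self._sent.append(self.clock())
```

A caller would sleep for `seconds_until_allowed()` when it is non-zero, send the request, and then call `record_call()`.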

Image Analysis Product Recognition

Instance	Features	Price
Free (F0) - Web/Container	All	5,000 free transactions per month; N/A in selected region; 20 transactions per minute
Standard (S1) - Web/Container	Shelf Image Composition (preview)	$-/1K transactions
	Shelf Planogram Compliance (preview), Shelf Product Recognition (preview)	$-/1K transactions
	Shelf Product Recognition - Custom (preview)	$-/1K objects

Image Analysis Multimodal Embeddings

Instance	Features	Price
Free (F0) - Web/Container	All	5,000 free transactions per month; N/A in selected region
Standard (S1) - Web/Container	Text Embeddings	$0.014 per 1,000 transactions
Standard (S1) - Web/Container	Image Embeddings	$0.1 per 1,000 transactions
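
As a rough illustration of the Standard (S1) embedding rates listed above, the sketch below estimates a monthly cost from text and image transaction counts. It assumes usage is charged pro rata rather than rounded up to whole blocks of 1,000, which the table does not state. The function name `embeddings_cost` is hypothetical.

```python
from decimal import Decimal

TEXT_RATE_PER_1K = Decimal("0.014")   # Text Embeddings, from the table above
IMAGE_RATE_PER_1K = Decimal("0.1")    # Image Embeddings, from the table above


def embeddings_cost(text_transactions, image_transactions):
    """Estimate the S1 cost in USD, assuming pro rata billing per 1,000."""
    text = Decimal(text_transactions) / 1000 * TEXT_RATE_PER_1K
    image = Decimal(image_transactions) / 1000 * IMAGE_RATE_PER_1K
    return text + image
```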

Spatial Analysis

Instance	Features	Price
Free (F0) - Web/Container	Spatial Analysis on Edge	1 free camera/month
Standard (S1) - Web/Container	Spatial Analysis on Edge	$- per hour

Video Retrieval

Instance	Features	Price
Standard (S1) - Web/Container	Ingestion	$0.05 per minute of video
Standard (S1) - Web/Container	Query	$0.25 per 1,000 queries
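
Video Retrieval combines a per-minute ingestion charge with a per-query charge. A small estimate under the rates above might look like the following. It assumes fractional minutes and partial blocks of 1,000 queries are billed pro rata, which the table does not confirm. The function name `video_retrieval_cost` is hypothetical.

```python
from decimal import Decimal

INGESTION_RATE_PER_MINUTE = Decimal("0.05")  # from the table above
QUERY_RATE_PER_1K = Decimal("0.25")          # from the table above


def video_retrieval_cost(video_seconds, queries):
    """Estimate USD cost for a list of video durations (seconds) plus queries."""
    total_minutes = Decimal(sum(video_seconds)) / 60
    ingestion = total_minutes * INGESTION_RATE_PER_MINUTE
    query = Decimal(queries) / 1000 * QUERY_RATE_PER_1K
    return ingestion + query
```

For example, ingesting a 30-minute and a 90-minute video gives 120 minutes of ingestion before any query charges are added.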

Commitment Tiers

Instance	Features	Price per month	Overage
Azure - S1	Read	$- per 500,000 transactions	$- per 1,000 transactions
		$- per 2,000,000 transactions	$- per 1,000 transactions
		$- per 8,000,000 transactions	$- per 1,000 transactions
Connected container - S1	Read	$- per 500,000 transactions	$- per 1,000 transactions
		$- per 2,000,000 transactions	$- per 1,000 transactions
		$- per 8,000,000 transactions	$- per 1,000 transactions
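
A commitment tier charges a fixed monthly price for an included volume, plus an overage rate for anything beyond it. The sketch below compares tiers for a given monthly volume. All prices and overage rates are hypothetical placeholders (the table shows "$-"), only the included volumes come from the table, and pro rata overage billing is an assumption.

```python
from decimal import Decimal

# (included transactions, hypothetical monthly price, hypothetical overage per 1,000)
TIERS = [
    (500_000, Decimal("400"), Decimal("0.80")),
    (2_000_000, Decimal("1400"), Decimal("0.70")),
    (8_000_000, Decimal("4800"), Decimal("0.60")),
]


def monthly_cost(transactions, included, price, overage_per_1k):
    """Fixed tier price plus overage on usage beyond the included volume."""
    overage = max(0, transactions - included)
    return price + Decimal(overage) / 1000 * overage_per_1k


def cheapest_tier(transactions, tiers=TIERS):
    """Return (included volume, cost) of the tier with the lowest total cost."""
    return min(
        ((included, monthly_cost(transactions, included, price, rate))
         for included, price, rate in tiers),
        key=lambda pair: pair[1],
    )
```

With these placeholder numbers, a smaller tier plus overage can still be cheaper than the next tier up, so it may be worth comparing totals rather than picking the tier by included volume alone.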

Disconnected container

Instance	Category	Features	Price per year	Max usage per year	Projected usage per month
Disconnected container	Computer Vision	Read	$-	24M transactions	2M transactions
Disconnected container	Computer Vision	Read	$-	96M transactions	8M transactions
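
The monthly figures in the table are the annual caps divided by twelve. A short sketch of that arithmetic, together with a helper that picks the smallest annual plan covering an expected monthly volume, is shown below. The function names are hypothetical and the helper assumes steady usage across the year.

```python
ANNUAL_CAPS = [24_000_000, 96_000_000]  # max usage per year, from the table above


def projected_monthly(annual_cap):
    """Spread an annual cap evenly over twelve months."""
    return annual_cap // 12


def smallest_plan_for(monthly_usage, caps=ANNUAL_CAPS):
    """Return the smallest annual cap covering twelve months of usage, or None."""
    for cap in sorted(caps):
        if monthly_usage * 12 <= cap:
            return cap
    return None
```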

Azure pricing and purchasing options

Connect with us directly

Get a walkthrough of Azure pricing. Understand pricing for your cloud solution, learn about cost optimization and request a custom proposal.

Talk to a sales specialist

See ways to purchase

Purchase Azure services through the Azure website, a Microsoft representative, or an Azure partner.

Explore your options

Additional resources

Azure AI Vision

Learn more about Azure AI Vision features and capabilities.

Pricing calculator

Estimate your expected monthly costs for using any combination of Azure products.

Documentation

Review technical tutorials, videos, and more Azure AI Vision resources.

Frequently asked questions

Frequently asked questions about Azure pricing

Please refer to the documentation for more detailed descriptions of these Computer Vision features.
- Adult: Detects adult/racy content to enable automated restriction in images.
- Analyze: Calls multiple features at once. Specify which features you want to run and the API will run all of these together. Each feature included in “Analyze” will be counted as a separate transaction.
- Celebrity: Recognizes 200,000 celebrities from business, politics, sports, and entertainment around the world.
- Color: Extracts colors from an image. The colors are analyzed in three different contexts: foreground, background, and whole. The colors are grouped into 12 dominant accent colors.
- Face: Analyzes human faces within an image.
- GetThumbnail: Generates a high-quality thumbnail after an image is uploaded. It analyzes the objects within the image, then crops the image to fit the requirements of the region of interest (ROI).
- Image Type: Indicates whether an image is black and white or color, whether it is a line drawing, and whether it is clip art and of what quality.
- OCR & Read: Both features apply optical character recognition (OCR) technology to detect text in an image, which can be extracted for multiple purposes. The Read feature delivers the highest accuracy for printed and handwritten text extraction. Please refer to the documentation for the languages supported by OCR and Read.
- Spatial analysis: Understands how people move through a physical space in near-real time.
- Tag: Returns tags based on more than 2,000 recognizable objects, living beings, scenery, and actions. In cases where tags may be ambiguous or not common knowledge, the API response provides “hints” to clarify the meaning of the tag.
Each feature you select is counted as a transaction. There are a few special cases to note:
1. Analyze allows you to select multiple features at once. For instance, an Analyze call specifying the Tag, Face, and Adult features would count as three transactions.
2. Read allows you to upload multipage PDF documents. Each page is counted as a transaction. For instance, a 200-page document would count as 200 transactions.
3. All GET calls to see the results of the async Read and Recognize Text features are counted as transactions but are free of charge.
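
The three counting rules above can be sketched as a single function. The operation labels used here, including `"GetReadResult"`, are hypothetical names chosen for this example and are not identifiers from the service's API.

```python
def billable_transactions(operation, features=(), pages=1):
    """Return how many billable transactions one call produces.

    Operation names are illustrative labels, not real API identifiers.
    """
    if operation == "GetReadResult":
        # Polling for async results is counted as a transaction but not charged.
        return 0
    if operation == "Analyze":
        # One transaction per distinct feature requested.
        return len(set(features))
    if operation == "Read":
        # One transaction per page of the submitted document.
        return pages
    # Any other single-feature call is one transaction.
    return 1
```

Under these rules an Analyze call with Tag, Face, and Adult yields three billable transactions, and a 200-page Read yields 200.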

What will my bill look like?

Each operation that you call (either individually or through “Analyze”) will be counted as a transaction. The total bill will be based on the number of transactions for each type of operation within a monthly billing period.

As a specific example, let’s say you make the following calls in a certain monthly billing period:

1,500,000 Analyze operations, each calling the Tag, Face, and Describe operations
500,000 OCR operations
4,000,000 Recognize Text operations

Your total bill will be constructed as follows:

Operations	Resource	Calculations	Subtotal
1,500,000 Tag and 1,500,000 Face operations:	S1 transactions	First 1,000,000 transactions: $-/1,000 * 1,000,000 = $-; Remaining 2,000,000 transactions: $-/1,000 * 2,000,000 = $-	$-
500,000 OCR operations:	S2 transactions	$-/1,000 * 500,000 = $-	$-
1,500,000 Describe and 4,000,000 Recognize Text operations:	S3 transactions	$-/1,000 * 5,500,000 = $-	$-
Total		$-	$-
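
The grouping step in the table above (before any rates are applied) can be sketched as follows. The mapping of operations to the S1, S2, and S3 resources is taken from the table rows. The function name `transactions_per_resource` is hypothetical, and no prices are computed because the table shows "$-".

```python
from collections import Counter

# Operation-to-resource mapping as shown in the table rows above.
RESOURCE_FOR_OPERATION = {
    "Tag": "S1",
    "Face": "S1",
    "OCR": "S2",
    "Describe": "S3",
    "Recognize Text": "S3",
}


def transactions_per_resource(operation_counts):
    """Sum the per-operation counts into per-resource transaction totals."""
    totals = Counter()
    for operation, count in operation_counts.items():
        totals[RESOURCE_FOR_OPERATION[operation]] += count
    return dict(totals)


# Operation counts from the table above.
usage = {
    "Tag": 1_500_000,
    "Face": 1_500_000,
    "OCR": 500_000,
    "Describe": 1_500_000,
    "Recognize Text": 4_000_000,
}
totals = transactions_per_resource(usage)
```

The resulting totals match the volumes used in the Calculations column: 3,000,000 for S1, 500,000 for S2, and 5,500,000 for S3.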

