Cognitive Services Pricing—Computer Vision API
Use intelligence APIs to enable vision, speech, language, and knowledge capabilities
- No upfront cost
- No termination fees
- Pay only for what you use
US government entities are eligible to purchase Azure Government services from a licensing solution provider with no upfront financial commitment, or directly through a pay-as-you-go online subscription.
Important—The price in R$ is merely a reference; this is an international transaction and the final price is subject to exchange rates and the inclusion of IOF taxes. An eNF will not be issued.
Azure Germany is available to customers and partners who have already purchased it and who do business in the European Union (EU), the European Free Trade Association (EFTA), or the United Kingdom (UK). It provides data residency in Germany with additional levels of control and data protection. You can also sign up for a free Azure trial.
Computer Vision is not available in the following regions: Australia Central, Australia Central 2, Australia Southeast, Canada East, France South, Germany Central, Germany Northeast, Korea South, South India, UK West, and West India. Please select another region.
This state-of-the-art, cloud-based API gives developers access to advanced algorithms that extract rich information from images in order to categorize and process visual data. Capabilities include image analytics, tagging, celebrity recognition, text extraction, and smart thumbnail generation.
Pricing details
The pricing below for Brand goes into effect on January 29, 2019.
Instance | Transactions Per Second (TPS)** | Features | Price |
---|---|---|---|
Free - Web/Container | 20 per minute | | 5,000 transactions free per month |
S1 - Web/Container | 10 TPS | Tag, Face, GetThumbnail, Color, Image Type, GetAreaOfInterest | 0-1M transactions: $- per 1,000 transactions; 1M-5M transactions: $- per 1,000 transactions; 5M-10M transactions: $- per 1,000 transactions; 10M-100M transactions: $- per 1,000 transactions; 100M+ transactions: $- per 1,000 transactions |
S1 - Web/Container | 10 TPS | OCR, Adult, Celebrity, Landmark, Detect Objects, Brand | 0-1M transactions: $- per 1,000 transactions; 1M-5M transactions: $- per 1,000 transactions; 5M-10M transactions: $- per 1,000 transactions; 10M-100M transactions: $- per 1,000 transactions; 100M+ transactions: $- per 1,000 transactions |
S1 - Web/Container | 10 TPS | Describe+, Recognize Text* | $- per 1,000 transactions |
Support & SLA
- Free billing and subscription management support are included.
- We guarantee that Cognitive Services running in the standard tier will be available at least 99.9 percent of the time. No SLA is provided for the free trial. Read the SLA.
FAQ
-
Please refer to the documentation for more detailed descriptions of these operations.
- Tag: Computer Vision API returns tags based on more than 2,000 recognizable objects, living beings, scenery, and actions. In cases where tags may be ambiguous or not common knowledge, the API response provides “hints” to clarify the meaning of the tag.
- Face: Detects human faces within a picture.
- GetThumbnail: After an image is uploaded, GetThumbnail generates a high-quality thumbnail. The Computer Vision API algorithm analyzes the objects within the image, then crops the image to fit the requirements of the region of interest (ROI).
- Color: The Computer Vision algorithm extracts colors from an image. The colors are analyzed in three different contexts: foreground, background, and whole. The colors are grouped into 12 dominant accent colors.
- Image Type: Computer Vision API can set a Boolean flag to indicate whether an image is black and white or color, and in the same way whether an image is a line drawing. Image Type also indicates whether an image is clip art, and its quality.
- OCR: Optical Character Recognition (OCR) technology detects text content in an image and automatically detects its language. The identified text is extracted into a machine-readable character stream for search and numerous other purposes, ranging from medical records to security and banking. OCR saves time and provides convenience for users by allowing them to simply take photos of text instead of transcribing it. Please refer to the documentation for supported languages.
- Adult: Apply the adult/racy settings to enable automated restriction of adult content in images.
- Celebrity: Azure’s celebrity recognition model recognizes 200,000 celebrities from business, politics, sports, and entertainment around the world.
- Analyze: Call multiple operations at once. Specify which functions you want to run and the API will run all of these together. Each operation included in “Analyze” will be counted as a separate transaction.
-
For Recognize Text, each POST call counts as a transaction. All GET calls to see the results of the async service are counted as transactions but are free of charge. For all other operations, each feature call counts as a transaction, whether called independently or grouped through the Analyze call. Analyze calls make the API easier to call, but each feature used still counts as a transaction. For instance, an Analyze call containing Tag, Face, and Adult would count as three transactions.
Please refer to the documentation for the complete list and detailed descriptions of operations.
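The counting rules above can be sketched in a few lines of Python. This is only an illustration: the `(method, operation, features)` tuple layout and the function name `billable_transactions` are assumptions made for this example, not part of the Computer Vision API.

```python
from collections import Counter

def billable_transactions(calls):
    """Count chargeable transactions per feature.

    `calls` is a hypothetical log of (http_method, operation, features)
    tuples; this layout is an assumption made for illustration only.
    """
    counts = Counter()
    for method, operation, features in calls:
        if operation == "Recognize Text":
            # Only the POST that submits the image is charged;
            # GET calls that poll for results are free of charge.
            if method == "POST":
                counts[operation] += 1
        elif operation == "Analyze":
            # Every feature requested through Analyze is its own transaction.
            counts.update(features)
        else:
            # A feature called on its own is one transaction.
            counts[operation] += 1
    return counts

calls = [
    ("POST", "Analyze", ["Tag", "Face", "Adult"]),
    ("POST", "Recognize Text", []),
    ("GET", "Recognize Text", []),
    ("POST", "OCR", []),
]
print(billable_transactions(calls))
```

Here the single Analyze call contributes three transactions (Tag, Face, and Adult), matching the example in the text, while the GET poll for Recognize Text adds nothing to the chargeable count.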
-
Each operation that you call (either individually or through “Analyze”) will be counted as a transaction. The total bill will be based on the number of transactions for each type of operation within a monthly billing period.
As a specific example, let’s say you make the following calls in a certain monthly billing period:
- 1,500,000 Analyze operations, each calling the Tag, Face, and Describe operations
- 500,000 OCR operations
- 4,000,000 Recognize Text operations
Your total bill will be constructed as follows:
Operations | Calculations |
---|---|
1,500,000 Tag and 1,500,000 Face operations | First 1,000,000 transactions: $-/1,000 * 1,000,000 = $-; remaining 2,000,000 transactions: $-/1,000 * 2,000,000 = $- |
500,000 OCR operations | $-/1,000 * 500,000 = $- |
1,500,000 Describe and 4,000,000 Recognize Text operations | $-/1,000 * 5,500,000 = $- |
Total | $- |
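The calculation above can be sketched in Python as follows. The page shows every price as "$-", so all rates below are made-up placeholders chosen only to make the arithmetic concrete; they are not actual Azure prices. The tier boundaries come from the pricing table, and the graduated treatment (each tier's rate applies only to the transactions that fall inside that tier) follows the "first 1,000,000 / remaining 2,000,000" split in the example.

```python
from decimal import Decimal

# Upper bound of each tier, from the pricing table; None means "no upper bound".
TIER_LIMITS = [1_000_000, 5_000_000, 10_000_000, 100_000_000, None]

# HYPOTHETICAL rates in dollars per 1,000 transactions (the source shows "$-").
TAG_FACE_RATES = [Decimal("1.00"), Decimal("0.80"), Decimal("0.65"),
                  Decimal("0.65"), Decimal("0.40")]
OCR_RATES = [Decimal("1.50"), Decimal("1.20"), Decimal("0.98"),
             Decimal("0.98"), Decimal("0.60")]
DESCRIBE_RECOGNIZE_RATE = Decimal("2.50")  # flat rate, also hypothetical

def graduated_cost(transactions, rates):
    """Charge each tier's rate only on the transactions inside that tier."""
    cost = Decimal(0)
    lower = 0
    for limit, rate in zip(TIER_LIMITS, rates):
        upper = transactions if limit is None else min(transactions, limit)
        if upper > lower:
            cost += rate * (upper - lower) / 1000
        if limit is None or transactions <= limit:
            break
        lower = limit
    return cost

# Volumes from the worked example.
tag_face = graduated_cost(1_500_000 + 1_500_000, TAG_FACE_RATES)
ocr = graduated_cost(500_000, OCR_RATES)
describe_recognize = DESCRIBE_RECOGNIZE_RATE * (1_500_000 + 4_000_000) / 1000

total = tag_face + ocr + describe_recognize
print(f"Tag/Face: {tag_face}, OCR: {ocr}, "
      f"Describe/Recognize Text: {describe_recognize}, total: {total}")
```

`Decimal` is used instead of floats so that the per-tier amounts add up without rounding drift. To estimate a real bill, replace the placeholder rates with the current prices for your region.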