Skip navigation

Media Services pricing

Encode, store, and stream video and audio at scale

Azure Media Services lets you deliver any media, on virtually any device, to anywhere in the world using the cloud. The collection of services providing encoding, live or on-demand streaming, content protection and indexing for video and audio content.

Explore pricing options

Apply filters to customise pricing options to your needs.

Prices are estimates only and are not intended as actual price quotes. Actual pricing may vary depending on the type of agreement entered with Microsoft, date of purchase, and the currency exchange rate. Prices are calculated based on US dollars and converted using Thomson Reuters benchmark rates refreshed on the first day of each calendar month. Sign in to the Azure pricing calculator to see pricing based on your current program/offer with Microsoft. Contact an Azure sales specialist for more information on pricing or to request a price quote. See frequently asked questions about Azure pricing.

Note: Live Video Analytics and Video Indexer have moved out of Azure Media Services to Azure Applied AI Services and been renamed Azure Video Analyzer and Azure Video Analyzer for Media, respectively. To learn more, visit the Azure Video Analyzer product page.

Video & Audio Analytics

Extract in a single job from your audio and video files using AI for video technologies.

Audio analysis is available in two bundles of features at different price points. The standard Audio Analyzer preset provides a richer set of metadata using a predefined set of AI-based audio analysis models, including speech transcription, speaker indexing, speech sentiment analysis, and keywords and formatted output captions or subtitles. The Basic Audio Analyzer preset provides a low-cost option to only extract speech transcription, and format output captions and subtitles. The Basic Audio Analyzer will produce two separate meters on your bill, including a line for transcription and a separate line for caption and subtitle formatting.

Not available Video analysis2 Audio Analysis2, 3 Basic Audio Analysis1, 2, 3
Price per input minute $- $- $-

1Rounded by second with a minimum of 15 seconds.

2Pricing for all analysis presets when used directly in the Azure Media Services v3 API is the same.

3When using the standard and basic audio analysis modes in data centres that do not have a local speech-to-text endpoint, additional in-region networking data transfer rates apply. The following regions currently do not have a local speech endpoint and require in-region data transfer to the nearest endpoint: Australia Southeast, Canada East, France South, Germany West Central, Germany North, India South, India West, Korea South, South Africa North, South Africa West, UAE Central, UAE North, UK West, China East, China North, China North 2, US Gov Texas and US Gov Arizona. The list of regions is subject to change as new speech endpoints are enabled.

Redactor

Anonymise videos by detecting and blurring the faces of selected individuals. Azure Media Redactor is ideal for use in public safety and news media scenarios. Pricing is based on the duration and resolution of the input file. Redactor takes advantage of your media reserved units to run video processing tasks in parallel when possible (Note: SLA for Redactor only applies on S3 media reserved units). See our documentation for more information.

Not available S1 S2 S3
Supported input resolutions1 640 x 480 and below 641 x 481 to 1280 x 720 1281 x 721 to 1920 x 1200
First 50,000 minutes/month2 $-/content minute ($-/hour) $-/content minute ($-/hour) $-/content minute ($-/hour)
Next 950,000 minutes (50K-1M minutes)/month $-/content minute ($-/hour) $-/content minute ($-/hour) $-/content minute ($-/hour)
Over 1,000,000 minutes/month $-/content minute ($-/hour) $-/content minute ($-/hour) $-/content minute ($-/hour)

1Both dimensions of the input resolution must fit within the meter’s supported input resolution, or a higher resolution will be metered (see FAQs for an example).

2One minute minimum usage. All usage is rounded up to the next minute.

Azure Media Indexer

Azure Media Indexer is being retired on 1 March 2023 and is being replaced by Azure Media Services video and audio analytics (see above in this page).

For migration details, please see documentation.

Video on Demand (VoD) encoding

Standard Encoder transcodes video and audio input files into output formats suitable for playback on a variety of devices, including smartphones, tablets, PCs, gaming consoles and televisions. See the standard encoder documentation for details on supported input and output formats, codecs and settings.

Standard Encoder1 Premium Encoder2
$- per output minute $- per output minute

1Media Services does not charge for cancelled or errored jobs. For example, a job that has reached 50% progress and is cancelled is not charged at 50% of the job minutes. You are only charged for finished jobs.

2The Premium Encoder is only available in the v2 API. Version 2 of the Azure Media Services API and the Premium Encoder feature will be retired on 29 February 2024. Please refer to the migration guide for recommended solutions.

Note: Enterprise Agreement customers should contact their Microsoft account manager for pricing.

H.264

H264 pricing

Resolution Price
SD $-
HD $-
4K $-

Audio add-on

Audio AAC $-

H.264 multipliers

SD 1 10 minutes of SD output counts as 10 minutes
HD 2 10 minutes of HD output counts as 20 minutes
4K 4 10 minutes of 4K output counts as 40 minutes
AAC 0.25 4 minutes of audio output counts as 1 minute

Pricing example: You have a QuickTime video at 1920 x 1080p resolution, 20 minutes in duration, that you encode to an MP4 output of the same length, with the video at 1920 x 1080p resolution and one audio track. The effective multiplier would be 2 (for HD) plus 0.25 (for audio), adding up to a multiplier of 2.25. You would be billed for a total of (20 minutes x 2.25) = 45.00 output minutes. If you used the Media Encoder Standard for this encoding, then your cost would be (45.00 output minutes x $- / output minute) = $-.

See FAQ for more examples.

HEVC

HEVC pricing

Speed Balanced Quality
<=30 frames/sec >30 frames/sec and <=60 frames/sec >60 frames/sec and <=120 frames/sec <=30 frames/sec >30 frames/sec and <=60 frames/sec >60 frames/sec and <=120 frames/sec <=30 frames/sec >30 frames/sec and <=60 frames/sec >60 frames/sec and <=120 frames/sec
SD $- $- $- $- $- $- $- $- $-
HD $- $- $- $- $- $- $- $- $-
4K $- $- $- $- $- $- $- $- $-
8K $- $- $- $- $- $- $- $- $-

Price per output minute

Audio add-on

Audio AAC $-

Base Standard Definition (SD) encoding meter: $-

HEVC multipliers

SD 1 10 minutes of SD output counts as 10 minutes
HD 2 10 minutes of HD output counts as 20 minutes
4K 4 10 minutes of 4K output counts as 40 minutes
8K 8 10 minutes of 8K output counts as 80 minutes
&lt;=30 frames/sec 1 10 minutes of <=30 frames/sec output counts as 10 minutes
>30 frames/sec and &lt;=60 frames/sec 1.25 10 minutes of > 30 and <=60 frames/sec output counts as 12.5 minutes
>60 frames/sec and &lt;=120 frames/sec 1.5 10 minutes of >60 and <=120 frames/sec output counts as 15 minutes
Speed 0.5 10 minutes of speed profile output counts as 5 minutes
Balanced 1 10 minutes balanced profile output counts as 10 minutes
Quality 2 10 minutes of quality profile output counts as 20 minutes
HEVC codec 4/1.5 (2.67) 10 minutes of HEVC codec output counts as 26.7 minutes
Audio AAC 0.25 4 minutes of audio output counts as 1 minute

Pricing examples

Q: How do I calculate the total costs based on the encoder multipliers?
A: To calculate the price per minute, you need to take all the multiplier values of the encoding preset and the regional base price (resolution * frames/sec * preset * HEVC codec * regional base price * minutes). As an example, I want to encode 10 minutes of video using HEVC codec with a resolution of HD at 30 frames per second (frames/sec) using the speed-optimised preset: HD = 2, 30 frames/sec = 1, speed = 0.5, HEVC = 2.67, minutes = 10 and a base price = $-. The formula is: 2 * 1 * 0.5 * 2.67 * 10 * $- = $- for 10 minutes of HEVC encoding.

Q: When I use multiple outputs, how do I calculate all of the applied multipliers?
A: You need to calculate the multiplier value per output based on the example above. Finally, you add all multipliers and multiply by the base price per minute in the region where you plan to run the job.

See FAQ for more examples.

Resolution thresholds3

SD is an output with less than 720 resolution.

HD is an output with greater than or equal to 720 and less than or equal to 1080 resolution.

4K is an output with greater than 1080 and less than or equal to 2304 resolution.

8K is an output with greater than 2304 and up to and including 4608 resolution.

3Resolutions thresholds are based on the product of the width * height. We do not currently distinguish between landscape and portrait orientation.

Additional charges

Storage and data transfer rates may also apply, and will be charged at standard Azure rates.

Live Events

Live events (also known as Channels in legacy APIs) enable you to stream content to your audience in near real time. Live events are used in conjunction with streaming endpoints (origin) to provide live streaming services. For details on streaming pricing, please refer to the streaming section of this page. Your options for live events are the following.

Live encoding type Basic pass-through Pass-through Standard live encoding Premium Live Encoding
Live event input Supports combined input up to 5Mbps, individual bitrates not to exceed 4Mbps. No video frame rate restriction. Supports combined input up to 60Mbps, individual bitrates not to exceed 20Mbps. No video frame rate restriction. Supports single bitrate input. Individual input bandwidth cannot exceed 20Mbps. Video frame rate cannot exceed 60 frames/second. Supports single bitrate input. Individual input bandwidth cannot exceed 20Mbps. Video frame rate cannot exceed 60 frames/second.
Output stream Same as the input stream Same as the input stream Up to 720p at 30 frames/sec Up to 1080p at 30 frames/sec
Multi-bitrate output Only if input is multi-bitrate content Only if input is multi-bitrate content Yes Yes
Running state price $-/minute ($-/hour) $-/minute ($-/hour) $-/minute ($-/hour) $-/minute ($-/hour)
Standby state price $-/minute ($-/hour) $-/minute ($-/hour) $-/minute ($-/hour) $-/minute ($-/hour)

For all live event types, billing is based on the amount of time they are in running or standby state, regardless of whether there is video being streamed through the service. For details on live event states and billing, please refer to the FAQ below.

Storage and data transfer rates may also apply and will be charged at standard Azure rates.

Live TranscriptionPreview

Live Transcription is designed to work in conjunction with live streaming workloads, as an add-on capability when using Pass-through Live Events or Standard/Premium Live Encoder Live Events. Billing is based on the amount of time the Live Event is in running state, whether or not there is video being streamed through the service. For details on live transcription and billing, please refer to the FAQ below. This preview feature is only available in specific regions. See documentation for details.

Instance Price
Live Transcription $-/minute ($-/hour)

Streaming

Deliver on-demand and live video streams to customers in multiple formats and at scale. Standard and Premium streaming services deliver content directly to a media player application or to a Content Delivery Network for further distribution.

Media Services customers choose either a standard endpoint or one or more premium streaming units, according to their needs. Standard streaming endpoint is suitable for most streaming workloads. It includes the same features as premium streaming units and scales outbound bandwidth automatically. Premium streaming units are suitable for advanced workloads, providing dedicated, scalable bandwidth capacity. Premium streaming units “stack”, meaning each unit enabled provides additional bandwidth capacity to the application. Standard streaming endpoint doesn’t stack – instead it scales bandwidth automatically based on bandwidth requirements. See more information.

Streaming is billed as the combination of streaming services and the quantity of data transferred. When Azure Content Delivery Network is enabled via Media Services portal or API for a streaming endpoint, standard Content Delivery Network pricing applies for all data transferred. When Azure Content Delivery Network isn’t enabled for a streaming endpoint, data transfer is charged at data transfer pricing.

Streaming units

Standard Streaming Endpoint Premium Streaming Units
Price (preview) 2 $-/day ($-/month 1) N/A
Price (per unit) N/A $-/day ($-/month 1)
Bandwidth Up to 600 Mbps from streaming endpoint and scales with Content Delivery Network Up to 200 Mbps/unit

1Billing is prorated hourly. 31-day billing month is used for the rate calculation.

2Preview pricing subject to change upon general availability.

Content protection

Secure your assets with PlayReady digital rights management (DRM), Widevine Modular licence delivery, Apple FairPlay Streaming or clear key Advanced Encryption Standard (AES) encryption. Pricing is based on the number of licences or keys issued by the service.

PlayReady

PlayReady
Price $-/100 licences

Widevine

Widevine
Price $-/100 licences

Advanced Encryption Standard (AES) Keys

AES Keys
Price $-/100 keys

Storage and data transfer rates may also apply, and will be charged at standard Azure rates.

FairPlay

FairPlay
Price $-/100 licences

Azure pricing and purchasing options

Connect with us directly

Get a walkthrough of Azure pricing. Understand pricing for your cloud solution, learn about cost optimisation and request a custom proposal.

Talk to a sales specialist

See ways to purchase

Purchase Azure services through the Azure website, a Microsoft representative or an Azure partner.

Explore your options

Additional resources

Media Services details

Learn more about Media Services features and capabilities.

Pricing calculator

Estimate your expected monthly costs for using any combination of Azure products.

SLA

Review the service-level agreement for this product.

Documentation

Review technical tutorials, videos and more Media Services resources.

  • Media reserved units were previously used to control encoding concurrency and performance. You no longer need to set media reserved units as the system will automatically scale up and down based on load for both V3 and V2 accounts. While there were previously charges for media reserved units, as of 17 April 2021, there are no longer any charges for accounts that have configuration for media reserved units.

  • No, you will only get charged based on output minutes. See the example below for more details.

  • No, you will not be charged if an encoding job fails.
  • Example 1—Joe has 30 encoding jobs for which he uses Media Encoder Standard (i.e. the standard encoder rate applies). Each job creates 10 output minutes (totaling 300 output minutes).

    • Output minutes (standard encoder): 300 x $-/minute = $-

    Example 2—Joe has a two-hour movie that is at 1920 x 1080 resolution, which he encodes with Media Encoder Standard using the default “H264 Multiple Bitrate 1080p” preset, so that he can stream the movie to iOS and Android devices. The encoding preset produces 3 HD output videos, 5 SD output videos, and 1 audio. The total multiplier for this encoding task would be (3x2 + 5 + 0.25) = 11.25. Total output minutes would therefore be 11.25 x 2 x 60 = 1,350. The total cost would be 1,350 x $-/minute = $-.

  • If you generate thumbnails as part of a regular encoding job, as in the sample here, then there is no additional charge for generating thumbnail images. If, however, you submit an encoding task that generates only thumbnails (i.e. output has no video or audio), then each image in the output asset is counted as one second (1/60 of a minute).

  • No, the charges for streaming units are prorated on a per-hour basis. If there is any usage within the hour, the streaming unit will be billed for the entire hour. To understand how much a streaming project will cost, please view our online calculator.

  • A streaming unit provides a dedicated set of resources for content streaming, with each unit increasing guaranteed bandwidth throughput by up to 200 Mbps.

  • Yes, you can purchase multiple streaming units for content streaming.

  • You are charged based on the highest number of streaming units that are provisioned each hour. In this example, if you made the change at 3:30 p.m., you will be charged for two units until 3 p.m. on that day and for four units starting at 3 p.m.

  • It depends on the current state of a live event. Possible values include:

    • Stopped. This is the initial state of the live event after its creation. In this state, the live event properties can be updated but streaming is not allowed.
    • Starting. The live event is being started. No updates or streaming are allowed during this state. If an error occurs, the live event will return to the “Stopped” state.
    • Running. The live event is capable of processing live streams.
    • Stopping. The live event is being stopped. No updates or streaming are allowed during this state.
    • Deleting. The live event is being deleted. No updates or streaming are allowed during this state.
    • Allocating. The allocate action was called on the live event and resources are being provisioned for this live event. Once this operation has been done successfully, the live event will transition to Standby state.
    • StandBy. The live event resources have been provisioned and are ready to start. Billing occurs in this state. Most properties can still be updated; however, ingest or streaming is not allowed during this state.

    The following table shows how the states map to the billing mode.

    Live event state Billed?
    Starting No (transient state)
    Running Yes
    Stopping No (transient state)
    Stopped No
    Allocating No (transient state)
    StandBy Yes
  • Scenario: Stella wants to stream a 2-hour sporting event and intends to use a live event with Standard live encoding

    1. Stella sets up the live event and starts it 20 minutes before the event begins. The live event is in the “Starting” state for 5 minutes.
      • Stella is not billed during this time
    2. After starting, the live event moves into the “Running” state ready to receive a stream. The live event remains in the “Running” state for 15 minutes.
      • Stella does get billed for this period
    3. The event begins streaming on time and lasts for 2 hours or 120 minutes.
      • Stella gets billed for this period
    4. Stella stops the video feed but does not stop the live event. Stella leaves the Live Event in this mode for 5 minutes.
      • Stella does get billed for this period
    5. Stella then streams a post-game programme for 30 minutes.
      • Stella gets billed for this period
    6. After the post-game programme, Stella immediately stops the live event and deletes it.
      • Stella is not billed during this time

    In total, the live event was in the “Running” state for 170 minutes (15 minutes with no stream before the event + 120 minute event + 5 minutes with no stream after event + 30 minute post-game show). The charges are:

    170 minutes x ~$- = ~$-

  • Both edges of the input video must fit within the supported input resolution of each meter. For example, a 600 x 481 video exceeds the maximum 640 x 600 resolution of S1 by virtue of both edges being greater than 480, and so would be subject to S2 metering.

  • No, you can use Standard live encoding in this case. It is the product of height and width that needs to be at or below 1280*720. You will only be billed at Standard live encoding rates.
  • You will be charged based on the type of your live event. In this case, you would choose Standard live encoding, and get billed at the Standard live encoding rates, even if the output video is not 1280x720.

    Note that if you choose to configure the live event to use Premium live encoding, you will be billed at this rate.

  • You are billed for Live Transcription when you enable this feature while using a Live Event to stream, for example, a webinar. Your account is billed for the duration that the Live Event is in a Running state. Possible values for the state of a Live Event include:

    • Stopped. This is the initial state of the live event after its creation. In this state, the live event properties can be updated but streaming is not allowed.
    • Starting. The live event is being started. No updates or streaming are allowed during this state. If an error occurs, the live event will return to the “Stopped” state.
    • Running. The live event is capable of processing live streams.
    • Stopping. The live event is being stopped. No updates or streaming are allowed during this state.
    • Deleting. The live event is being deleted. No updates or streaming are allowed during this state.

    The following table shows how the states map to the billing mode.

    Live event state Billed?
    Starting No (transient state)
    Running Yes
    Stopping No (transient state)
    Stopped No
  • Scenario: Stella wants to stream a 2-hour webinar and intends to use a Live Event with Standard Live Encoding, with Live Transcription enabled.

    1. Stella sets up the live event and starts it 20 minutes before the event begins. The live event is in the “Starting” state for 5 minutes.
      • Stella is not billed during this time
    2. After starting, the live event moves into the “Running” state ready to receive a stream. The live event remains in the “Running” state for 15 minutes.
      • Stella does get billed for this period, for both Standard Live Encoding and Live Transcription. She will be billed regardless of whether or not the audio feed has any spoken words in this period.
    3. The event begins streaming on time and lasts for 2 hours or 120 minutes.
      • Stella gets billed for this period, for both Standard Live Encoding and Live Transcription
    4. Stella stops the video feed but does not stop the live event. Stella leaves the Live Event in this mode for 5 minutes.
      • Stella does get billed for this period, for both Standard Live Encoding and Live Transcription
    5. Stella then streams a question-and-answer (Q&A) session for 30 minutes.
      • Stella gets billed for this period, for both Standard Live Encoding and Live Transcription
    6. After the Q&A session, Stella immediately stops the Live Event and deletes it.
      • Stella is not billed during this time

    In total, the Live Event was in the “Running” state for 170 minutes (15 minutes with no stream before the event + 120 minute event + 5 minutes with no stream after event + 30 minute Q&A). The charges are: 170 minutes x ~$- = ~$- (170 minutes x (Standard Live Encoding Rate + Live Transcription Rate)

  • As an example, when you analyse audio using the Basic Audio Analysis and upload one hour of content the bundled price (Media Transcription + Caption & Subtitle) per hour = $-. You will be billed for $- in total, but it will be reported as two different names, $- for Media Transcription + $- for Captions and Subtitles).

Talk to a sales specialist for a walk-through of Azure pricing. Understand pricing for your cloud solution.

Get free cloud services and $200 in credit to explore Azure for 30 days.

Added to estimate. Press 'v' to view on calculator