Skip to main content

Preview: Automatic Scaling for App Service Web Apps

Published date: May 08, 2023

App Service has an automatic scaling capability that adjusts the number of running instances of your application based on incoming HTTP requests. This ensures that your web applications can handle varying levels of traffic. You have control over scaling settings, such as defining the minimum and maximum number of instances per web app, to optimize performance and avoid bottlenecks. The platform also addresses cold start issues with prewarmed instances that act as a buffer when scaling out, ensuring smooth performance transitions. automatic scaling is available for the Premium Pv2 and Pv3 pricing tiers, and charges are calculated per second using existing billing meters. 

Here are a few scenarios where you should use scale automatically:

  • You don't want to set up autoscale rules based on resource metrics.
  • You want your web apps within the same App Service plan to scale differently of each other.
  • A web app is connected to backend data sources like databases or legacy systems, which may not be able to scale as fast as the web app. Scaling automatically allows you to set the maximum number of instances your App Service plan can scale to. This setting helps where the backend is a bottleneck to scaling and is overwhelmed by the web app.

For details about Automatic Scaling Please refer to

  • App Service
  • Features

Related Products