What is a “unit” and how can I scale my service?
Customers can scale API Management by adding and removing units. Each unit has capacity that depends on its tier. For example, the standard tier provides an estimated maximum throughput of approximately 2,500 requests per second. As you add additional units, capacity scales proportionally. For example, two units of standard provides approximately 5,000 requests per second.
Related questions and answers
The developer tier is for API Management trial, development and functional testing. Customers should not use this tier for production.
No. There is no on-premises deployment option available at this time, but you can vote on uservoice if you’d like this capability. However, you can certainly use Azure-based API management with on-premises systems and data.