An Azure relational database service.
Hi Chandan,
Yes, a 5–10 minute delay can be expected in this type of design. Autoscale should not be treated as an instant failover mechanism, especially when the load ramp is aggressive and the backend is Azure SQL Elastic Pool.
Your current flow:
Alert -> Action Group -> Logic App -> Automation Runbook -> Scale
will usually add more delay than native autoscale. For App Service, I would use Azure Monitor autoscale or App Service automatic scaling directly instead of routing the scale action through a Logic App/runbook. For Azure SQL Elastic Pool, scaling vCores is also not something I would rely on during a live traffic spike.
Recommended approach:
Pre-scale before the load test or known traffic window Scale App Service instances and SQL Elastic Pool vCores before the test starts. Reactive scaling is usually too late once CPU/SQL throttling has already started.
Use native autoscale for App Service Configure minimum instance count, maximum instance count, and scale-out rules directly. Use lower thresholds or leading indicators such as requests, CPU, memory, response time, or queue length.
Keep SQL capacity ahead of demand For Azure SQL Elastic Pool, pre-size the pool for expected peak load. Also review query tuning, indexes, connection pooling, retry logic, and max pool settings in the application.
For Blazor Server/SignalR, validate connection scaling Blazor Server keeps active SignalR connections. Consider Azure SignalR Service and make sure ARR affinity/session behavior is understood during scale-out.
Use health checks and warm instances Enable App Service Health Check and Always On. If using App Service automatic scaling, configure always-ready/prewarmed capacity where applicable.
Do not target “zero downtime” only through autoscale Keep headroom, pre-scale for planned load, use retry logic, graceful degradation, caching, and back-pressure/rate limiting where appropriate.
Predictive autoscale in Azure Monitor is mainly for Virtual Machine Scale Sets, not Azure SQL Elastic Pool or App Service in this scenario. For this workload, scheduled/pre-scaling plus native App Service autoscale is the safer production pattern.