Cosmos DB Mongo API unreachable in North Europe & UK South – repeated outage, ECONNREFUSED

Suneth Thotagamuwa 40 Reputation points
2026-06-19T15:15:18.5266667+00:00

Problem Description: We are experiencing a critical outage affecting multiple Azure Cosmos DB Mongo API accounts across North Europe and UK South. All affected accounts suddenly became unreachable at the same time.

This is the second time we have experienced a major Cosmos DB Mongo API outage.

Current Symptoms:

Clients receive ECONNREFUSED when connecting to the MongoDB endpoint.

DNS resolution succeeds (nslookup resolves the hostname).

  • TCP connectivity tests from PowerShell fail with:
    • WARNING: Name resolution failed
    • TcpTestSucceeded : False
  • Issue is reproducible from multiple networks and machines.
  • Backend metrics (CPU, RU/s, storage) are normal for all accounts.
  • No firewall, VNet, or IP allowlist changes were made.
  • Multiple independent Cosmos DB Mongo accounts are impacted simultaneously across two regions.

Impact: This is a production‑impacting outage. Applications cannot connect to Cosmos DB at all. No read or write operations are possible. Given the previous incident resulted in significant data loss, this recurrence is extremely concerning.

Accounts/Regions Affected:

  • North Europe
  • UK South

Assessment: Based on the symptoms, this appears to be a Cosmos DB Mongo API gateway or regional routing outage affecting multiple regions simultaneously. The recurrence of this issue raises concerns about the reliability of the Mongo API gateway layer.

Azure Cosmos DB
Azure Cosmos DB

An Azure NoSQL database service for app development.


Answer accepted by question author

Manoj Kumar Boyini 17,950 Reputation points Microsoft External Staff Moderator
2026-06-22T09:13:47.0933333+00:00

Hi @Suneth Thotagamuwa

The Azure Cosmos DB for MongoDB vCore cluster was reported as unreachable from both a newly created Azure Function App and local client tools such as MongoDB Compass and Mongo Shell.

Findings

  • The Cosmos DB cluster was healthy and no server-side issues were identified.

Connectivity issues were traced to client-side configuration and network access settings.

Root Cause

The Azure Function App was configured with an incorrect connection string that pointed to a different endpoint. Updating the application to use the connection string provided in the Azure portal restored connectivity. Local client connectivity was affected by IP-based firewall rules and changes in the client public IP address due to VPN usage.

Observations

Existing Function Apps continued to work because established connections were already in use. New connection attempts failed due to endpoint configuration issues and firewall allowlist mismatches.

Recommendations

  • Always use the connection string generated in the Azure portal without modification.
  • Ensure the correct client public IP address is included in the firewall allowlist.
  • Verify outbound connectivity requirements, including access to port 10260.
  • Consider using a Private Endpoint for more secure and predictable connectivity.

The Function App connectivity issue has been resolved. Investigation of local client connectivity behavior is continuing based on telemetry and network validation.

Was this answer helpful?

1 person found this answer helpful.
0 comments No comments

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.