Error in Virtual Machine Service

thanh 0 Reputation points
2026-06-29T09:14:07.1433333+00:00

Dear Azure Support Team,

I am writing to request an official Root Cause Analysis (RCA) regarding a critical infrastructure incident that occurred on our production Database Virtual Machine.

Incident Timeline & Symptoms:

  1. Initial Symptom: Our production database suddenly went offline. Upon checking the Azure Portal, we noticed that the VM status flagged a warning: “svhc-database-vm-prod virtual machine agent status is not ready” Screenshot 2026-06-28 at 12.41.14.png

Attempted Action: To restore services immediately, our team initiated a manual restart of the VM via the portal.

Infrastructure Failure: Following the restart attempt, the VM became completely unavailable, and the platform threw the following critical error message:

"We're sorry, your virtual machine isn't available and it is being redeployed due to an unexpected failure on the host server. Azure has begun the auto-recovery process and is currently starting the virtual machine on a different host."

Resolution: The automated recovery failed to restore the VM timely, forcing us to execute a manual Redeploy action to successfully migrate the instance to a new healthy hardware node.

Our Request: Our internal management requires a strict and formal Root Cause Analysis (RCA) for this downtime. The generic PlatformInitiated tag in our Activity Logs does not provide enough granular details to justify the hardware crash to our stakeholders.

Could you please investigate your underlying physical host server logs during this timeframe and provide us with:

  1. The exact technical cause of the physical host server failure (e.g., hardware fault, memory error, hypervisor crash).
  2. Why the VM Agent went into a "Not Ready" state prior to the platform-initiated event.
  3. Preventive measures Microsoft Azure is implementing to mitigate such unexpected blade node failures for our VM tier.

Looking forward to your detailed official RCA report.

Best regards,

Azure Virtual Machines
Azure Virtual Machines

An Azure service that is used to provision Windows and Linux virtual machines.


1 answer

Sort by: Most helpful
  1. Alex Burlachenko 23,250 Reputation points MVP Volunteer Moderator
    2026-06-30T11:51:13.69+00:00

    Hi thanh,

    Thx for sharing urs issue here at Q&A portal.

    For an official RCA, Microsoft Q&A won’t be enough. Only Azure Support can review the backend host logs and provide a formal incident explanation. From the message, Azure detected an unexpected physical host failure and tried auto-recovery. The VM agent Not Ready before the restart may have been an early symptom of the same host/guest communication problem, but only MS can confirm from platform logs. Look at Resource Health and Activity Log first, then open an Azure VM support case with VM name, region, UTC incident time, Activity Log event IDs, Resource Health event, and screenshots.

    Ask Support specifically for host-level investigation and RCA for the platform-initiated redeploy & auto-recovery event.

    Rgds,

    Alex

    &

    If my answer was helpful pls mark it and additional thx if u follow me at Q&A portal

    and at my blog https://ctrlaltdel.blog/

    Was this answer helpful?

    0 comments No comments

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.