Incident Report

Incident: gitlab.git.nrw outage

Christian Schild
Status: Resolved (severity: critical)

Incident Details

gitlab.git.nrw pilot

Started

May 7, 2025 14:30 +0200

Resolved

May 7, 2025 18:00 +0200

Duration

3h 30m

Affected Components

database gitlab
incident outage pilot

Impact

gitlab.git.nrw was unavailable for the entire incident window (14:30 to 18:00 +0200).

Root Cause

VSAN disk failures; VMs switched to read‑only; two of three DB nodes impacted.

Resolution

VSAN stabilized and writable filesystems restored; service back online since 18:00.

Additional Details

On May 7, 2025, gitlab.git.nrw was unavailable from approximately 14:30 to 18:00 CEST.

Multiple disk failures in the VSAN cluster in Münster caused several VMs to switch to read‑only filesystems. Two of the three database nodes were affected at the same time, which stopped GitLab.
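The report does not say how the three database nodes are coordinated. Assuming a majority-quorum cluster (an assumption, not confirmed here), the following minimal sketch shows why losing two of three nodes would stop writes. The function name `has_quorum` is hypothetical.

```python
def has_quorum(total_nodes: int, healthy_nodes: int) -> bool:
    """Return True if a strict majority of the cluster is healthy.

    Assumption: the database cluster needs a strict majority of its
    nodes to accept writes. The incident report does not confirm this.
    """
    return healthy_nodes > total_nodes // 2


total_nodes = 3      # three database nodes, per the report
impacted_nodes = 2   # two were read-only at the same time
healthy_nodes = total_nodes - impacted_nodes

print(has_quorum(total_nodes, healthy_nodes))  # False: 1 of 3 is not a majority
```

Under this assumption a single failed node would have been tolerated, since two healthy nodes out of three still form a majority.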

After we stabilized the VSAN cluster and restored writable filesystems, the service became fully available again at 18:00. As we are still in the pilot phase, we continue to harden the platform and monitor the systems closely.
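One way such monitoring could catch this failure mode early is to check for filesystems that have switched to read-only. The sketch below is illustrative only and is not the monitoring actually in use: it parses text in the Linux `/proc/mounts` format, and the function name `read_only_mounts` and the sample devices and paths are hypothetical.

```python
def read_only_mounts(mounts_text: str) -> list[str]:
    """Return the mount points whose mount options include 'ro'.

    Expects text in /proc/mounts format:
    device, mount point, filesystem type, options, dump, pass.
    """
    result = []
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        mount_point = fields[1]
        options = fields[3].split(",")
        if "ro" in options:
            result.append(mount_point)
    return result


# Hypothetical sample; on a real Linux host, read /proc/mounts instead.
sample = (
    "/dev/sda1 / ext4 ro,relatime 0 0\n"
    "/dev/sdb1 /var/opt/gitlab ext4 rw,relatime 0 0\n"
)
print(read_only_mounts(sample))  # ['/']
```

In practice some filesystems are mounted read-only on purpose, so a real check would likely filter by filesystem type or by a list of expected mount points before alerting.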