What Happened
On the evening of May 17, our production storage cluster experienced a near-simultaneous failure of 14 storage devices distributed across multiple physical hosts. The cluster, which uses Ceph (an industry-standard distributed storage system) to provide redundant storage for customer workloads, was unable to keep certain data ...
Continue reading