Servers & CPU
4 Nodes / Xeon D-1528
An analysis of the K8s-Ceph-Production-Sim Cluster as of June 26, 2025.
This report details the performance benchmarks conducted on a newly provisioned 4-node Kubernetes cluster utilizing a Rook Ceph storage backend. The results indicate a healthy, stable, and resilient cluster, with performance consistent with expectations for its configuration (3x data replication over a 10GbE network). The primary performance bottleneck was identified as the 10GbE network, not the underlying NVMe storage. The cluster is performing well and is ready for production workloads that require high availability and data durability.
4 Nodes / Xeon D-1528
128 GB per node
8x Samsung 980 1TB NVMe
10GbE / 3x Replication
The benchmark results confirm that the cluster is healthy, resilient, and performing well for its configuration. The high-speed NVMe drives are significantly underutilized, with performance being limited by the 10GbE network fabric.
This is most evident in the write performance, which is impacted by Ceph's 3x replication factor. As illustrated, a single write request from an application results in three separate write operations across the cluster network, multiplying traffic and latency.
Recommendation
Future performance enhancements should focus on upgrading the network infrastructure (e.g., to 25GbE or higher) before considering faster storage.
(Over 10GbE Network)