AdvancedScenario
15 min
etcd Disk Latency Causing Cluster Issues
KubernetesetcdStorage
Advertisement
Interview Question
Your Kubernetes etcd cluster shows high fsync latency, causing API server slowness. How do you troubleshoot and resolve?
Key Points to Cover
- Check etcd metrics for fsync/disk latency
- Correlate with node disk performance and saturation
- Migrate to SSD/provisioned IOPS or isolate disks
- Tune etcd compaction and defragmentation
- Monitor disk latency continuously with alerts
Evaluation Rubric
Collects etcd disk latency metrics30% weight
Links disk performance to cluster slowness30% weight
Suggests SSD/tuning solutions20% weight
Mentions monitoring/alerting20% weight
Hints
- 💡etcd heavily depends on low-latency disk writes.
Potential Follow-up Questions
- ❓How to run etcd benchmarks?
- ❓What about etcd compaction tuning?
Advertisement