IntermediateScenario
10 min
High CPU Steal Time on VMs
CloudLinuxPerformance
Advertisement
Interview Question
Services on certain VMs show latency spikes correlated with CPU steal time. How do you investigate and mitigate?
Key Points to Cover
- Confirm steal time via vmstat, top, or hypervisor metrics
- Correlate with noisy neighbors or host contention
- Migrate workloads/instances or change instance family/placement
- Right-size CPU and enable CPU pinning/affinity if applicable
- Work with provider; add SLO alerts for steal time
Evaluation Rubric
Measures steal time correctly30% weight
Links contention to latency impact30% weight
Applies workload/infra mitigations20% weight
Adds ongoing monitoring/alerts20% weight
Hints
- 💡Dedicated hosts or CPU credits may help.
Potential Follow-up Questions
- ❓When to use dedicated instances?
- ❓How does cgroup CPU quota affect latency?
Advertisement