Troubleshooting
All interview questions related to Troubleshooting
A pod is stuck in Pending state. What are the common causes and troubleshooting steps?
How do you quickly identify which processes are consuming the most CPU on a Linux server?
What is the difference between ping and traceroute, and when would you use each?
A server reports high memory usage. What quick checks do you perform to identify the cause?
Which command would you use to find the largest files/directories consuming disk space in a path?
What is the difference between readiness and liveness probes in Kubernetes?
What do the 1, 5, and 15 minute load averages indicate on Linux, and how do you interpret them relative to CPU cores?
A Kubernetes pod is stuck in a restart loop. Walk me through your systematic approach to diagnose and fix this issue.
Your production Kubernetes cluster shows unusually high CPU usage in multiple pods. Walk me through your investigation and mitigation steps.
Your workloads face intermittent connectivity failures across regions. Walk through your diagnostic and remediation approach.
How would you detect, troubleshoot, and mitigate deadlocks in a relational database system?
Your application's database response times have increased by 300% over the last hour. Users are complaining about slow page loads. How do you investigate and resolve this?
A pod has been stuck in Pending state for over 15 minutes. Walk me through how you would troubleshoot and resolve this issue.
One of your production nodes is reporting 100% disk usage and workloads are failing. How do you investigate and resolve this?
Your application logs show frequent database connection pool exhaustion errors. How do you investigate and fix this?
Your API’s average latency jumped from 100ms to 2s without an increase in traffic. How would you investigate?
One node in your cluster shows 100% CPU usage with context switching spikes. How do you troubleshoot?
Your centralized logging pipeline stops ingesting logs from multiple services. How do you debug?
On a Linux production node, `df -h` reports the filesystem nearly full, but `du -sh /` shows far less used space. How do you reconcile the discrepancy and free space safely?