Your application's database response times have increased by 300% over the last hour. Users are complaining about slow page loads. How do you investigate and resolve this?

10 min•Scenario

View Question→

🔧 Troubleshooting Scenarios

Slow CDN Performance

Intermediate

CDN Networking Performance

Users in one region report very slow page loads, but the rest of the world is fine. How do you troubleshoot this CDN performance issue?

10 min•Scenario

View Question→

🔧 Troubleshooting Scenarios

Kafka Consumer Lag

Advanced

Kafka Messaging Performance

Your Kafka consumer groups are showing high lag and messages are processing slowly. How do you investigate and remediate this?

15 min•Scenario

View Question→

🔧 Troubleshooting Scenarios

API Latency Spike

Intermediate

API Latency Performance+1

Your API’s average latency jumped from 100ms to 2s without an increase in traffic. How would you investigate?

10 min•Scenario

View Question→

🔧 Troubleshooting Scenarios

Message Queue Backlog

Intermediate

Messaging Queues Performance

Your RabbitMQ/SQS queue has millions of unprocessed messages. What steps do you take?

10 min•Scenario

View Question→

🔧 Troubleshooting Scenarios

Node CPU Thrashing

Intermediate

Linux Performance Troubleshooting

One node in your cluster shows 100% CPU usage with context switching spikes. How do you troubleshoot?

10 min•Scenario

View Question→

🔧 Troubleshooting Scenarios

Cache Stampede / Thundering Herd

Intermediate

Caching Performance Reliability

A cache eviction triggers a surge of requests to the origin, causing overload. How do you diagnose and prevent cache stampede?

10 min•Scenario

View Question→

🔧 Troubleshooting Scenarios

Overlapping Cron Jobs Causing Backlog

Beginner

Scheduling Operations Performance

Nightly maintenance jobs overlap and create resource contention and backlog. Explain your triage and prevention.

5 min•Scenario

View Question→

🔧 Troubleshooting Scenarios

High CPU Steal Time on VMs

Intermediate

Cloud Linux Performance

Services on certain VMs show latency spikes correlated with CPU steal time. How do you investigate and mitigate?

10 min•Scenario

View Question→

🔧 Troubleshooting Scenarios

Thread Pool Exhaustion Causing Latency

Intermediate

Performance Concurrency APIs

Sudden latency spikes correlate with saturated server thread pools. How do you diagnose and remediate safely?

10 min•Scenario

View Question→

🔧 Troubleshooting Scenarios

CDN Invalidation Storm Causes Origin Overload

Intermediate

CDN Caching Performance

A misconfigured deployment invalidates most CDN cache objects at once, flooding the origin. What’s your triage and prevention plan?

10 min•Scenario

View Question→

🔧 Troubleshooting Scenarios

Storage IOPS Throttling

Intermediate

Storage Cloud Performance

An application shows sudden latency spikes due to cloud storage IOPS limits being hit. How do you confirm and fix?

10 min•Scenario

View Question→