Monitoring
All interview questions related to Monitoring
Name two commonly used log aggregation tools and briefly explain their benefits.
What is distributed tracing, and why is it important in microservices architectures?
Explain the roles of metrics, logs, and traces in observability, and how they complement each other.
Design a monitoring and alerting system for a microservices architecture running on Kubernetes. Consider metrics, logs, traces, and alerting.
Design a platform to serve ML models at scale that supports versioning, monitoring, A/B testing, and efficient GPU utilization.
Your application's database response times have increased by 300% over the last hour. Users are complaining about slow page loads. How do you investigate and resolve this?
Your centralized logging pipeline stops ingesting logs from multiple services. How do you debug the problem?
Alerts based on log ingestion are delayed by 15 minutes. Walk through how you would diagnose and fix the pipeline slowness.