Scalability
All interview questions related to Scalability
Describe how you would design a Kubernetes architecture for multi-region high availability and low latency.
In a high-traffic microservices system using a distributed cache, how do you handle cache invalidation without breaking consistency?
Describe your approach for performing a zero-downtime database migration in production.
Explain how you would design and implement database sharding for a large-scale application.
What are the key design considerations for building resilient, event-driven systems at scale?
How do you scale a message queue system like Kafka or RabbitMQ to handle millions of messages per second?
What considerations would you make when scaling an API gateway for millions of requests per second?
How would you design a scalable ETL pipeline for processing terabytes of data daily with low latency?
How would you design data partitioning for a system that must handle billions of records with fast queries?
Design a global publish/subscribe system with millions of subscribers, durable delivery, and filtering.
Design a platform to serve ML models at scale with versioning, monitoring, A/B testing, and efficient GPU utilization.
Throughput bottlenecks appear on a subset of shards. Outline your approach to identify and mitigate hot partitions.
Outbound calls from your private subnets start failing intermittently. Investigation suggests SNAT port exhaustion. How do you confirm and fix it?
Share an example where your team faced scaling challenges due to sudden growth. What actions did you take to address them?