Intermediate · Scenario
10 min
Cloud API Throttling (429) Causing Failures
Cloud · Rate Limiting · Reliability
Interview Question
Background jobs calling a cloud provider API start failing with 429 Too Many Requests during peak hours. How do you stabilize and prevent it?
Key Points to Cover
- Measure request rate vs provider quotas; identify burst patterns
- Implement client-side exponential backoff with jitter, and respect the Retry-After header
- Shard credentials/projects or request quota increases where appropriate
- Batch requests and add concurrency controls
- Add dashboards/alerts on 429 rates and latency
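The backoff point above can be sketched as follows. This is a minimal illustration, not any provider's SDK: `ThrottledError`, `backoff_delay`, and `call_with_retries` are hypothetical names, the `base` and `cap` values are arbitrary, and the sketch assumes the caller has already parsed the Retry-After header into seconds.

```python
import random
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


class ThrottledError(Exception):
    """Hypothetical stand-in for an HTTP 429 response from the provider."""

    def __init__(self, retry_after: Optional[float] = None):
        super().__init__("429 Too Many Requests")
        # Seconds to wait, parsed from the Retry-After header if it was present.
        self.retry_after = retry_after


def backoff_delay(attempt: int, retry_after: Optional[float] = None,
                  base: float = 0.5, cap: float = 30.0) -> float:
    """Return the delay in seconds before retrying after failed attempt `attempt` (0-based)."""
    if retry_after is not None:
        # The server's hint wins; a little jitter keeps clients from waking in lockstep.
        return retry_after + random.uniform(0, base)
    # "Full jitter": a random delay between 0 and the capped exponential bound.
    return random.uniform(0, min(cap, base * 2 ** attempt))


def call_with_retries(fn: Callable[[], T], max_attempts: int = 5,
                      sleep: Callable[[float], None] = time.sleep) -> T:
    """Call `fn`, retrying on ThrottledError up to `max_attempts` total attempts."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except ThrottledError as err:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the throttling error
            sleep(backoff_delay(attempt, err.retry_after))
    # Only reached when max_attempts < 1, since the loop body never runs.
    raise RuntimeError("max_attempts must be at least 1")
```

Injecting `sleep` keeps the retry loop testable without real waiting. In a real job, the retry budget would normally be paired with a concurrency limit so that retries do not add to the burst.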
Evaluation Rubric
Quantifies usage vs quotas: 30% weight
Implements robust retry/backoff: 30% weight
Batches/shards to reduce bursts: 20% weight
Monitors 429s and adjusts proactively: 20% weight
Hints
- 💡 Prefer token-bucket-style client-side throttling.
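One way to picture the token bucket hint is the sketch below. The `TokenBucket` class and its parameters are hypothetical, and the rate and capacity would need to be set somewhat below the provider's documented quota, which this sketch does not know.

```python
import threading
import time
from typing import Callable


class TokenBucket:
    """Client-side limiter: refills `rate` tokens per second, holds at most `capacity`."""

    def __init__(self, rate: float, capacity: float,
                 clock: Callable[[], float] = time.monotonic):
        if rate <= 0 or capacity <= 0:
            raise ValueError("rate and capacity must be positive")
        self.rate = rate
        self.capacity = capacity
        self._tokens = capacity  # start full so an initial burst is allowed
        self._clock = clock
        self._last = clock()
        self._lock = threading.Lock()

    def try_acquire(self, tokens: float = 1.0) -> bool:
        """Take `tokens` if available and return True; otherwise return False immediately."""
        with self._lock:
            now = self._clock()
            # Refill in proportion to elapsed time, never beyond capacity.
            self._tokens = min(self.capacity,
                               self._tokens + (now - self._last) * self.rate)
            self._last = now
            if self._tokens >= tokens:
                self._tokens -= tokens
                return True
            return False

    def acquire(self, tokens: float = 1.0,
                sleep: Callable[[float], None] = time.sleep) -> None:
        """Block until `tokens` can be taken."""
        if tokens > self.capacity:
            raise ValueError("cannot acquire more tokens than the bucket holds")
        while not self.try_acquire(tokens):
            # Wait about as long as the refill of the requested amount takes.
            sleep(tokens / self.rate)
```

A background worker would call `bucket.acquire()` before each API request. The capacity bounds the burst size and the rate bounds sustained throughput, which is why this shape tends to map well onto per-second or per-minute provider quotas.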
Potential Follow-up Questions
- ❓When to request quota increases?
- ❓How to test under quota constraints?