AdvancedSystem-Design
45 min
Design a Distributed Search Engine
SearchIndexingShardingReplication
Advertisement
Interview Question
Design a distributed search engine like Elasticsearch that supports indexing, querying, replication, and relevance ranking.
Key Points to Cover
- Indexing pipeline with analyzers, tokenization, stop-words
- Inverted index data structures, postings, compression
- Sharding and replication strategies for query load
- Relevance scoring models (TF-IDF, BM25, learning-to-rank)
- Cluster management, rebalancing, failover
Evaluation Rubric
Solid indexing pipeline & structures25% weight
Efficient query processing & ranking25% weight
Scalable sharding/replication model25% weight
Cluster mgmt and failover strategy25% weight
Hints
- 💡Think about write amplification and index refresh trade-offs.
Potential Follow-up Questions
- ❓How do you support phrase queries?
- ❓How to scale for multi-tenant workloads?
Advertisement