Scaling Databases¶

Overview¶

Scaling databases means increasing read/write throughput and storage without sacrificing correctness goals. Common strategies include replication, partitioning, caching, and choosing specialized engines.

Why This Exists¶

Single-node databases hit vertical limits: one machine can only be given so much CPU, memory, and disk. Teams shard data, add replicas, and introduce caches, and each choice shifts failure modes and consistency guarantees.

How It Works¶

Study primary/replica replication, synchronous vs asynchronous, failover, partitioning/sharding keys, hot spots, read-after-write consistency, and global distribution trade-offs. Pair with System design — database scaling.
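One way to make read-after-write consistency concrete is to pin a user's reads to the primary for a short window after that user writes. The sketch below assumes asynchronous replication whose lag stays under a known bound. The class name `ReadRouter`, the `pin_seconds` parameter, and the `"primary"`/`"replica"` labels are hypothetical and not taken from any particular driver.

```python
import time


class ReadRouter:
    """Hypothetical router: send recent writers to the primary."""

    def __init__(self, pin_seconds: float = 5.0, clock=time.monotonic):
        self.pin_seconds = pin_seconds  # assumed upper bound on replica lag
        self.clock = clock              # injectable so tests can control time
        self._last_write = {}           # user_id -> time of that user's last write

    def record_write(self, user_id: str) -> None:
        self._last_write[user_id] = self.clock()

    def target_for_read(self, user_id: str) -> str:
        last = self._last_write.get(user_id)
        if last is not None and self.clock() - last < self.pin_seconds:
            return "primary"  # a replica may not have this user's write yet
        return "replica"
```

Other users still read from replicas and may briefly see stale data; this approach only guarantees that a writer sees their own write, and only while the lag assumption holds.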

Architecture¶

flowchart LR
    App --> Router[Query router]
    Router --> Shard1[(Shard 1)]
    Router --> Shard2[(Shard 2)]
    Shard1 --> Replica1[Replica]
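The topology in the diagram could be modelled roughly as follows: a router picks a shard from the key, sends writes to that shard's primary, and spreads reads over its replicas. This is a minimal sketch, not a real proxy. The names `Shard`, `QueryRouter`, and the node strings are hypothetical, and `zlib.crc32` is used only because it gives the same result in every process.

```python
import itertools
import zlib
from dataclasses import dataclass, field


@dataclass
class Shard:
    primary: str                                  # hypothetical node name
    replicas: list = field(default_factory=list)  # zero or more replica names

    def __post_init__(self):
        # Round-robin over replicas; fall back to the primary if there are none.
        self._readers = itertools.cycle(self.replicas or [self.primary])

    def node_for(self, is_write: bool) -> str:
        return self.primary if is_write else next(self._readers)


class QueryRouter:
    def __init__(self, shards):
        self.shards = shards

    def route(self, key: str, is_write: bool) -> str:
        # crc32 is stable across processes, unlike the built-in hash() on str.
        index = zlib.crc32(key.encode("utf-8")) % len(self.shards)
        return self.shards[index].node_for(is_write)
```

For example, `QueryRouter([Shard("shard1-primary", ["shard1-replica"]), Shard("shard2-primary")])` mirrors the diagram: only the first shard has a replica, so reads on the second shard go to its primary.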

Key Concepts¶

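Shard key selection

A poor shard key creates uneven partitions and undoes the benefit of scaling out. For telemetry workloads, a common choice is a composite key that combines user id with a time bucket, so that neither a single busy user nor the current time window concentrates all writes on one partition.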
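A composite key of user id plus time bucket might be computed as in the sketch below. The function name `telemetry_partition`, the one-hour default bucket, and the use of SHA-256 are illustrative assumptions; any stable hash would do.

```python
import hashlib
from datetime import datetime, timezone


def telemetry_partition(user_id: str, ts: datetime, num_partitions: int,
                        bucket_hours: int = 1) -> int:
    """Map (user_id, time bucket) to a partition index."""
    # Truncate the timestamp to a bucket so one user's events in the same
    # window land together, while other windows can land elsewhere.
    bucket = int(ts.timestamp()) // (bucket_hours * 3600)
    composite = f"{user_id}:{bucket}".encode("utf-8")
    digest = hashlib.sha256(composite).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions


event_time = datetime(2024, 1, 1, 10, 5, tzinfo=timezone.utc)
partition = telemetry_partition("user-17", event_time, num_partitions=16)
```

The trade-off is on the read side: fetching one user's full history now fans out across every partition that any of their buckets mapped to.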

Code Examples¶

Conceptual — routing pseudocode

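def shard_for_tenant(tenant_id: int, num_shards: int) -> int:
    # hash() of an int is stable across processes; str keys would need hashlib instead.
    # Plain modulo remaps most tenants whenever num_shards changes.
    return hash(tenant_id) % num_shards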
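Modulo routing moves most keys when the shard count changes. A consistent-hash ring is one widely used alternative, shown here as a swapped-in technique rather than something this page prescribes. The sketch assumes string shard names and keys; `HashRing`, `_point`, and the `vnodes` default are hypothetical.

```python
import bisect
import hashlib


def _point(label: str) -> int:
    # Stable 64-bit position on the ring for any string label.
    return int.from_bytes(hashlib.sha256(label.encode("utf-8")).digest()[:8], "big")


class HashRing:
    def __init__(self, shards, vnodes: int = 100):
        # Each shard owns many virtual points to even out the distribution.
        self._ring = sorted(
            (_point(f"{shard}#{i}"), shard)
            for shard in shards
            for i in range(vnodes)
        )
        self._points = [p for p, _ in self._ring]

    def shard_for(self, key: str) -> str:
        # First point clockwise from the key's hash, wrapping at the end.
        i = bisect.bisect_right(self._points, _point(key)) % len(self._ring)
        return self._ring[i][1]
```

When a shard is added to the ring, the only keys that change owner are those that move to the new shard; keys never shuffle between the existing shards.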

Interview Questions¶

How does leader-follower replication handle failures?

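When the leader fails, a follower is promoted. How much data is lost depends on the sync policy: with asynchronous replication, writes the leader acknowledged but had not yet shipped are lost, while synchronous replication preserves every acknowledged write. To avoid split-brain, where two nodes both act as leader, promotion should require a quorum and the old leader should be fenced off.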
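The two safeguards in this answer can be sketched in a few lines: promote only when a majority of the cluster is reachable, and have storage reject writes carrying an older epoch (a fencing token). This is an illustration under simplifying assumptions, not a consensus protocol; `Replica`, `choose_new_primary`, `Storage`, and the `applied_lsn` field are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class Replica:
    name: str
    applied_lsn: int        # last log position this replica has applied
    reachable: bool = True


def choose_new_primary(replicas, cluster_size: int):
    """Return the most caught-up reachable replica, or None without a quorum."""
    live = [r for r in replicas if r.reachable]
    # Require a strict majority of the whole cluster (failed primary included)
    # so that two network partitions cannot both promote a leader.
    if len(live) <= cluster_size // 2:
        return None
    return max(live, key=lambda r: r.applied_lsn)


class Storage:
    """Fencing: accept writes only from the newest epoch seen so far."""

    def __init__(self):
        self.highest_epoch = 0

    def write(self, epoch: int) -> bool:
        if epoch < self.highest_epoch:
            return False  # stale primary from before the failover
        self.highest_epoch = epoch
        return True
```

Picking the replica with the highest applied position keeps the loss as small as the surviving replicas allow; anything the old primary accepted beyond that position is the bounded loss under asynchronous replication.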

What is a thundering herd on cache expiry?

When a hot key expires, many requests miss the cache at the same moment and all hit the database to recompute the same value. Mitigate with probabilistic early expiry or request coalescing (singleflight).
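Probabilistic early expiry can be sketched as a single check made on each cache read: the closer the entry is to expiry, and the more expensive it is to recompute, the more likely a given request refreshes it ahead of time. The function name `should_refresh_early` and its parameters are hypothetical; the formula follows the commonly published form of this technique, with `beta` as a tuning knob.

```python
import math
import random
import time


def should_refresh_early(expiry: float, recompute_seconds: float,
                         beta: float = 1.0, now=None,
                         rand=random.random) -> bool:
    """Decide whether this request should recompute the value before expiry."""
    now = time.time() if now is None else now
    # -log(u) for u in (0, 1] is a non-negative random head start; scaling it
    # by the recompute cost makes expensive values refresh sooner.
    u = max(rand(), 1e-12)  # guard against log(0)
    return now - recompute_seconds * beta * math.log(u) >= expiry
```

Because each request draws its own random head start, refreshes are spread out over the period before expiry instead of all landing at the instant the key disappears. A `beta` above 1 favours earlier refreshes, below 1 later ones.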

Practice Problems¶

Design a sharding scheme for multi-tenant SaaS with per-tenant isolation
Compare read replicas vs caching for a read-heavy profile feed

Resources¶

Designing Data-Intensive Applications — replication and partitioning
AWS Database Blog — operational patterns