A lightweight, horizontally scalable key/value store that combines:
- ⚖️ Consistent hashing for near‑perfect shard balance
- 🔁 Configurable replication for availability / durability
- ⏱️ Per‑item TTL and LRU eviction to stay memory‑bounded
- 🌐 HTTP API so any language can consume it
- 🛠️ Pluggable storage backend (currently an in‑memory LRU drop‑in)
- 📊 Benchmark harness with uniform + Zipf workloads
**Status:** Demo‑ready / learning project — not production hardened (yet 😉).
## 📸 Architecture
```text
┌─────────────────────────┐      ┌─────────────────────────┐
│   Client / Benchmark    │      │  Client / Application   │
│ benchmarks/benchmark.py │ ...  │     (any language)      │
└──────────┬──────────────┘      └──────────┬──────────────┘
           │ HTTP GET/POST                  │ HTTP GET/POST
           ▼                                ▼
┌─────────────────────────┐ ┌─────────────────────────┐ ┌─────────────────────────┐
│    Shard #1 (Flask)     │ │    Shard #2 (Flask)     │ │    Shard #3 (Flask)     │
│  → LRUCache instance    │ │  → LRUCache instance    │ │  → LRUCache instance    │
└──────────┬──────────────┘ └──────────┬──────────────┘ └──────────┬──────────────┘
           ▲              consistent‑hash ring         ▲                ▲
           └──────────────Shard Manager / Replicator───┴────────────────┘
```
- **Shard Manager** → maps keys to shards via Ketama consistent hashing.
- **Replicator** → applies each write to N successive nodes on the hash ring.
- Every shard runs the same tiny Flask app wrapping an `LRUCache`.
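The routing idea can be pictured with a small Ketama‑style sketch. This is illustrative only — the class name `HashRing`, the virtual‑node count, and the MD5 hash are assumptions, not the actual `consistent_hash.py` API:

```python
import bisect
import hashlib

class HashRing:
    """Minimal Ketama-style consistent hash ring (illustrative sketch)."""

    def __init__(self, nodes, vnodes=100):
        self.vnodes = vnodes
        self._ring = []  # sorted list of (hash, node) points on the ring
        for node in nodes:
            self.add(node)

    @staticmethod
    def _hash(key: str) -> int:
        return int(hashlib.md5(key.encode()).hexdigest(), 16)

    def add(self, node: str) -> None:
        # Each physical node gets many virtual points for smoother balance.
        for i in range(self.vnodes):
            bisect.insort(self._ring, (self._hash(f"{node}#{i}"), node))

    def get_nodes(self, key: str, n: int = 1) -> list:
        """Walk clockwise from the key's position, collecting n distinct nodes."""
        if not self._ring:
            return []
        start = bisect.bisect(self._ring, (self._hash(key),))
        out = []
        for j in range(len(self._ring)):
            node = self._ring[(start + j) % len(self._ring)][1]
            if node not in out:
                out.append(node)
                if len(out) == n:
                    break
        return out

ring = HashRing(["shard1:5001", "shard2:5002", "shard3:5003"])
primary = ring.get_nodes("user:42")[0]      # the shard that owns this key
replicas = ring.get_nodes("user:42", n=2)   # primary + one successor replica
```

Because each lookup is a binary search over the sorted ring, adding or removing a shard only remaps the keys adjacent to its virtual points rather than reshuffling everything.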
## 🚀 Quick start
```bash
# 1) install deps (ideally in a venv)
python -m pip install -r requirements.txt

# 2) spin up a single shard on :5001
python main.py --port 5001 --capacity 100_000 --default-ttl 60

# or run the built‑in 3‑node demo
python main.py --demo

# 3) hammer it with the benchmark harness
python benchmarks/benchmark.py \
    --mode http \
    --endpoint-file endpoints.yaml \
    --threads 32 \
    --ops 50_000 \
    --keys 5_000 \
    --read-ratio 0.9
```

### 🔧 Common options
| Flag | Default | What it does |
|---|---|---|
| `--capacity` | `100_000` | Max entries before LRU eviction |
| `--default-ttl` | 60 seconds | TTL if caller omits `ttl` |
| `--threads` | 32 | Parallel workers in benchmark |
| `--workload` | `uniform` | `uniform` or `zipf` key distribution |
| `--skew` | 1.2 | Zipf θ parameter |
## 📑 HTTP API
| Verb | Route | Body | Meaning |
|---|---|---|---|
| GET | `/cache/<key>` | – | Returns JSON `{"value": ..}` or 404. |
| POST | `/cache/<key>` | `{"value": …, "ttl": <sec?>}` | Insert/overwrite; `ttl` optional. |
| GET | `/health` | – | Simple liveness probe. |
> **Note:** A write is proxied to N consecutive shards (see `config.REPLICATION_FACTOR`).
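The write fan‑out can be sketched end to end with plain dicts standing in for the HTTP calls to the Flask shards. Names here (`replicated_put`, `owners`, `REPLICATION_FACTOR` as a module constant) are illustrative, not the project's actual `replication.py` API:

```python
import hashlib

REPLICATION_FACTOR = 2

# Fake shards: a dict per node stands in for a Flask shard's LRUCache.
shards = {
    "shard1:5001": {},
    "shard2:5002": {},
    "shard3:5003": {},
}

def owners(key, n):
    """Stand-in for the hash ring: n consecutive shards from the key's position."""
    names = sorted(shards)
    start = int(hashlib.md5(key.encode()).hexdigest(), 16) % len(names)
    return [names[(start + i) % len(names)] for i in range(n)]

def replicated_put(key, value):
    """Apply the write to the primary and its N-1 successors."""
    for node in owners(key, REPLICATION_FACTOR):
        shards[node][key] = value   # real code would POST /cache/<key>

def get(key):
    """Read from the primary (real code could fall back to replicas)."""
    return shards[owners(key, 1)[0]].get(key)

replicated_put("user:42", "alice")
copies = sum("user:42" in store for store in shards.values())
# copies == REPLICATION_FACTOR: the value landed on exactly N shards
```

Writing to *consecutive* ring nodes means a reader who knows the ring can always find a replica by walking clockwise from the primary.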
## 📈 Sample benchmarks (2025‑07‑08, local laptop)
| Ops | Threads | Keys | P50 (ms) | P95 (ms) | Throughput (ops/s) |
|---|---|---|---|---|---|
| 50 000 | 32 | 5 000 | 2.16 | 18.0 | 1 770 |
| 100 000 | 64 | 10 000 | 37.6 | 42.0 | 1 640 |
Flask’s built‑in server isn’t tuned for high QPS; swap in Gunicorn or uvicorn for real deployments.
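For example, a shard could be served with Gunicorn worker processes instead of the dev server. This assumes the Flask app object in `cache_node.py` is exposed as `app` (the usual Flask convention — check the module before copying):

```shell
# Run one shard under Gunicorn with 4 workers (cache_node:app is an assumption).
gunicorn --workers 4 --bind 0.0.0.0:5001 cache_node:app
```

Note that each worker process gets its own `LRUCache`, so multi‑worker setups trade a unified in‑process cache for throughput.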
## 🗄️ Project layout
```text
.
├── benchmarks/          ← load‑testing harness & workloads
├── cache_node.py        ← minimal Flask + LRU (used by main.py)
├── lru_cache.py         ← fast O(1) doubly‑linked‑list + dict impl
├── consistent_hash.py   ← Ketama ring
├── shard_manager.py     ← client‑side routing logic
├── replication.py       ← fan‑out writes to replicas
├── config.py            ← capacity, TTL, replica factor, endpoint list
└── main.py              ← CLI: run one shard or 3‑node demo
```
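The core trick in `lru_cache.py` — a dict for O(1) lookup plus a doubly linked list for O(1) recency updates — can be condensed as follows. Here `collections.OrderedDict` stands in for the hand‑rolled linked list, and the TTL handling is illustrative, not the project's actual implementation:

```python
import time
from collections import OrderedDict

class LRUCache:
    """Condensed sketch of an O(1) LRU with per-item TTL (illustrative)."""

    def __init__(self, capacity, default_ttl=60.0):
        self.capacity = capacity
        self.default_ttl = default_ttl
        self._data = OrderedDict()  # key -> (value, expiry timestamp)

    def get(self, key):
        item = self._data.get(key)
        if item is None:
            return None
        value, expires = item
        if time.monotonic() >= expires:     # lazy TTL expiry on read
            del self._data[key]
            return None
        self._data.move_to_end(key)         # mark as most recently used
        return value

    def put(self, key, value, ttl=None):
        expires = time.monotonic() + (ttl if ttl is not None else self.default_ttl)
        self._data[key] = (value, expires)
        self._data.move_to_end(key)
        if len(self._data) > self.capacity:
            self._data.popitem(last=False)  # evict least recently used

cache = LRUCache(capacity=2)
cache.put("a", 1)
cache.put("b", 2)
cache.get("a")      # touch "a", so "b" is now least recently used
cache.put("c", 3)   # capacity exceeded: evicts "b"
```

Both `get` and `put` touch only a constant number of dict and list operations, which is what keeps each shard fast even at full capacity.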
## 🤝 Contributing
- Fork the repo and create a feature branch.
- Run `black` and `ruff` for formatting/lint.
- Make sure `pytest` passes.
- Open a PR — please include context, and before/after benchmarks if you’re touching the hot path.
## 📜 License
Distributed LRU Cache is released under the MIT License — see LICENSE.