Current Features + Future Plans
This document covers current clustering capabilities and planned sharding features.
- Standalone Mode - Single node, embedded or server
- Hot Standby - 2-node primary/standby with WAL replication
- Raft Cluster - 3-5 node strong consistency cluster
- Multi-Region - Per-region Raft clusters with async cross-region replication
| Tier | Nodes | Capacity | Status |
|---|---|---|---|
| Embedded | 1 | ~10K nodes | ✅ Available |
| Standalone | 1-2 | ~1M nodes | ✅ Available |
| Raft Cluster | 3-5 | ~10M nodes | ✅ Available |
| Multi-Region | 6+ | ~100M nodes | ✅ Available |
| Sharded | 10+ | ~10B+ nodes | 🔮 Planned |
┌───────────────────────────────────────────────────────────┐
│ Coordinator Layer │
│ (Query routing, metadata management) │
└─────────────────────────┬─────────────────────────────────┘
│
┌─────────────────┼─────────────────┐
│ │ │
▼ ▼ ▼
┌─────────┐ ┌─────────┐ ┌─────────┐
│ Shard A │ │ Shard B │ │ Shard C │
│ (Raft) │ │ (Raft) │ │ (Raft) │
└─────────┘ └─────────┘ └─────────┘
- Label-based - Co-locate nodes with same labels
- Hash-based - Consistent hashing for even distribution
- Analytics-driven - Use k-means/Louvain for intelligent placement
- Query Routing - Automatic routing to relevant shards
- Cross-shard Queries - Scatter-gather for distributed queries
- Vector Index Distribution - Per-shard HNSW indexes
- Live Rebalancing - Zero-downtime shard migration
Support for mixed-capability nodes:
┌─────────────┐ ┌─────────────┐ ┌─────────────┐
│ Raspberry Pi│ │ Desktop PC │ │ GPU Server │
├─────────────┤ ├─────────────┤ ├─────────────┤
│ ✅ BM25 │ │ ✅ BM25 │ │ ✅ BM25 │
│ ✅ Graph │ │ ✅ Graph │ │ ✅ Graph │
│ ❌ Vector │ │ ✅ Vector │ │ ✅ Vector │
│ ❌ Embed │ │ ⚠️ Embed │ │ ✅ GPU │
└─────────────┘ └─────────────┘ └─────────────┘
- Capability-based routing - Route queries to capable nodes
- Workload-based balancing - Dynamic load distribution
- Data locality - Keep related data together
Geographic distribution with async cross-region replication:
- ✅ Per-region Raft clusters (strong local consistency)
- ✅ Cross-region WAL streaming (async replication)
- ✅ Conflict resolution strategies (
last_write_wins,manual) - ✅ Configurable cross-region sync modes (
async,quorum) - ✅ Region failover and promotion
Extensively tested for real-world network conditions:
- Extreme latency: 2000-3000ms spikes (cross-region scenarios)
- Packet loss: Up to 20% packet loss handling
- Data corruption: Detection and recovery
- Connection drops: Automatic reconnection
- Byzantine failures: Malicious data, replay attacks
- Reordering: Out-of-order packet handling
See Clustering Guide for setup instructions.
| Phase | Target | Features |
|---|---|---|
| Phase 1 | ✅ Done | Hot Standby, Raft Cluster |
| Phase 2 | ✅ Done | Multi-Region with async replication |
| Phase 3 | 2025 H2 | Sharding coordinator |
| Phase 4 | 2026 | Full sharding, heterogeneous clusters |
- Clustering Guide - Current clustering features
- Replication Architecture - Technical details
- Scaling - Current scaling options