A high-performance, feature-rich Go caching library with generics, layered caching, and serve-stale mechanism.
- 🛡️ Cache Stampede Protection - Singleflight + DoubleCheck mechanisms eliminate redundant fetches, preventing traffic surges when hot keys expire
- 🚫 Cache Penetration Defense - Not-Found caching mechanism prevents malicious queries from overwhelming the database
- 🔄 Serve-Stale - Serves stale data while asynchronously refreshing, ensuring high availability and low latency
- 🎪 Layered Caching - Flexible multi-level caching (L1 Memory + L2 Redis), Client can also be used as upstream
- 🚀 High Performance - Sub-microsecond latency, 79x–1729x throughput amplification, 0% error rate
- 🎯 Type-Safe - Go generics provide compile-time type safety, avoiding runtime type errors
- ⏱️ Flexible TTL - Independent fresh and stale TTL configuration for precise data lifecycle control
- 🔧 Extensible - Clean interface design makes it easy to implement custom cache backends
```bash
go get github.com/theplant/cachex
```

```go
package main

import (
	"context"
	"fmt"
	"time"

	"github.com/theplant/cachex"
)

type Product struct {
	ID    string
	Name  string
	Price int64
}

func main() {
	// Create data cache
	cacheConfig := cachex.DefaultRistrettoCacheConfig[*cachex.Entry[*Product]]()
	cacheConfig.TTL = 30 * time.Second // 5s fresh + 25s stale
	cache, _ := cachex.NewRistrettoCache(cacheConfig)
	defer cache.Close()

	// Create not-found cache
	notFoundConfig := cachex.DefaultRistrettoCacheConfig[time.Time]()
	notFoundConfig.TTL = 6 * time.Second // 1s fresh + 5s stale
	notFoundCache, _ := cachex.NewRistrettoCache(notFoundConfig)
	defer notFoundCache.Close()

	// Define upstream data source
	upstream := cachex.UpstreamFunc[*cachex.Entry[*Product]](
		func(ctx context.Context, key string) (*cachex.Entry[*Product], error) {
			// Fetch from database or API
			// Return cachex.ErrKeyNotFound for non-existent keys
			product := &Product{ID: key, Name: "Product " + key, Price: 9900}
			return &cachex.Entry[*Product]{
				Data:     product,
				CachedAt: time.Now(),
			}, nil
		},
	)

	// Create client with all features enabled
	client := cachex.NewClient(
		cache,
		upstream,
		cachex.EntryWithTTL[*Product](5*time.Second, 25*time.Second), // 5s fresh, 25s stale
		cachex.NotFoundWithTTL[*cachex.Entry[*Product]](notFoundCache, 1*time.Second, 5*time.Second),
		cachex.WithServeStale[*cachex.Entry[*Product]](true),
	)

	// Use the cache
	ctx := context.Background()
	entry, _ := client.Get(ctx, "product-123")
	fmt.Printf("Product: %+v\n", entry.Data)
}
```

```mermaid
sequenceDiagram
participant App as Application
participant Client as cachex.Client
participant Cache as BackendCache
participant NFCache as NotFoundCache
participant SF as Singleflight
participant Upstream
App->>Client: Get(key)
Client->>Cache: Get(key)
alt Cache Hit + Fresh
Cache-->>Client: value (fresh)
Client-->>App: Return value
else Cache Hit + Stale (serveStale=true)
Cache-->>Client: value (stale)
Client-->>App: Return stale value
Client->>SF: Async refresh
SF->>Upstream: Fetch(key)
Upstream-->>SF: new value
SF->>NFCache: Del(key)
SF->>Cache: Set(key, value)
else Cache Hit + Stale (serveStale=false) or TooStale
Cache-->>Client: value (stale/too stale)
Note over Client: Skip NotFoundCache, fetch directly<br/>(backend has data)
Client->>SF: Fetch(key)
SF->>Upstream: Fetch(key)
Upstream-->>SF: value
SF->>NFCache: Del(key)
SF->>Cache: Set(key, value)
SF-->>Client: value
Client-->>App: Return value
else Cache Miss
Cache-->>Client: miss
Client->>NFCache: Check NotFoundCache (if configured)
alt NotFound Hit + Fresh
NFCache-->>Client: not found (fresh)
Client-->>App: Return ErrKeyNotFound
else NotFound Hit + Stale (serveStale=true)
NFCache-->>Client: not found (stale)
Client-->>App: Return ErrKeyNotFound (stale)
Client->>SF: Async recheck
SF->>Upstream: Fetch(key)
alt Key Still Not Found
Upstream-->>SF: ErrKeyNotFound
SF->>Cache: Del(key)
SF->>NFCache: Set(key, timestamp)
else Key Now Exists
Upstream-->>SF: value
SF->>NFCache: Del(key)
SF->>Cache: Set(key, value)
end
else NotFound Hit + Stale (serveStale=false) or TooStale or Miss
NFCache-->>Client: stale/too stale/miss
Client->>SF: Fetch(key)
SF->>Upstream: Fetch(key)
alt Key Exists
Upstream-->>SF: value
SF->>NFCache: Del(key)
SF->>Cache: Set(key, value)
SF-->>Client: value
Client-->>App: Return value
else Key Not Found
Upstream-->>SF: ErrKeyNotFound
SF->>Cache: Del(key)
SF->>NFCache: Set(key, timestamp)
SF-->>Client: ErrKeyNotFound
Client-->>App: Return ErrKeyNotFound
end
end
end
```
- Client - Orchestrates caching logic, TTL, and refresh strategies (Client itself implements Cache interface and can also be used as upstream)
- BackendCache - Storage layer (Ristretto, Redis, GORM, or custom), also serves as Upstream interface
- NotFoundCache - Dedicated cache for non-existent keys to prevent cache penetration
- Upstream - Data source (database, API, another Client, or custom)
- Singleflight - Deduplicates concurrent requests for the same key (primary defense against cache stampede)
- DoubleCheck - Re-checks backend and notFoundCache before upstream fetch to catch concurrent writes (eliminates race window)
- Entry - Wrapper with timestamp for time-based staleness checks
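For reference, the upstream contract mirrors the function shape used with UpstreamFunc in the Quick Start. A sketch of what that pair can look like (the library's actual definitions may differ in detail):

```go
// Sketch of the upstream contract; see the package docs for the real definitions.
type Upstream[T any] interface {
	Fetch(ctx context.Context, key string) (T, error)
}

// UpstreamFunc adapts a plain function to the Upstream interface,
// which is why a func literal can be passed to cachex.UpstreamFunc above.
type UpstreamFunc[T any] func(ctx context.Context, key string) (T, error)

func (f UpstreamFunc[T]) Fetch(ctx context.Context, key string) (T, error) {
	return f(ctx, key)
}
```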
RistrettoCache: a high-performance, TinyLFU-based in-memory cache.

```go
config := cachex.DefaultRistrettoCacheConfig[*Product]()
config.TTL = 30 * time.Second
cache, err := cachex.NewRistrettoCache(config)
if err != nil {
	// handle error
}
defer cache.Close()
```

RedisCache: a distributed cache with customizable serialization.

```go
cache := cachex.NewRedisCache[*Product](
	redisClient,
	"product:",     // key prefix
	30*time.Second, // TTL
)
```

GORMCache: use your database as a cache layer (useful for persistence).

```go
cache := cachex.NewGORMCache(
	db,
	"cache_products",
	30*time.Second,
)
```

Custom backends: implement the Cache[T] interface.

```go
type Cache[T any] interface {
	Set(ctx context.Context, key string, value T, ttl time.Duration) error
	Get(ctx context.Context, key string) (T, error)
	Del(ctx context.Context, key string) error
}
```

Important: when a key does not exist, the Get method must return cachex.ErrKeyNotFound so the Client can correctly distinguish cache misses from other error conditions.
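A minimal sketch of a custom in-memory backend that satisfies this contract (MapCache is a hypothetical example, assuming imports of context, sync, time, and github.com/theplant/cachex; no eviction, and TTL is checked lazily on Get):

```go
// MapCache is a toy backend demonstrating the Cache[T] contract.
// Not production-ready: no eviction, no size limit.
type MapCache[T any] struct {
	mu    sync.RWMutex
	items map[string]mapItem[T]
}

type mapItem[T any] struct {
	value     T
	expiresAt time.Time
}

func NewMapCache[T any]() *MapCache[T] {
	return &MapCache[T]{items: make(map[string]mapItem[T])}
}

func (c *MapCache[T]) Set(ctx context.Context, key string, value T, ttl time.Duration) error {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.items[key] = mapItem[T]{value: value, expiresAt: time.Now().Add(ttl)}
	return nil
}

func (c *MapCache[T]) Get(ctx context.Context, key string) (T, error) {
	c.mu.RLock()
	it, ok := c.items[key]
	c.mu.RUnlock()
	if !ok || time.Now().After(it.expiresAt) {
		var zero T
		// Required contract: report missing or expired keys with ErrKeyNotFound.
		return zero, cachex.ErrKeyNotFound
	}
	return it.value, nil
}

func (c *MapCache[T]) Del(ctx context.Context, key string) error {
	c.mu.Lock()
	defer c.mu.Unlock()
	delete(c.items, key)
	return nil
}
```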
Combine multiple cache layers for optimal performance. Client implements both Cache[T] and Upstream[T] interfaces, allowing it to be used directly as upstream for the next layer:
```go
// L2: Redis cache with database upstream
l2Cache := cachex.NewRedisCache[*cachex.Entry[*Product]](
	redisClient, "product:", 10*time.Minute,
)

dbUpstream := cachex.UpstreamFunc[*cachex.Entry[*Product]](
	func(ctx context.Context, key string) (*cachex.Entry[*Product], error) {
		product, err := fetchFromDB(ctx, key)
		if err != nil {
			return nil, err
		}
		return &cachex.Entry[*Product]{
			Data:     product,
			CachedAt: time.Now(),
		}, nil
	},
)

l2Client := cachex.NewClient(
	l2Cache,
	dbUpstream,
	cachex.EntryWithTTL[*Product](1*time.Minute, 9*time.Minute),
)

// L1: In-memory cache with the L2 client as upstream.
// A Client can be used directly as the upstream for the next layer.
l1Cache, _ := cachex.NewRistrettoCache(
	cachex.DefaultRistrettoCacheConfig[*cachex.Entry[*Product]](),
)
defer l1Cache.Close()

l1Client := cachex.NewClient(
	l1Cache,
	l2Client, // Client implements Upstream[T], use directly
	cachex.EntryWithTTL[*Product](5*time.Second, 25*time.Second),
	cachex.WithServeStale[*cachex.Entry[*Product]](true),
)

// Read: L1 miss → L2 → Database (if L2 also misses)
product, _ := l1Client.Get(ctx, "product-123")
```

When you use a Client as the upstream for another Client, write operations (Set/Del) automatically propagate through all cache layers, stopping naturally when the upstream doesn't implement Cache[T]:
```
L1 Cache → L2 Cache → L3 Cache → Database
   ✅          ✅          ✅        ❌ (auto-stop)
```
The propagation works through type-based detection: if the upstream implements the Cache[T] interface, writes propagate; if it doesn't (e.g. an UpstreamFunc wrapping a data source), propagation stops.
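The idea can be sketched in a few lines of Go; this is illustrative only, not cachex's actual internals (propagateSet is a hypothetical helper, and Cache[T] is the interface shown earlier):

```go
// propagateSet writes to the local layer, then forwards the write only
// when the upstream is itself a cache layer (e.g. another Client).
func propagateSet[T any](ctx context.Context, local Cache[T], upstream any, key string, value T, ttl time.Duration) error {
	if err := local.Set(ctx, key, value, ttl); err != nil {
		return err
	}
	if next, ok := upstream.(Cache[T]); ok {
		return next.Set(ctx, key, value, ttl) // next layer is a cache: keep propagating
	}
	return nil // plain data source (e.g. UpstreamFunc): stop here
}
```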
Pattern Support:

This design naturally supports both caching patterns:

- Write-Through Pattern (Multi-Level Caches):

  ```go
  // All cache layers stay in sync
  l1Client.Set(ctx, key, value) // → L1 → L2 → ... → (stops at data source)
  ```

- Cache-Aside Pattern (Cache + Database):

  ```go
  // Update the database first, then the cache
  db.Update(user)
  l1Client.Set(ctx, userID, user) // Only updates cache layers, not the DB
  ```

The key insight: cache writes propagate through Cache[T] chains but stop when the upstream doesn't implement Cache[T], making the same mechanism safe and correct for both patterns.
Prevent repeated lookups for non-existent keys:
```go
notFoundCache, _ := cachex.NewRistrettoCache(
	cachex.DefaultRistrettoCacheConfig[time.Time](),
)
defer notFoundCache.Close()

client := cachex.NewClient(
	dataCache,
	upstream,
	cachex.EntryWithTTL[*Product](5*time.Second, 25*time.Second),
	cachex.NotFoundWithTTL[*cachex.Entry[*Product]](
		notFoundCache,
		1*time.Second, // fresh TTL
		5*time.Second, // stale TTL
	),
)
```

Define custom staleness checks:
```go
client := cachex.NewClient(
	cache,
	upstream,
	cachex.WithStale[*Product](func(p *Product) cachex.State {
		age := time.Since(p.UpdatedAt)
		if age < 5*time.Second {
			return cachex.StateFresh
		}
		if age < 5*time.Second + 25*time.Second {
			return cachex.StateStale
		}
		return cachex.StateTooStale
	}),
	cachex.WithServeStale[*Product](true),
)
```

Transform between different cache types:
```go
// Cache stores JSON strings
stringCache := cachex.NewRedisCache[string](redisClient, "user:", time.Hour)

// Transform to User objects
userCache := cachex.JSONTransform[string, *User](stringCache)

// Use as Cache[*User]
user, err := userCache.Get(ctx, "user:123")
```

See BENCHMARK.md for detailed results.
| Scenario | Concurrency | Application QPS | Cache Hit Rate | P50 | P99 | DB Conn Pool | DB QPS | DB Utilization | Amplification | Errors |
|---|---|---|---|---|---|---|---|---|---|---|
| High Perf DB | 600 | 504,989 | 99.81% | 291ns | 3.3µs | 100 | 982.5 | 88.4% | 514.0x | 0% |
| Cloud DB | 100 | 55,222 | 99.61% | 833ns | 12µs | 20 | 213.8 | 90.9% | 235.0x | 0% |
| Shared DB | 100 | 7,306 | 98.59% | 791ns | 831ms | 13 | 103.0 | 99.0% | 70.2x | 0% |
| Constrained DB | 100 | 695 | 94.01% | 1.3µs | 2.04s | 8 | 41.6 | 98.8% | 16.7x | 0% |
💡 Cold Start Performance: Cachex achieves 94%+ cache hit rate even during cold start without pre-warming. With cache pre-warming, throughput can increase dramatically (99%+ hit rate → minimal DB load).
🔥 Test Environment Simulation: All benchmark scenarios use a semaphore-based database connection pool simulation to model real-world database behavior.
📊 Throughput Amplification = Application QPS / Theoretical DB Capacity, where Theoretical DB Capacity = Conn Pool / (Latency / 1000ms).
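For example, a database limited to 20 connections with an average query latency of 100ms has a theoretical capacity of 20 / (100ms / 1000ms) = 200 QPS; an application serving 50,000 QPS from cache on top of it would show a 250x amplification (illustrative numbers, not taken from the table above).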
Q: When should I use Entry[T] vs a custom staleness check?

A: Use Entry[T] with EntryWithTTL for simple time-based expiration. Use custom staleness checkers when you need domain-specific logic (e.g., checking a version field).
Q: How does cachex prevent cache stampede?

A: Cachex uses a two-layer defense based on the philosophy of concurrent exploration + result convergence:

- Singleflight with Concurrency Control (Primary):
  - Exploration phase: when the cache misses, WithFetchConcurrency allows N concurrent fetches to maximize throughput
  - Default (N=1): full deduplication; only one fetch runs, the others wait (99%+ redundancy elimination)
  - N > 1: moderate redundancy; requests are distributed across N slots for higher throughput
- DoubleCheck (Supplementary):
  - Handles the narrow race window where Request B checks the cache (miss) before Request A completes its write
  - Works across all singleflight slots, enabling fast convergence after the first successful fetch
  - Auto-enabled by default when notFoundCache is configured (smart detection)
  - Configure with WithDoubleCheck(DoubleCheckEnabled/Disabled/Auto) based on your scenario, as sketched below
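A configuration sketch combining these knobs; the exact generic signatures of WithFetchConcurrency and WithDoubleCheck are assumed here to follow the same pattern as the other options, so verify against the package documentation:

```go
client := cachex.NewClient(
	cache,
	upstream,
	cachex.EntryWithTTL[*Product](5*time.Second, 25*time.Second),
	// Assumed signatures:
	cachex.WithFetchConcurrency[*cachex.Entry[*Product]](4),                 // up to 4 concurrent fetches per key
	cachex.WithDoubleCheck[*cachex.Entry[*Product]](cachex.DoubleCheckAuto), // default smart detection
)
```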
Q: What is the difference between fresh TTL and stale TTL?

A: Fresh TTL defines how long data is considered fresh. Stale TTL defines an additional period during which data can be served as stale (with async refresh). Total lifetime = freshTTL + staleTTL.
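For example, with EntryWithTTL[*Product](5*time.Second, 25*time.Second) from the Quick Start, an entry moves through three states:

```
0s --------- 5s ----------------------- 30s ----------->
    fresh        stale: served as-is,       too stale:
    (serve)      refreshed asynchronously   synchronous fetch
```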
Q: Should I cache everything?

A: No. Cache frequently accessed, relatively static data. Avoid caching:
- Data that changes frequently (< 1s freshness requirement)
- User-specific data with high cardinality
- Large objects that don't fit in memory efficiently
This project is licensed under the MIT License - see the LICENSE file for details.