GitHub - shannonxu-2018/PistaDB: The embedded vector database for LLM-native applications.

The embedded vector database for LLM-native applications.
RAG-ready · Zero dependencies · Single-file storage · MIT Licensed

Every LLM application eventually needs a vector database. For retrieval-augmented generation (RAG), semantic search, agent memory, or embedding caches — the standard answer is a cloud service or a containerized cluster. PistaDB disagrees.

Small, dense, and full of value — like the nut it's named after — PistaDB gives you a production-grade vector store in a single .pst file and a C library with zero dependencies. Ship it inside a desktop app. Bundle it in an edge device. Drop it next to your Python script. No Docker. No API keys. No data leaving the machine.

Why PistaDB?

	PistaDB	Cloud / Server Vector DB
Deployment	Copy a `.dll` / `.so`	Docker, Kubernetes, cloud subscriptions
Storage	One `.pst` file	Separate data + WAL + config + sidecar files
Privacy	All data stays local	Embeddings sent over the network
Memory	Configurable, minimal	GBs of JVM / runtime overhead
Dependencies	None (pure C99)	Dozens of packages
Latency	Sub-millisecond on a laptop	Network round-trips
Cost	Free forever (MIT)	Per-query or per-vector pricing

PistaDB is purpose-built for local RAG pipelines, offline AI agents, privacy-sensitive applications, edge inference, and anywhere shipping a full vector database cluster is impractical — which, honestly, is most places.

Key Features

8 Production-Ready Index Algorithms

Index	Algorithm	Best For
`LINEAR`	Brute-force exact scan	Ground truth, small embedding sets
`HNSW`	Hierarchical Navigable Small World	Recommended for RAG — best speed/recall tradeoff
`IVF`	Inverted File Index (k-means)	Large knowledge bases with a training budget
`IVF_PQ`	IVF + Product Quantization	Memory-constrained deployments
`DISKANN`	Vamana graph (DiskANN)	Billion-scale embedding collections
`LSH`	Locality-Sensitive Hashing	Ultra-low memory footprint
`SCANN`	Anisotropic Vector Quantization (Google ScaNN)	Maximum recall on MIPS / cosine workloads
`SQ`	Scalar Quantization (uint8)	4× memory & storage savings, no training needed

5 Distance Metrics — All LLM Embedding Models Covered

Metric	LLM / Embedding Use Case
`COSINE`	Text embeddings — OpenAI `text-embedding-3`, Cohere, `sentence-transformers`, BGE, GTE
`IP`	Inner product — embeddings already L2-normalised (same result as cosine, faster)
`L2`	Image / multimodal embeddings (CLIP, ImageBind)
`L1`	Sparse feature vectors, BM25-style hybrid retrieval
`HAMMING`	Binary embeddings, hash-based deduplication

Production Feature Set

SIMD-accelerated distance kernels — AVX2+FMA on x86-64, NEON on ARM, runtime-dispatched (4–8× scalar)
VecStore chunked storage — no scale ceiling; verified at 10 M vectors (HNSW) and 9 M full CRUD (IVF)
Transactions — ACID-style atomic multi-op groups with full undo-on-failure rollback
Multi-threaded batch insert — thread-pool + ring-buffer API for high-throughput embedding pipelines
Embedding cache — persistent LRU cache (.pcc) that eliminates redundant model calls
Single-file storage — CRC32-verified .pst format (lookup-table accelerated); atomic save, no partial writes
O(1) vector count — cached active-vector count maintained on insert/delete, no linear scans
Hardened internals — bitset bounds checks, heap empty-access guards, HNSW neighbor bounds validation on file load
9 language bindings — C, C++, Python, Go, Java, Kotlin, Swift, Objective-C, C#, Rust, WASM
109 / 109 tests passing across all features and platforms

Language & Platform Support

Language	Binding mechanism	Where to find it
C / C++	Direct `#include`	`src/pistadb.h` / `wrap/cpp/pistadb.hpp`
Python	`ctypes` (no Cython)	`wrap/python/`
Go	CGO	`wrap/go/`
Java	JNI	`wrap/android/src/main/java/`
Kotlin	JNI + extension functions	`wrap/android/src/main/kotlin/`
Objective-C	Direct C interop	`wrap/ios/Sources/PistaDBObjC/`
Swift	ObjC bridge	`wrap/ios/Sources/PistaDB/`
C#	P/Invoke	`wrap/csharp/`
Rust	FFI (`extern "C"`)	`wrap/rust/`
WASM	Emscripten / Embind	`wrap/wasm/`

Platform	Library output	ABI targets
Windows	`pistadb.dll`	x86_64
Linux	`libpistadb.so`	x86_64, aarch64
macOS	`libpistadb.dylib`	x86_64, arm64
Android	`libpistadb_jni.so`	arm64-v8a, armeabi-v7a, x86_64, x86
iOS / macOS	Static library (SPM)	arm64, arm64-Simulator, x86_64-Simulator
WASM	`.wasm`	— (planned)

Installation

1. Build the C Library

Windows (MSVC):

build.bat Release

Linux / macOS (GCC / Clang):

bash build.sh Release

Produces pistadb.dll (Windows) or libpistadb.so (Linux / macOS) with zero external dependencies.

2. Install the Python Binding

pip install -e wrap/python/

No Rust compiler. No CMake for the Python step. No surprises.

3. Android Integration

Open wrap/android/ as a library module in Android Studio, or declare it in settings.gradle:

include ':android'
project(':android').projectDir = new File('<path-to-PistaDB>/wrap/android')

The NDK build is handled automatically by wrap/android/CMakeLists.txt. Ensure NDK 26.x is installed and ndkVersion in wrap/android/build.gradle matches.

4. iOS / macOS Integration (Swift Package Manager)

In Xcode: File → Add Package Dependencies → point to this repository (or local checkout). Or add to your own Package.swift:

.package(path: "../PistaDB")

The Package.swift at the project root declares three targets — CPistaDB (C core), PistaDBObjC, and PistaDB (Swift) — wired together automatically by SPM.

5. WASM Integration

source /path/to/emsdk/emsdk_env.sh
cd wrap/wasm && bash build.sh
# → wrap/wasm/build/pistadb.js + pistadb.wasm

Serve both files from the same HTTP origin, or use directly in Node.js.

6. C++ Integration

add_subdirectory(PistaDB)
add_subdirectory(PistaDB/wrap/cpp)
target_link_libraries(my_app PRIVATE pistadb_cpp)

7. Go Integration

// go.mod
replace pistadb.io/go => ../PistaDB/wrap/go

export CGO_LDFLAGS="-L../PistaDB/build -lpistadb"
go get pistadb.io/go/pistadb
go build ./...

8. Rust Integration

cd wrap/rust
PISTADB_LIB_DIR=../../build cargo build --release

9. C# / .NET Integration

<!-- In your .csproj -->
<ItemGroup>
  <ProjectReference Include="../PistaDB/wrap/csharp/PistaDB.csproj" />
</ItemGroup>

# Windows: copy pistadb.dll next to your executable
copy build\Release\pistadb.dll MyApp\bin\Debug\net8.0\

# Linux: set LD_LIBRARY_PATH or copy libpistadb.so
export LD_LIBRARY_PATH=$PWD/build:$LD_LIBRARY_PATH

Quick Start

import numpy as np
from pistadb import PistaDB, Metric, Index, Params

params = Params(hnsw_M=16, hnsw_ef_construction=200, hnsw_ef_search=50)
db = PistaDB("mydb.pst", dim=1536, metric=Metric.COSINE, index=Index.HNSW, params=params)

vec = np.random.rand(1536).astype("float32")
db.insert(1, vec, label="chunk_0001")

query = np.random.rand(1536).astype("float32")
results = db.search(query, k=10)
for r in results:
    print(f"id={r.id}  dist={r.distance:.4f}  label={r.label!r}")

db.save()
db.close()

# Context manager — auto-closed on exit
with PistaDB("docs.pst", dim=768, metric=Metric.COSINE) as db:
    db.insert(1, vec, label="document excerpt")
    results = db.search(query, k=5)
    db.save()

For more examples — RAG pipelines, agent memory, advanced indexes, transactions, batch insert, embedding cache, and per-language integration guides — see the docs below.

Running Tests

# Windows
set PISTADB_LIB_DIR=build\Release
pytest tests\ -v

# Linux / macOS
PISTADB_LIB_DIR=build pytest tests/ -v

109 / 109 tests passing — recall benchmarks, roundtrip persistence, corrupt-file rejection, metric correctness, ScaNN two-phase search, and transaction atomicity / rollback.

Shuckr — Visual Database Browser

Shuckr is a standalone GUI tool for visually browsing and managing PistaDB .pst files — inspired by DB Browser for SQLite. Built with Python + PyQt6, it talks to the compiled native library (pistadb.dll / libpistadb.so) via ctypes.

Features: Create / open .pst files · Browse vectors with pagination · Insert / edit / delete vectors · k-NN search with random query generation · Database metadata & raw header inspection · Unsaved-changes tracking

Quick Start

cd Shuckr
pip install -r requirements.txt
python main.py

Or on Windows, simply double-click run.bat.

Screenshots

Database Info	Browse Data	Search

Documentation

Document	Contents
docs/examples.md	RAG pipelines, agent memory, all index types, transactions, batch insert, embedding cache
docs/language-bindings.md	Android, iOS/macOS, .NET, WASM, C++, Rust, Go — full integration guides
docs/benchmarks.md	Large-scale CRUD benchmarks, SIMD details, file format, project structure

Roadmap

Filtered search with metadata predicates (filter by source, date, tag before ANN)
LangChain and LlamaIndex integration (drop-in vectorstore)
Hybrid search: dense vector + sparse BM25 re-ranking in a single query
Full in-browser RAG pipeline via WASM (IDBFS persistence, SharedArrayBuffer workers)
HTTP microserver mode (optional, single binary, for multi-process access)

Contributing

Pull requests are warmly welcomed. Whether it's a new index algorithm, a language binding, a performance improvement, an LLM integration, or documentation — every contribution makes PistaDB better for the whole community.

Fork the repository
Create your feature branch (git checkout -b feat/langchain-integration)
Commit your changes
Open a Pull Request

Please ensure all 109 tests continue to pass before submitting.

Built in C99 · C++ · WASM · Python · Go · Java · Kotlin · Swift · Objective-C · C# · Rust · Runs anywhere · Keeps your data private
The best infrastructure for an LLM app is the kind you never have to think about.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Why PistaDB?

Key Features

8 Production-Ready Index Algorithms

5 Distance Metrics — All LLM Embedding Models Covered

Production Feature Set

Language & Platform Support

Installation

1. Build the C Library

2. Install the Python Binding

3. Android Integration

4. iOS / macOS Integration (Swift Package Manager)

5. WASM Integration

6. C++ Integration

7. Go Integration

8. Rust Integration

9. C# / .NET Integration

Quick Start

Running Tests

Shuckr — Visual Database Browser

Quick Start

Screenshots

Documentation

Roadmap

Contributing

About

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Shuckr		Shuckr
benchmarks		benchmarks
docs		docs
example_dbs		example_dbs
examples		examples
img		img
src		src
tests		tests
wrap		wrap
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
Package.swift		Package.swift
README.md		README.md
README_CN.md		README_CN.md
build.bat		build.bat
build.sh		build.sh

Folders and files

Latest commit

History

Repository files navigation

Why PistaDB?

Key Features

8 Production-Ready Index Algorithms

5 Distance Metrics — All LLM Embedding Models Covered

Production Feature Set

Language & Platform Support

Installation

1. Build the C Library

2. Install the Python Binding

3. Android Integration

4. iOS / macOS Integration (Swift Package Manager)

5. WASM Integration

6. C++ Integration

7. Go Integration

8. Rust Integration

9. C# / .NET Integration

Quick Start

Running Tests

Shuckr — Visual Database Browser

Quick Start

Screenshots

Documentation

Roadmap

Contributing

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages