Umbrella guidance: the workspace-root AGENTS.md is the source of truth for cross-repo thesis, boundaries, and rules. This file is the repo-specific authority for kin-infer.

kin-infer

Universal transformer inference engine in pure Rust. Custom GPU compute shaders, no external ML frameworks.

Build

cargo build                          # CPU only
cargo build --features metal         # macOS: Apple Metal GPU
cargo build --features cuda          # Linux/Windows: NVIDIA CUDA GPU
cargo build --features accelerate    # macOS: Accelerate BLAS (CPU)
cargo test --features metal          # run all tests including Metal GPU (Warning: can hit stale binary bugs)
./scripts/run-tests.sh               # RECOMMENDED: runs tests with clean environment (cleans stale binaries + GPU sweep)

Architecture

src/lib.rs — Core engine: model loading, forward pass, SIMD primitives (~2100 lines)
src/gpu.rs — GPU abstraction: GpuCompute trait, device discovery, CPU fallback
src/metal_backend.rs — Apple Metal: custom MSL compute shaders for matmul, softmax, norms, activations
src/cuda_backend.rs — NVIDIA CUDA: PTX kernels via driver API FFI (no toolkit needed at build time)

Feature flags

metal — Apple Metal GPU (macOS, M1/M2/M3). Deps: metal, objc2
cuda — NVIDIA CUDA via driver API (Linux/Windows). No build-time deps, just needs NVIDIA driver
accelerate — Apple Accelerate BLAS for CPU matmul
mkl — Intel MKL BLAS
openblas — OpenBLAS fallback

Key types

BertConfig / ModelArchitecture — model configuration and auto-detection
BertModel — loaded model with weights
KvCache — decoder-only KV cache for generation
SamplingParams — temperature, top-k, top-p, repetition penalty
gpu::GpuCompute — trait for GPU-accelerated tensor ops
gpu::GpuDeviceInfo — discovered GPU device information
gpu::discover_devices() — enumerate available GPUs
gpu::create_compute() — get best available compute backend

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

kin-infer

Build

Architecture

Feature flags

Key types

Uh oh!

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

kin-infer

Build

Architecture

Feature flags

Key types