Local LLM Toolbox

A collection of lightweight setups for running LLMs locally.

Each subfolder is a self-contained environment for a specific inference backend. Pick the one that fits your hardware.

Backends

| Backend | Platform | Model Format | Description |
| --- | --- | --- | --- |
| llama.cpp | macOS, Linux | GGUF | Nix-based dev environment with GPU auto-detection, router mode, RPC clustering |
| Foundry Local | macOS, Windows | ONNX | Microsoft's ONNX Runtime optimized inference |
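
As a rough illustration of picking a backend by platform, the table above can be encoded as a small lookup. This is only a sketch and not part of the repository: the folder names `llama-cpp` and `foundry` are assumed to match the subfolders, and `candidate_backends` is a hypothetical helper. It considers the operating system only, so GPU and memory constraints still need to be checked by hand.

```python
import platform

# Platform support copied from the Backends table above.
# Keys are the assumed subfolder names; values use the OS names
# that platform.system() reports ("Darwin" is macOS).
BACKENDS = {
    "llama-cpp": {"platforms": {"Darwin", "Linux"}, "model_format": "GGUF"},
    "foundry": {"platforms": {"Darwin", "Windows"}, "model_format": "ONNX"},
}


def candidate_backends(system=None):
    """Return the backend folders whose listed platforms include `system`.

    Defaults to the OS of the machine running the script.
    """
    system = system or platform.system()
    return sorted(
        name for name, info in BACKENDS.items() if system in info["platforms"]
    )


if __name__ == "__main__":
    for name in candidate_backends():
        print(f"{name}: expects {BACKENDS[name]['model_format']} models")
```

On macOS both folders are candidates, so the choice there comes down to the model format you already have (GGUF or ONNX).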

See the Hardware Setup Guide for GPU driver configuration, AMD memory tuning, and distributed inference networking.
