Release v0.5.0 · Andyyyy64/whichllm

What's New

`whichllm run` — One-command chat

Download and chat with any model instantly. Auto-creates an isolated environment, installs dependencies, and starts an interactive session — zero manual setup.

whichllm run "qwen 2.5 1.5b gguf"
whichllm run  # auto-picks the best model for your hardware

Supports all formats: GGUF, AWQ, GPTQ, FP16/BF16.

`whichllm snippet` — Ready-to-run Python code

Print a copy-paste Python script for any model.

whichllm snippet "qwen 7b"

Improvements

Smarter model search: auto-picks top match by downloads instead of erroring on ambiguous queries
Shared helpers for model loading and search across commands
Refactored plan command to use shared search logic

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.5.0

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

What's New

`whichllm run` — One-command chat

`whichllm snippet` — Ready-to-run Python code

Improvements

Uh oh!

v0.5.0

What's New

whichllm run — One-command chat

whichllm snippet — Ready-to-run Python code

Improvements

Uh oh!

`whichllm run` — One-command chat

`whichllm snippet` — Ready-to-run Python code