Skip to content

v0.5.0

Choose a tag to compare

@Andyyyy64 Andyyyy64 released this 09 Mar 15:17
· 149 commits to main since this release

What's New

whichllm run — One-command chat

Download and chat with any model instantly. Auto-creates an isolated environment, installs dependencies, and starts an interactive session — zero manual setup.

whichllm run "qwen 2.5 1.5b gguf"
whichllm run  # auto-picks the best model for your hardware

Supports all formats: GGUF, AWQ, GPTQ, FP16/BF16.

whichllm snippet — Ready-to-run Python code

Print a copy-paste Python script for any model.

whichllm snippet "qwen 7b"

Improvements

  • Smarter model search: auto-picks top match by downloads instead of erroring on ambiguous queries
  • Shared helpers for model loading and search across commands
  • Refactored plan command to use shared search logic