v0.5.0
What's New
whichllm run — One-command chat
Download and chat with any model instantly. Auto-creates an isolated environment, installs dependencies, and starts an interactive session — zero manual setup.
whichllm run "qwen 2.5 1.5b gguf"
whichllm run # auto-picks the best model for your hardwareSupports all formats: GGUF, AWQ, GPTQ, FP16/BF16.
whichllm snippet — Ready-to-run Python code
Print a copy-paste Python script for any model.
whichllm snippet "qwen 7b"Improvements
- Smarter model search: auto-picks top match by downloads instead of erroring on ambiguous queries
- Shared helpers for model loading and search across commands
- Refactored
plancommand to use shared search logic