Try a known public GGUF model first:

```bash
whichllm run "qwen 2.5 1.5b gguf"
```

## How much disk space does `run` need?

Normal ranking commands do not download model weights. They only cache Hugging
Face model metadata and benchmark metadata under the whichllm cache.

`whichllm run` downloads the selected GGUF file through `huggingface_hub`. The
required disk space is roughly the size of the selected GGUF file plus normal
Hugging Face cache overhead.
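
As a rough pre-flight check, the estimate above can be sketched in Python. This is an illustration only and not part of whichllm: the `has_room` helper, the 5% overhead margin, and the example file size are all assumptions. Take the real file size from the model's file listing on Hugging Face.

```python
import shutil
from pathlib import Path


def has_room(file_size_bytes: int, free_bytes: int, overhead: float = 0.05) -> bool:
    """Return True if free_bytes covers the GGUF file plus a safety margin.

    The 5% default margin is an assumption, not a measured figure.
    """
    needed = int(file_size_bytes * (1 + overhead))
    return free_bytes >= needed


if __name__ == "__main__":
    # Hypothetical size: a GGUF file of about 1.1 GB.
    gguf_size = 1_100_000_000
    # Checks the volume that holds the home directory; adjust the path
    # if your Hugging Face cache lives on a different volume.
    free = shutil.disk_usage(Path.home()).free
    print("enough space" if has_room(gguf_size, free) else "not enough space")
```

The comparison is kept separate from the `shutil.disk_usage` call so it can be checked without touching the filesystem.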

By default, Hugging Face stores downloaded files under:

```text
~/.cache/huggingface/hub
```

You can move that cache by setting `HF_HOME` (downloads then go to
`$HF_HOME/hub`) or `HF_HUB_CACHE` (which points at the hub cache directory
itself).
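
The lookup order for those two variables can be sketched as follows. This is a simplified illustration, not the `huggingface_hub` implementation: `resolve_hub_cache` is a hypothetical helper, and the real library consults additional environment variables that are ignored here.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def resolve_hub_cache(env: Optional[Mapping[str, str]] = None) -> Path:
    """Return the directory where downloaded files are expected to land.

    Simplified order: HF_HUB_CACHE, then HF_HOME/hub, then the default path.
    """
    env = os.environ if env is None else env
    if env.get("HF_HUB_CACHE"):
        return Path(env["HF_HUB_CACHE"]).expanduser()
    if env.get("HF_HOME"):
        return Path(env["HF_HOME"]).expanduser() / "hub"
    return Path.home() / ".cache" / "huggingface" / "hub"


if __name__ == "__main__":
    print(resolve_hub_cache())
```

Passing a plain dictionary instead of relying on `os.environ` makes it easy to see what a given setting would do before exporting it in your shell.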

Cleanup is handled by the Hugging Face cache tools:

```bash
hf cache scan
hf cache delete
```
251+ whichllm does not currently delete model files automatically after a run.
252+
227253## Ollama names do not match HuggingFace IDs
228254
229255JSON output returns HuggingFace repo IDs: