Commit 893b3d1

docs: clarify run disk cache usage
1 parent 9c96c2f commit 893b3d1

1 file changed

Lines changed: 26 additions & 0 deletions

docs/troubleshooting.md

@@ -224,6 +224,32 @@ Try a known public GGUF model first:
whichllm run "qwen 2.5 1.5b gguf"
```

## How much disk space does `run` need?

Normal ranking commands do not download model weights. They cache Hugging Face
model metadata and benchmark metadata under the whichllm cache.

`whichllm run` downloads the selected GGUF file through `huggingface_hub`. The
required disk space is roughly the selected GGUF file size plus normal Hugging
Face cache overhead.

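As a rough pre-flight check, that estimate can be compared against the free space on the volume that holds the cache. The following is a minimal sketch and not part of whichllm: `has_room` is a hypothetical helper, the 2 GiB figure is a placeholder for the GGUF size you expect, and `$HOME` is checked on the assumption that the default cache location lives on the same filesystem.

```bash
# has_room BYTES DIR: succeed if DIR's filesystem has at least BYTES free.
# Hypothetical helper for illustration; not a whichllm or Hugging Face command.
has_room() {
  local required_kib available_kib
  # Round the requirement up to whole KiB.
  required_kib=$(( ($1 + 1023) / 1024 ))
  # POSIX df output: line 2, column 4 is the available space in 1 KiB blocks.
  available_kib=$(df -Pk "$2" | awk 'NR==2 {print $4}')
  [ "$available_kib" -ge "$required_kib" ]
}

# Placeholder size: 2 GiB. $HOME is used because the cache directory
# itself may not exist before the first download.
if has_room $((2 * 1024 * 1024 * 1024)) "$HOME"; then
  echo "enough free space"
else
  echo "not enough free space"
fi
```

The check is only a lower bound, since it ignores the cache overhead mentioned above and anything else written to the same volume during the download.
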
By default, Hugging Face stores downloaded files under:

```text
~/.cache/huggingface/hub
```

You can move that cache by setting `HF_HOME` or `HF_HUB_CACHE`.

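How the two variables interact can be sketched as a small resolver. This assumes the precedence described in the `huggingface_hub` documentation: `HF_HUB_CACHE` wins if set, otherwise the cache is `$HF_HOME/hub`, otherwise it falls back to the default under `XDG_CACHE_HOME` or `~/.cache`. `hf_cache_dir` is a hypothetical helper name and `/mnt/models/huggingface` is a placeholder path.

```bash
# hf_cache_dir: print the directory where downloaded files are expected to land.
# Hypothetical helper mirroring the assumed huggingface_hub precedence.
hf_cache_dir() {
  if [ -n "${HF_HUB_CACHE:-}" ]; then
    # Most specific setting: the hub cache directory itself.
    printf '%s\n' "$HF_HUB_CACHE"
  elif [ -n "${HF_HOME:-}" ]; then
    # HF_HOME relocates all Hugging Face data; the hub cache sits in hub/.
    printf '%s\n' "$HF_HOME/hub"
  else
    # Default location, honoring XDG_CACHE_HOME when it is set.
    printf '%s\n' "${XDG_CACHE_HOME:-$HOME/.cache}/huggingface/hub"
  fi
}

# Placeholder path: point the cache at a larger volume for this shell session.
export HF_HOME="/mnt/models/huggingface"
hf_cache_dir
```

Setting the variable in the shell profile rather than per session keeps later `whichllm run` calls and the cache cleanup tools pointed at the same directory.
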
Cleanup is handled by the Hugging Face cache tools:

```bash
hf cache scan
hf cache delete
```

whichllm does not currently delete model files automatically after a run.

## Ollama names do not match HuggingFace IDs

JSON output returns HuggingFace repo IDs:
