You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fix: resolve cached model refs for training loads (#476)
## Summary
followup to #473.
expands using the `ModelRef` targets in metadata and Hugging Face
training loads.
---------
Signed-off-by: Matt Kornfield <mkornfield@nvidia.com>
Signed-off-by: Aaron Gonzales <aagonzales@nvidia.com>
Co-authored-by: Matt Kornfield <mkornfield@nvidia.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
Copy file name to clipboardExpand all lines: docs/user-guide/running.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -275,7 +275,7 @@ execute in order (`config` → `dataframe` → `metadata` → `advisory`).
275
275
|-------|-------|-------------------|
276
276
|`gpu.cuda`| config | PyTorch is importable and a CUDA GPU is visible |
277
277
|`env.inference_key`| config |`NSS_INFERENCE_KEY` is set when PII classification is enabled (warning only) |
278
-
|`env.hf_token`| config |`HF_TOKEN` or `HUGGING_FACE_HUB_TOKEN`is set; warns unconditionally when neither is present so gated-repo downloads don't fail later (warning only)|
278
+
|`env.hf_model_availability`| config |The pretrained model reference is usable locally or can be fetched from Hugging Face; warns about a missing HF token only when online HF access may be needed|
279
279
|`dataset.size`| dataframe | Training split meets the hard minimum row count |
280
280
|`columns.groupby`| dataframe |`group_training_examples_by` column is present and has no nulls |
281
281
|`columns.orderby`| dataframe |`order_training_examples_by` column is present |
Copy file name to clipboardExpand all lines: docs/user-guide/troubleshooting.md
+4-1Lines changed: 4 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -467,7 +467,10 @@ check of its own.
467
467
|`no_gpu`| error |`gpu.cuda`| No CUDA GPU detected (required for training or generation) |
468
468
|`low_vram`| warning |`gpu.vram`| Free GPU VRAM may be insufficient |
469
469
|`inference_key_missing`| warning |`env.inference_key`|`NSS_INFERENCE_KEY` not set; PII classification degraded |
470
-
|`hf_token_missing`| warning |`env.hf_token`| Neither `HF_TOKEN` nor `HUGGING_FACE_HUB_TOKEN` set; gated model downloads may fail |
470
+
|`hf_token_missing`| warning |`env.hf_model_availability`| Neither `HF_TOKEN` nor `HUGGING_FACE_HUB_TOKEN` set, and model loading may need online Hugging Face access |
471
+
|`hf_model_not_cached`| warning/error |`env.hf_model_availability`| Hugging Face model is not present in the local cache; severity is error when HF offline mode is enabled |
472
+
|`hf_model_cache_incomplete`| error |`env.hf_model_availability`| Cached Hugging Face model snapshot is missing required config, tokenizer, weights, or shards |
473
+
|`hf_remote_code_not_cached`| warning/error |`env.hf_model_availability`| Trusted model references remote code that is not cached locally; severity is error when HF offline mode is enabled |
471
474
|`preflight.check_crash`| error | (crashing check) | A check raised an unexpected exception; the issue's `check` field names the crashing check and other checks continued running |
472
475
|`column_not_found`| error |`columns.groupby` / `columns.orderby`| Required column missing from dataset, or input DataFrame uses unsupported MultiIndex columns |
0 commit comments