Commit bce89dc
Fix canary validation artifact replication (#4173)
## Summary
- restore canary `tracker_metrics.jsonl` emission by setting
`replicate_path=this_output_path()` on the canary W&B tracker
- add a regression test that reloads the canary step and asserts metrics
replication stays configured
- fix the TPU canary post-run validation failure seen in scheduled runs
such as 23581948912, where training succeeded but metrics validation
failed because the artifact was never written
## Testing
- `./infra/pre-commit.py experiments/ferries/canary_ferry.py
tests/test_validate_canary_metrics.py`
- `uv run --with pytest --with pytest-timeout --with pytest-xdist python
-m pytest tests/test_validate_canary_metrics.py`
---------
Co-authored-by: yoblin <268258002+yoblin@users.noreply.github.com>1 parent 4f7d13c commit bce89dc
1 file changed
Lines changed: 1 addition & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
138 | 138 | | |
139 | 139 | | |
140 | 140 | | |
| 141 | + | |
141 | 142 | | |
142 | 143 | | |
143 | 144 | | |
| |||
0 commit comments