Update Evaluation Logic to Latest lm_eval (0.4.8) and Support Automatic Benchmark Evals w/o Validation Set #203

Triggered via pull request May 9, 2025 15:32
Status Failure
Total duration 30s

.cpu_ci_on_pr.yml

on: pull_request
run-tests
26s

Annotations

3 errors
run-tests
Process completed with exit code 1.
run-tests
Process completed with exit code 1.
run-tests
Process completed with exit code 1.