Pull Request CPU Tests

Update Evaluation Logic to Latest `lm_eval` (0.4.8) and Support Automatic Benchmark Evals w/o Validation Set #190

Sign in to view logs

Summary
Jobs
- run-tests
Run details
- Usage
- Workflow file

Run time

Learn about OS pricing on GitHub Actions

Job	Run time
run-tests	3m 8s
	3m 8s