Pull Request CPU Tests

Update Evaluation Logic to Latest `lm_eval` (0.4.8) and Support Automatic Benchmark Evals w/o Validation Set #186

Sign in to view logs

Summary
Jobs
- run-tests
Run details
- Usage
- Workflow file

Run time

Learn about OS pricing on GitHub Actions

Job	Run time
run-tests	2m 52s
	2m 52s