Skip to content

Actions: neuralmagic/lm-evaluation-harness

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
34 workflow runs
34 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

add arab_culture task (#3006)
Tasks Modified #17: Commit 8bc4aff pushed by anmarques
7m 27s main
add arab_culture task (#3006)
Unit Tests #17: Commit 8bc4aff pushed by anmarques
5m 31s main
use np.NaN (#2937)
Tasks Modified #15: Commit fc5019e pushed by anmarques
2m 3s main
use np.NaN (#2937)
Unit Tests #15: Commit fc5019e pushed by anmarques
6m 1s main
Longbench bugfix (#2895)
Unit Tests #14: Commit 930d837 pushed by anmarques
5m 40s main
Longbench bugfix (#2895)
Tasks Modified #14: Commit 930d837 pushed by anmarques
1m 29s main
Add GSM8K Platinum (#2771)
Unit Tests #12: Commit 11ac352 pushed by anmarques
2m 47s main
Add GSM8K Platinum (#2771)
Tasks Modified #12: Commit 11ac352 pushed by anmarques
11s main
Add MMLU-ProX task (#2811)
Unit Tests #11: Commit 8aeff14 pushed by anmarques
5m 4s main
Add MMLU-ProX task (#2811)
Tasks Modified #11: Commit 8aeff14 pushed by anmarques
1m 36s main
Add INCLUDE tasks (#2769)
Tasks Modified #10: Commit 6fbebb4 pushed by anmarques
21m 2s main
Add INCLUDE tasks (#2769)
Unit Tests #10: Commit 6fbebb4 pushed by anmarques
5m 19s main
Update evaluator.py (#2786)
Unit Tests #8: Commit 0f94477 pushed by anmarques
4m 27s main