llm-efficiency-challenge/private-helm

Holistic Evaluation of Language Models

This is a fork of HELM (https://github.com/stanford-crfm/helm) that we used for the 2023 NeurIPS LLM efficiency competition (https://llm-efficiency-challenge.github.io/).

The fork was private because the tasks we tested on had to remain undisclosed to the final participants. Those tasks included:

  • Math
  • Corr2cause
  • Justice
  • Samsum
  • Ethics

If you're interested in using these tasks in your own work, please feel free to copy and paste them.

About

HELM for hidden dataset eval
