[cool project / potential collaboration] Weval #278
simonaszilinskas
started this conversation in
Random
Replies: 1 comment 6 replies
EuroEval might also be worth considering here. More generally, sampling datasets from comparia is something we have also looked into (probably more along the lines of sampling a problem space, then generating, revising, and publishing). For example, we had a linguist look at common mistakes in Danish in the leaderboard responses.
Just had a call with the people behind Weval. As we're talking more and more about automating benchmarks on top of crowdsourced data, it could be interesting to propose a collaboration with them around our datasets. For example, we could take part of our dataset, say the conversations about French cuisine, study the human preferences in it, and then construct a benchmark from them.
Their team was great and their product is very cool!