GRPO dataset(s) #138

Answered by qgallouedec
ocramz asked this question in Q&A
Jan 31, 2025 · 1 comment · 2 replies

Is there a specific reason for this?

It's a nice dataset that has verifiable answers. But technically, you can use any verifiable data.
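
To illustrate what "verifiable" could mean in practice, here is a minimal sketch of a rule-based reward that scores a completion by comparing its final answer to a known reference. The names `extract_answer` and `verifiable_reward`, and the assumption that the model writes its final answer inside `\boxed{...}`, are hypothetical choices for this example and not necessarily how this project implements its rewards.

```python
import re
from typing import List, Optional


def extract_answer(completion: str) -> Optional[str]:
    """Return the content of the last \\boxed{...} in the text, or None.

    Assumption: the final answer is wrapped in \\boxed{...} with no nested braces.
    """
    matches = re.findall(r"\\boxed\{([^{}]*)\}", completion)
    return matches[-1].strip() if matches else None


def verifiable_reward(completions: List[str], answers: List[str]) -> List[float]:
    """Give 1.0 when the extracted answer equals the reference string, else 0.0."""
    rewards = []
    for completion, answer in zip(completions, answers):
        predicted = extract_answer(completion)
        rewards.append(1.0 if predicted == answer.strip() else 0.0)
    return rewards
```

Any dataset whose rows pair a prompt with a reference that can be checked programmatically (exact match, unit tests, a parser) would fit the same pattern; only the checking function would change.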

Would it be good to have additional datasets of this "reasoning" form, possibly from domains other than math?

Indeed, we have a lot of open issues suggesting other domains. There's a lot to explore here, for sure.

Answer selected by ocramz