Skip to content

Conversation

@cmunley1
Copy link
Contributor

@cmunley1 cmunley1 commented Jan 10, 2026

enables using environments hub envs in NeMo Gym with NeMo RL for training.

#446

Signed-off-by: Christian Munley <cmunley@nvidia.com>
Signed-off-by: Christian Munley <cmunley@nvidia.com>
Signed-off-by: Christian Munley <cmunley@nvidia.com>
Signed-off-by: Christian Munley <cmunley@nvidia.com>
Signed-off-by: Christian Munley <cmunley@nvidia.com>
Signed-off-by: Christian Munley <cmunley@nvidia.com>
Signed-off-by: cmunley1 <cmunley@nvidia.com>
@copy-pr-bot
Copy link

copy-pr-bot bot commented Jan 10, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@cmunley1 cmunley1 changed the title feat: verifiers integration supporting environments hub feat: verifiers / environments hub integration Jan 10, 2026
Signed-off-by: cmunley1 <cmunley@nvidia.com>
@cmunley1
Copy link
Contributor Author

image

ascii-tree

@cmunley1
Copy link
Contributor Author

image

acereason-math

@cmunley1
Copy link
Contributor Author

image

i3-math

@cmunley1 cmunley1 marked this pull request as ready for review January 11, 2026 20:02
@cmunley1 cmunley1 requested a review from a team as a code owner January 11, 2026 20:02
@cmunley1
Copy link
Contributor Author

image

multi turn seems to work, if we disable monotonicity checks and ensure consistent logprob dtype in nemo rl

Copy link
Member

@ahmadki ahmadki left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tests would also be appreciated

Signed-off-by: cmunley1 <cmunley@nvidia.com>
Signed-off-by: cmunley1 <cmunley@nvidia.com>
Signed-off-by: cmunley1 <cmunley@nvidia.com>
Signed-off-by: cmunley1 <cmunley@nvidia.com>
Signed-off-by: cmunley1 <cmunley@nvidia.com>
Signed-off-by: cmunley1 <cmunley@nvidia.com>
Signed-off-by: cmunley1 <cmunley@nvidia.com>
Signed-off-by: cmunley1 <cmunley@nvidia.com>
@cmunley1 cmunley1 requested a review from ahmadki January 13, 2026 19:50
ahmadki
ahmadki previously approved these changes Jan 22, 2026
Signed-off-by: Christian Munley <cmunley@nvidia.com>
Signed-off-by: Christian Munley <cmunley@nvidia.com>
Signed-off-by: Christian Munley <cmunley@nvidia.com>
@cmunley1
Copy link
Contributor Author

would like for someone from prime intellect to take a look, and also test more environments to provide a longer list of what is working today (seems to be various version mismatches or other issues with some envs).

however, I think this is good to merge. Can always open another PR for more verified environments, or based on PI feedback

@bxyu-nvidia @cwing-nvidia

Signed-off-by: Christian Munley <cmunley@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants