Contributor

@SolarWindRider SolarWindRider commented Nov 3, 2025

Fix a small bug in GRPOTrainer.

When inputs are conversational, bootstrap can be a list rather than a str, so concatenating it with the completion string raises an error.

I added a type check on bootstrap that fixes the bug while keeping the original behavior for str values.
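
For illustration, here is a minimal sketch of the kind of type check described above. It is not the actual diff. The helper name `prepend_bootstrap` is hypothetical, and the shape of a list-valued bootstrap (plain strings, or dicts with a `"text"` key) is an assumption, not something taken from the trainer code.

```python
def prepend_bootstrap(bootstrap, completion):
    """Prepend a bootstrap prefix to a completion string.

    Hypothetical helper: `bootstrap` may be a str (original behavior)
    or a list of content parts (the case that used to fail).
    """
    if isinstance(bootstrap, str):
        # Original path: plain string concatenation.
        return bootstrap + completion

    if isinstance(bootstrap, list):
        # Assumption: each item is either a plain string or a dict
        # such as {"type": "text", "text": "..."}; non-text parts are skipped.
        parts = []
        for item in bootstrap:
            if isinstance(item, str):
                parts.append(item)
            elif isinstance(item, dict) and item.get("type", "text") == "text":
                parts.append(item.get("text", ""))
        return "".join(parts) + completion

    raise TypeError(f"Unsupported bootstrap type: {type(bootstrap).__name__}")
```

With a check like this, `prepend_bootstrap("Sure, ", "here it is")` and `prepend_bootstrap([{"type": "text", "text": "Sure, "}], "here it is")` both return a single string, so the str path is unchanged and the list path no longer raises.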

@SolarWindRider SolarWindRider changed the title fix a little bug in completions with bootstrap [GRPOTrainer bug fix]fix a little bug in completions with bootstrap Nov 3, 2025
@SolarWindRider SolarWindRider changed the title [GRPOTrainer bug fix]fix a little bug in completions with bootstrap [GRPOTrainer bug fix] a little bug in completions with bootstrap Nov 3, 2025
