1 parent 7f0d291 commit 7f700f2
research/trl/train_trl_grpo.py
@@ -1129,7 +1129,6 @@ def main() -> None:
 # ─────────────────────────────────────────────────────────────────────
 max_prompt_length=max_prompt_length,  # max_length - max_completion_tokens
 max_completion_length=cfg.max_new_tokens,  # GRPO_MAX_COMPLETION_TOKENS
-
 
 # Importance Sampling Level