The min_length setting force the model generate to max length, which produce repeated or nonsense result

https://github.com/microsoft/DeepSpeedExamples/blob/8f8099a813f3b223d5df39e0c15c748de4eb1669/applications/DeepSpeed-Chat/training/step3_rlhf_finetuning/ppo_trainer.py#L76

When i try to reproduce bloom, i meet the same problem:
"The min_length setting force the model generate to max length, which produce repeated or nonsense result."
  [fix ppo_trainer generate and scores calculation in stage 2](https://github.com/microsoft/DeepSpeedExamples/pull/347)
  
So i try to delete the "min_length setting", but i find the program can't continue to run at https://github.com/microsoft/DeepSpeedExamples/blob/8f8099a813f3b223d5df39e0c15c748de4eb1669/applications/DeepSpeed-Chat/training/step3_rlhf_finetuning/ppo_trainer.py#L105

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The min_length setting force the model generate to max length, which produce repeated or nonsense result #539

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

The min_length setting force the model generate to max length, which produce repeated or nonsense result #539

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions