Skip to content

Model performance suprisingly bad  #318

Open
@ruihan0495

Description

@ruihan0495

Dear all,

We are trying to reproduce the results, however, as we follow the training steps, our chatbot is keep repeating a nonsense. We suspect that our RLHF part is bad, so we simply load the pretrained model, and the result is also very very bad. Anyone has the same issue? If you successfully trained a decent chatbot, do you have any bitter lesson that could share across the community?
Thanks!

Kind regards,
Jade

Metadata

Metadata

Assignees

Labels

deespeed chatDeepSpeed ChatmodelingRelated to modeling questions.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions