RLHF model returns '{: {: {:' for every input #518

Open
@kuangdao

I followed the DeepSpeed Chat README and used these training scripts:

training/step1_supervised_finetuning/training_scripts/single_node/run_1.3b.sh
training/step2_reward_model_finetuning/training_scripts/single_node/run_350m.sh
training/step3_rlhf_finetuning/training_scripts/single_node/run_1.3b.sh


python3 chat.py --path output/step3-models/1.3b/actor

The bug appears when I try chat.py. The result is:

Enter input (type 'quit' to exit, 'clear' to clean memory): What is Edge?
------------------------------ Round 1 ------------------------------
 Human: What is Edge?
 Assistant:  {: {: {: {: {: {: {: {: {: {:
Enter input (type 'quit' to exit, 'clear' to clean memory): clear
Enter input (type 'quit' to exit, 'clear' to clean memory): hello
------------------------------ Round 1 ------------------------------
 Human: hello
 Assistant:  {: {: {: {: {: {: {: {: {: {: {: {: {:
Enter input (type 'quit' to exit, 'clear' to clean memory): clear
Enter input (type 'quit' to exit, 'clear' to clean memory): who are you?                    
------------------------------ Round 1 ------------------------------
 Human: who are you?
 Assistant:  {: {: {: {: {: {: {: {: {: {:
Enter input (type 'quit' to exit, 'clear' to clean memory):

For every input, it returns "{: {: {: {: {: {: {: {: {: {:".
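
To confirm the symptom across many prompts without reading each reply, something like the following sketch could be used. It only measures how much of a reply consists of one repeated token. The helper names `repetition_ratio` and `looks_degenerate` and the threshold values are hypothetical and not part of DeepSpeed Chat.

```python
from collections import Counter


def repetition_ratio(text: str) -> float:
    """Fraction of whitespace-separated tokens equal to the most common token."""
    tokens = text.split()
    if not tokens:
        return 0.0
    most_common_count = Counter(tokens).most_common(1)[0][1]
    return most_common_count / len(tokens)


def looks_degenerate(text: str, threshold: float = 0.9, min_tokens: int = 5) -> bool:
    """Heuristic (assumed thresholds): long enough and dominated by one token."""
    tokens = text.split()
    return len(tokens) >= min_tokens and repetition_ratio(text) >= threshold


if __name__ == "__main__":
    # Replies copied from the chat.py session above.
    replies = [
        "{: {: {: {: {: {: {: {: {: {:",
        "{: {: {: {: {: {: {: {: {: {: {: {: {:",
    ]
    for reply in replies:
        print(looks_degenerate(reply), repetition_ratio(reply))
```

If every reply from the step 3 actor is flagged while replies from the step 1 checkpoint are not, that would suggest the collapse happened during RLHF training rather than in chat.py itself.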

Labels: deespeed chat, modeling