[ERROR]In Step3，load reward Model failed which trainged with zero-stage 3

In step 3, I modified the zero-stage of the critic model and reward model from 0 to 3, and the Reward Model trained with zero-stage 3 in step 2 cannot be loaded。But we can load reward model with zero-stage 0 in rw_eval.py

err：size mismatch for rwtranrsformer.layers.0.self_attn.q_proj.weight: copying a param with shape torch.Size([4096, 4096]) from checkpoint, the shape in current model is torch.Size([0]).

modified the zero-stage of the critic model and reward model from 0 to 3：
![image](https://user-images.githubusercontent.com/29229751/234289000-60eaa5a3-602d-4797-88a1-c1735917f84e.png)




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ERROR]In Step3，load reward Model failed which trainged with zero-stage 3 #429

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[ERROR]In Step3，load reward Model failed which trainged with zero-stage 3 #429

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions