[BUG]: Llama3.1-70B-instruct save model 

### Is there an existing issue for this bug?

- [X] I have searched the existing issues

### 🐛 Describe the bug

I trained a reward model based on Llama3.1-70B-instruct on 48 H100 GPUs (3d, tp=8, pp=1).
When I execute `booster.save_model(model, os.path.join(save_dir, "modeling"), shard=True)`, the saved `model.embed_tokens.weight` has shape [16064, 8192] rather than [128256, 8192]. The shapes of all other weights are correct.
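A quick arithmetic check on the reported shapes, as a sketch rather than a confirmed diagnosis: 16064 rows times tp=8 gives 128512, which is what 128256 becomes if the vocabulary is padded up to a multiple of 512. The padding multiple (64 per tensor-parallel rank) is an assumption made here for illustration, and the helper `padded_vocab` is hypothetical. If the assumption holds, the saved tensor may be a single tensor-parallel shard of a padded embedding rather than the gathered, unpadded weight.

```python
import math

full_vocab = 128256   # expected rows of model.embed_tokens.weight
saved_rows = 16064    # rows actually found in the saved checkpoint
tp_size = 8           # tensor-parallel degree from the report


def padded_vocab(vocab_size: int, multiple: int) -> int:
    """Round vocab_size up to the nearest multiple (hypothetical helper)."""
    return math.ceil(vocab_size / multiple) * multiple


# Rows the embedding would have if every TP rank held a shard of saved_rows.
rows_across_tp = saved_rows * tp_size

# ASSUMPTION: the vocabulary is padded to a multiple of 64 per TP rank.
assumed_multiple = 64 * tp_size

print("saved_rows * tp_size:", rows_across_tp)
print("vocab padded to assumed multiple:", padded_vocab(full_vocab, assumed_multiple))
print("unpadded rows per rank:", full_vocab // tp_size)
```

If the first two printed values match, the saved shape is consistent with one shard of a padded embedding, which would point at the gather or unpad step during saving.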

Please HELP ME!
Thank you!

### Environment

transformers 4.44.1
colossalai 0.4.5
flash-attn 2.6.3

