Change llama 3 and 2 configs to use hf ckpt instead of meta ckpt

Update llama 3 and 2 to use hf ckpt, so users can use the trained model using .from_pretrained.

3.1 reference: https://github.com/pytorch/torchtune/blob/e9fd56a812cf0ba151fa164a45eb04056d099726/recipes/configs/llama3_1/8B_full.yaml#L39
3.0 currently: https://github.com/pytorch/torchtune/blob/e9fd56a812cf0ba151fa164a45eb04056d099726/recipes/configs/llama3/8B_full.yaml#L39

steps:
1) Change the checkpointer FullModelMetaCheckpointer -> FullModelHFCheckpointer
2) Update the download command to --ignore-patterns "original/consolidated.00.pth", instead of safetensors
3) Update checkpoint files to .safetensors
4) Launch training without errors

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change llama 3 and 2 configs to use hf ckpt instead of meta ckpt #2045

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Change llama 3 and 2 configs to use hf ckpt instead of meta ckpt #2045

Description

Activity

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions