Conversation

@Xiaoming-AMD (Collaborator) commented:

Add configuration files for Meta's LLaMA2 7B/70B models.

init_method_std: 0.02

# multi_latent_attention does not support apply_rope_fusion
apply_rope_fusion: false
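
For context, a minimal sketch of what one of these configuration files might contain, combining the keys quoted in this thread with well-known LLaMA2-7B architecture values; key names beyond the two quoted above are assumptions:

# LLaMA2 7B (sketch; key names other than the quoted ones are assumed)
num_layers: 32
hidden_size: 4096
ffn_hidden_size: 11008  # SwiGLU feed-forward width
num_attention_heads: 32
seq_length: 4096
init_method_std: 0.02
# multi_latent_attention does not support apply_rope_fusion
apply_rope_fusion: false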
A reviewer (Contributor) commented:

This can be changed to true: MLA (multi-latent attention) is not currently enabled in the Llama configs, so rope fusion can be used.
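
A minimal sketch of the suggested change, assuming the same Megatron-style keys as the snippet above (whether the fused kernel is actually used also depends on the attention settings elsewhere in the config):

# rope fusion can be enabled because multi_latent_attention is off for LLaMA2
apply_rope_fusion: true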

@wenxie-amd (Contributor) left a review:

LGTM

@wenxie-amd merged commit f6aaa58 into main on Apr 9, 2025 (1 check passed).

@Xiaoming-AMD changed the title from "[Feat] Add LLaMA2 7B & 70B model configuration files" to "feature(llama): Add LLaMA2 7B & 70B model configuration files" on Jun 4, 2025.

@Xiaoming-AMD changed the title from "feature(llama): Add LLaMA2 7B & 70B model configuration files" to "feat(llama): Add LLaMA2 7B & 70B model configuration files" on Jun 25, 2025.