@Xiaoming-AMD
Collaborator

Align TFLOPs calculation


Contributor

Would it be enough to base this on the latest Megatron version and then just remove the `/ 2` below? The new version supports `moe_layer_freq`.
# Only half of the attention matrix is non-zero and needs to be multiplied with V.
+ (args.seq_length / args.hidden_size / 2)
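
To make the question concrete, here is a minimal sketch of the single term quoted above, with and without the `/ 2`. The function name `attention_score_term` and the `causal_half` flag are hypothetical and exist only for this illustration; they are not Megatron APIs. Only the expression `seq_length / hidden_size / 2` is taken from the snippet, and the rest of the FLOPs formula is deliberately left out.

```python
def attention_score_term(seq_length: int, hidden_size: int, causal_half: bool = True) -> float:
    """Return the attention-matrix contribution quoted in the review comment.

    Hypothetical helper for illustration only.
    causal_half=True keeps the `/ 2` (only half of the attention matrix is
    counted as non-zero); causal_half=False drops it, as the comment proposes.
    """
    term = seq_length / hidden_size
    if causal_half:
        term /= 2
    return term


if __name__ == "__main__":
    # Assumed example sizes, not values from the PR.
    with_half = attention_score_term(4096, 4096, causal_half=True)
    without_half = attention_score_term(4096, 4096, causal_half=False)
    print(with_half, without_half)
```

Dropping the `/ 2` doubles this one term, so how much the reported TFLOPs change depends on how large `seq_length / hidden_size` is relative to the other terms in the formula.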

Contributor

@wenxie-amd wenxie-amd left a comment

LGTM

@wenxie-amd wenxie-amd merged commit 2386abd into main Apr 22, 2025
2 checks passed
@Xiaoming-AMD Xiaoming-AMD deleted the dev/xiaoming/align_tflops_calculation branch April 22, 2025 05:52
@Xiaoming-AMD Xiaoming-AMD changed the title from "[Feat] Align TFLOPs calculation" to "feature(megatron): Align TFLOPs calculation for megatron" Jun 4, 2025
@Xiaoming-AMD Xiaoming-AMD changed the title from "feature(megatron): Align TFLOPs calculation for megatron" to "feat(megatron): Align TFLOPs calculation for megatron" Jun 25, 2025

3 participants