[QUESTION] How to Obtain Computation Model Graphs in Megatron-LM?

Hi everyone,

I'm currently working on a project involving Megatron-LM and I'm looking for a way to obtain the graphs (computation graphs) of sub-models after partitioning, along with the attributes of the operators. I've tried using tools such as torch.fx and the new compile and dynamo tools in PyTorch 2.0, but I've encountered several issues. It seems that some of these problems are related to the custom operators implemented in Megatron-LM.

Could anyone provide a feasible solution or guidance on how to achieve this? Any help would be greatly appreciated!

Thank you in advance!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QUESTION] How to Obtain Computation Model Graphs in Megatron-LM? #832

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[QUESTION] How to Obtain Computation Model Graphs in Megatron-LM? #832

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions