[QUESTION] why non-interleaved pipeline does not support overlap_p2p_comm? #1070

new-TonyWang · 2024-09-04T15:25:49Z

new-TonyWang
Sep 4, 2024

Your question
Ask a clear and concise question about Megatron-LM.
why non-interleaved does not support overlap_p2p_comm?

    if args.num_layers_per_virtual_pipeline_stage is not None:
        if args.overlap_p2p_comm:
            assert args.pipeline_model_parallel_size > 1, \
                'when interleaved schedule is used, pipeline-model-parallel size '\
                'should be greater than 1'
        else:
            assert args.pipeline_model_parallel_size > 2, \
                'when interleaved schedule is used and p2p communication overlap is disabled, '\
                'pipeline-model-parallel size should be greater than 2 to avoid having multiple '\
                'p2p sends and recvs between same 2 ranks per communication batch'
        assert args.num_layers % args.transformer_pipeline_model_parallel_size == 0, \
            'number of layers should be divisible by the pipeline parallel size'
        num_layers_per_pipeline_stage = args.num_layers // args.transformer_pipeline_model_parallel_size
        assert num_layers_per_pipeline_stage % args.num_layers_per_virtual_pipeline_stage == 0, \
            'number of layers per pipeline stage must be divisible number of layers per virtual pipeline stage'
        args.virtual_pipeline_model_parallel_size = num_layers_per_pipeline_stage // \
            args.num_layers_per_virtual_pipeline_stage
    else:
        args.virtual_pipeline_model_parallel_size = None
        # Overlap P2P communication is disabled if not using the interleaved schedule.
        args.overlap_p2p_comm = False
        if args.rank == 0:
            print('WARNING: Setting args.overlap_p2p_comm to False since non-interleaved '
                  'schedule does not support overlapping p2p communication')

Answered by CodersAcademy006

Jan 21, 2026

The overlap_p2p_comm optimization relies on the presence of independent virtual pipeline stages to hide communication latency behind computation.

In the non-interleaved (standard 1F1B) pipeline schedule, each rank owns a single pipeline stage with a strict Recv -> Compute -> Send dependency chain. There is no independent virtual-stage work available to execute while P2P communication is in flight.

As a result, enabling overlap in this setting would either break the dependency graph (by issuing sends/recvs out of order across ranks) or collapse back to serialized execution, defeating the purpose of overlap and risking NCCL ordering violations. For this reason, Megatron explicitly disables o…

View full answer

denght23 · 2026-01-20T06:32:21Z

denght23
Jan 20, 2026

Hi, I have the same question. Have you managed to solve it?

0 replies

CodersAcademy006 · 2026-01-21T13:22:40Z

CodersAcademy006
Jan 21, 2026

The overlap_p2p_comm optimization relies on the presence of independent virtual pipeline stages to hide communication latency behind computation.

In the non-interleaved (standard 1F1B) pipeline schedule, each rank owns a single pipeline stage with a strict Recv -> Compute -> Send dependency chain. There is no independent virtual-stage work available to execute while P2P communication is in flight.

As a result, enabling overlap in this setting would either break the dependency graph (by issuing sends/recvs out of order across ranks) or collapse back to serialized execution, defeating the purpose of overlap and risking NCCL ordering violations. For this reason, Megatron explicitly disables overlap_p2p_comm when using the non-interleaved schedule.

Hope this helps, please correct me otherwise. Thank you!!

1 reply

new-TonyWang Feb 26, 2026
Author

Thank you

1195343015 · 2026-02-10T03:26:37Z

1195343015
Feb 10, 2026

@CodersAcademy006 What he said is correct. Non-interleaved pipeline parallelism itself does not have space for p2p_comm overlap; this is an algorithmic issue rather than an engineering problem.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QUESTION] why non-interleaved pipeline does not support overlap_p2p_comm? #1070

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 3 comments 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

[QUESTION] why non-interleaved pipeline does not support overlap_p2p_comm? #1070

Uh oh!

Uh oh!

new-TonyWang Sep 4, 2024

Replies: 3 comments · 1 reply

Uh oh!

denght23 Jan 20, 2026

Uh oh!

CodersAcademy006 Jan 21, 2026

Uh oh!

new-TonyWang Feb 26, 2026 Author

Uh oh!

1195343015 Feb 10, 2026

new-TonyWang
Sep 4, 2024

Replies: 3 comments 1 reply

denght23
Jan 20, 2026

CodersAcademy006
Jan 21, 2026

new-TonyWang Feb 26, 2026
Author

1195343015
Feb 10, 2026