Question: about CTran Send Recv

Hi, thank you for your nice work! I have read your paper [Collective Communication for 100k+ GPUs](https://arxiv.org/abs/2510.20171), especially Chapter 5.1, "PP: Zero-copy and SM-free Send/Receive".

I wonder how to use CTran to achieve SM-free, zero-copy send/recv asynchronously, without falling back to NCCL's copy-based send/recv or to RDMA that relies on a pre-allocated buffer. Or would the user tensor need to be registered as an RDMA MR every time we launch a send/recv? Is there a best practice?
The evaluation chapter of your paper [Collective Communication for 100k+ GPUs](https://arxiv.org/abs/2510.20171) also mentions SM-free and zero-copy send/recv, so I would really like to try it :)

I also noticed that send/recv in the ncclx backend still uses NCCL, which is neither SM-free nor zero-copy. Why not use CTran to implement a better version that is SM-free and zero-copy?

Question: about CTran Send Recv #86