Open
Description
Hey, sorry if this is the wrong place. Feel free to move it to discussion.
I am trying to get peft working with fsdp2 and am wondering if someone else attempted that already?
The issue is that Im always getting errors along the lines of:
RuntimeError: aten.mm.default: got mixed torch.Tensor and DTensor, need to convert all torch.Tensor to DTensor before calling distributed operators!
Happy for any pointers.
Metadata
Assignees
Labels
No labels
Activity