Conversation

@kohya-ss (Owner) commented Mar 16, 2025

This may solve a potential issue and prevent the loss becoming NaN during long training runs. Further testing is needed to confirm that existing training works as well as before.

@kohya-ss added the help wanted label Mar 16, 2025
@kohya-ss (Owner, Author)

This fix appears to enable multi-GPU training with DDP, but further testing is required.

@FurkanGozukara (Contributor)

> This fix appears to enable multi-GPU training with DDP, but further testing is required.

With FLUX, we were never able to do multi-GPU training with block swap.

So is it possible now?

@kohya-ss (Owner, Author)

> So is it possible now?

I think so. I will add a new branch to support this in the sd-scripts repo today.

@FurkanGozukara (Contributor)

> So is it possible now?
>
> I think so. I will add a new branch to support this in the sd-scripts repo today.

awesome
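
For anyone who wants to try this, below is a minimal sketch of how a multi-GPU DDP run is typically wired up with Hugging Face Accelerate. It is an illustration under assumptions only, not the actual sd-scripts training loop; block swapping itself happens inside sd-scripts' model code and is not shown here.

```python
# Minimal DDP sketch with Hugging Face Accelerate (illustration only, not sd-scripts code).
# Launch with e.g.:  accelerate launch --multi_gpu --num_processes 2 this_script.py
import torch
from accelerate import Accelerator

accelerator = Accelerator()  # process group and device placement come from `accelerate launch`

model = torch.nn.Linear(16, 1)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
dataset = torch.utils.data.TensorDataset(torch.randn(64, 16), torch.randn(64, 1))
loader = torch.utils.data.DataLoader(dataset, batch_size=8)

# prepare() wraps the model in DistributedDataParallel when more than one process is running
model, optimizer, loader = accelerator.prepare(model, optimizer, loader)

for x, y in loader:
    optimizer.zero_grad()
    loss = torch.nn.functional.mse_loss(model(x), y)
    accelerator.backward(loss)  # handles gradient synchronization across ranks
    optimizer.step()
```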

@kohya-ss (Owner, Author)

> This fix appears to enable multi-GPU training with DDP, but further testing is required.

Unfortunately, this doesn't seem to work with the NCCL backend for now.

@kohya-ss (Owner, Author)

This seems to be working fine in a Windows environment.
Testing in a Linux environment is welcome!
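
Since DDP on Windows falls back to the gloo backend (NCCL is not available there), one thing Linux testers might experiment with is requesting gloo explicitly. This is only a sketch, assuming Accelerate's InitProcessGroupKwargs handler accepts a backend argument; whether it actually sidesteps the NCCL problem reported above is untested here.

```python
# Hypothetical workaround sketch: request the gloo backend instead of NCCL.
# Untested for this PR; shown only as something Linux testers could try.
from datetime import timedelta
from accelerate import Accelerator
from accelerate.utils import InitProcessGroupKwargs

pg_kwargs = InitProcessGroupKwargs(backend="gloo", timeout=timedelta(minutes=30))
accelerator = Accelerator(kwargs_handlers=[pg_kwargs])
print(f"process {accelerator.process_index}: requested gloo backend")
```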

@xzuyn (Contributor) commented Sep 30, 2025

I've been using this on a single AMD GPU on Ubuntu for a bit, training Qwen Image, and I haven't noticed any problems.
