Conversation

@tushar00jain
Contributor

Differential Revision: D83512078

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Sep 29, 2025
@facebook-github-bot
Contributor

@tushar00jain has exported this pull request. If you are a Meta employee, you can view the originating diff in D83512078.

tushar00jain added a commit to tushar00jain/torchft that referenced this pull request Sep 29, 2025
Summary: Pull Request resolved: meta-pytorch#277

Differential Revision: D83512078

tushar00jain added a commit to tushar00jain/torchft that referenced this pull request Sep 30, 2025
Summary:
We currently don't restore the outer optimizer state, which can lead to bumps in the loss because of the high learning rate on a new replica. To fix this, save the outer optimizer state in the DiLoCo-specific state dict.


Differential Revision: D83512078

Summary:
We currently don't restore the outer optimizer state, which can lead to bumps in the loss because of the high learning rate on a new replica. To fix this, save the outer optimizer state in the DiLoCo-specific state dict.


Reviewed By: d4l3k

Differential Revision: D83512078
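
For readers unfamiliar with the change, the idea in the summary can be illustrated with a minimal, dependency-free sketch. It does not reproduce the torchft code: `ToyOuterOptimizer`, `ToyDiLoCoState`, and the `"original_parameters"` and `"outer_optimizer"` keys are hypothetical names chosen for illustration only. The point is simply that the outer optimizer's state (learning rate, momentum buffers) travels inside the same state dict a recovering replica loads.

```python
from copy import deepcopy


class ToyOuterOptimizer:
    """Hypothetical stand-in for an outer optimizer: SGD with momentum over a list of floats."""

    def __init__(self, num_params, lr=0.7, momentum=0.9):
        self.lr = lr
        self.momentum = momentum
        self.velocity = [0.0] * num_params

    def step(self, params, pseudo_grads):
        # Update the momentum buffers, then apply them to the parameters in place.
        for i, g in enumerate(pseudo_grads):
            self.velocity[i] = self.momentum * self.velocity[i] + g
            params[i] -= self.lr * self.velocity[i]

    def state_dict(self):
        return {"lr": self.lr, "momentum": self.momentum, "velocity": list(self.velocity)}

    def load_state_dict(self, state):
        self.lr = state["lr"]
        self.momentum = state["momentum"]
        self.velocity = list(state["velocity"])


class ToyDiLoCoState:
    """Hypothetical DiLoCo-specific state that includes the outer optimizer."""

    def __init__(self, params):
        self.original_params = list(params)
        self.outer_optimizer = ToyOuterOptimizer(len(params))

    def state_dict(self):
        return {
            "original_parameters": list(self.original_params),
            # Assumption: the outer optimizer state is stored under its own key.
            "outer_optimizer": self.outer_optimizer.state_dict(),
        }

    def load_state_dict(self, state):
        self.original_params = list(state["original_parameters"])
        self.outer_optimizer.load_state_dict(state["outer_optimizer"])


# A healthy replica whose outer learning rate has already been decayed.
healthy = ToyDiLoCoState([1.0, 2.0])
healthy.outer_optimizer.lr = 0.35
healthy.outer_optimizer.step(healthy.original_params, [0.1, -0.2])

# A new replica recovers from the healthy one and inherits the optimizer state
# instead of starting again from its default (higher) learning rate.
recovering = ToyDiLoCoState([0.0, 0.0])
recovering.load_state_dict(deepcopy(healthy.state_dict()))
```

In this sketch, omitting the `"outer_optimizer"` entry would leave the recovering replica at its default learning rate with empty momentum buffers, which is the kind of mismatch the summary attributes the loss bumps to.
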
@facebook-github-bot
Contributor

This pull request has been merged in 302fd39.

Labels

CLA Signed This label is managed by the Meta Open Source bot. fb-exported Merged meta-exported
