Skip to content

[BUG] The different lr for Muon doesn't work? #7713

@Zazexy

Description

@Zazexy

I highly appreciate the improvements you've made in supporting the Muon optimizer. However, I've run into an issue when trying to set the individual muon_lr and adam_lr within the configuration JSON.
I've found that the optimizer still defaults to setting a single lr for both the Muon parameter group and the Adam parameter group. I've verified that I am on the latest commit, as the relevant processing code for this feature branch is present in my local engine.py.
All other settings in the configuration are working as expected, but the specific muon_lr and adam_lr settings are not taking effect. Has anyone else encountered this issue?
"optimizer": {
"type": "Muon",
"params": {
"lr": 1e-4,
"betas": [0.9, 0.95],
"eps": 1e-8,
"weight_decay": 0.01,
"muon_lr": 2e-2,
"adam_lr": 1e-5
}

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingtraining

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions