Skip to content

Cherry-pick muon to release3.4#78748

Merged
sneaxiy merged 6 commits intoPaddlePaddle:release/3.4from
xxyux:release/3.4
Apr 24, 2026
Merged

Cherry-pick muon to release3.4#78748
sneaxiy merged 6 commits intoPaddlePaddle:release/3.4from
xxyux:release/3.4

Conversation

@xxyux
Copy link
Copy Markdown
Contributor

@xxyux xxyux commented Apr 22, 2026

PR Category

Execute Infrastructure

PR Types

Improvements

Description

devPR:#78716
devPR:#78773

是否引起精度变化

…mpatibility

- muon_sharding_optimizer: replace hardcoded None/moe_expert color paths with
  generic _rank2params_2d_by_color iteration in step() and __init__ Step4;
  replace static _build_color_to_group_info(hcg) with dynamic
  _build_color_to_group_info_from_params(parameter_list, default_group) that
  scans param.color dicts at runtime; generalize reduce_gradients and
  _sharding_sync_parameters similarly; clean up comments (fix errors, translate
  Chinese to English, remove dead code and debug prints)

- muon: remove built-in QKV/FFN split logic (QKVInfo, qkv_info,
  intermediate_size, muon_qkv_update_mode, muon_ffn_split) from the optimizer;
  callers now pass slice strategies via MuonParamInfo.slice_func, keeping
  model-specific split logic out of the optimizer core; add ns_matmul_dtype
  parameter to Muon.__init__ and _zeropower_via_newtonschulz5 with auto-detect
  (bfloat16 on Ampere+, float32 on V100 and older) to enable CI on V100

- optimizer: allow Muon class to skip incompatible base-class checks

- test: update hybrid_parallel_sharding_muon_model and test_parallel_dygraph_muon
  to use current MuonParamInfo API (slice_func instead of deprecated qkv_info/
  intermediate_size); remove GPU capability >= 8 skipIf guard so tests run on V100

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@paddle-bot
Copy link
Copy Markdown

paddle-bot Bot commented Apr 22, 2026

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@codecov-commenter
Copy link
Copy Markdown

codecov-commenter commented Apr 22, 2026

Codecov Report

❌ Patch coverage is 89.13043% with 10 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (release/3.4@7875c6c). Learn more about missing BASE report.

Files with missing lines Patch % Lines
...d/fleet/meta_optimizers/muon_sharding_optimizer.py 89.28% 6 Missing ⚠️
python/paddle/optimizer/muon.py 88.57% 4 Missing ⚠️

❌ Your patch status has failed because the patch coverage (89.13%) is below the target coverage (90.00%). You can increase the patch coverage or adjust the target coverage.

Additional details and impacted files
@@              Coverage Diff               @@
##             release/3.4   #78748   +/-   ##
==============================================
  Coverage               ?   89.13%           
==============================================
  Files                  ?        3           
  Lines                  ?       92           
  Branches               ?        0           
==============================================
  Hits                   ?       82           
  Misses                 ?       10           
  Partials               ?        0           

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@xxyux
Copy link
Copy Markdown
Contributor Author

xxyux commented Apr 23, 2026

/re-run all-failed

GuoxiaWang
GuoxiaWang previously approved these changes Apr 23, 2026
@xxyux xxyux changed the title feat: generalize MuonShardingOptimizer color handling and add V100 co… Cherry-pick muon to release3.4 Apr 23, 2026
@xxyux
Copy link
Copy Markdown
Contributor Author

xxyux commented Apr 23, 2026

/re-run all-failed

GuoxiaWang
GuoxiaWang previously approved these changes Apr 24, 2026
Copy link
Copy Markdown
Collaborator

@sneaxiy sneaxiy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@sneaxiy sneaxiy merged commit eda0f7f into PaddlePaddle:release/3.4 Apr 24, 2026
265 of 294 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants