Skip to content

Pull requests: modelscope/ms-swift

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[misc] change default values of topk/topp
#8442 opened Mar 26, 2026 by hjh0119 Loading…
Add loss scale from data
#8430 opened Mar 25, 2026 by hpsun1109 Loading…
[bugfix] fix megatron opsd with top-k
#8428 opened Mar 25, 2026 by hjh0119 Loading…
REAL Loss (Rewards as Labels) for GRPO Training
#8424 opened Mar 25, 2026 by li2zhi Loading…
2 of 4 tasks
[megatron] Refactor mcore bridge
#8422 opened Mar 25, 2026 by Jintao-Huang Loading…
Fix/environment variable in multinode train
#8413 opened Mar 24, 2026 by huangfu170 Loading…
3 tasks
Fix wandb logging step and metrics reporting
#8402 opened Mar 23, 2026 by SundayVHan Loading…
3 tasks
[megatron] support multimodal MTP
#8390 opened Mar 20, 2026 by Jintao-Huang Loading…
[megatron]add megatron log
#8348 opened Mar 16, 2026 by yangbofun Loading…
1 of 4 tasks
[megatron] support the fake distributed process group
#8347 opened Mar 16, 2026 by yangbofun Loading…
1 of 4 tasks
SymPO
#8245 opened Mar 9, 2026 by JiangWu0826 Loading…
1 of 4 tasks
Feature/ms swift custom
#8222 opened Mar 6, 2026 by LEWISZZZcc Loading…
4 tasks
[WIP] Moe kernel for qwen3 omni in ascend
#8214 opened Mar 5, 2026 by jiaqiw09 Loading…
1 of 4 tasks
feat: log grpo input images to wandb
#8157 opened Mar 2, 2026 by shunk031 Loading…
1 of 4 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.