-
Notifications
You must be signed in to change notification settings - Fork 581
Pull requests: THUDM/slime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP] fix transforrmers api change at 5.2.0
run-ci-megatron
#1647
opened Feb 28, 2026 by
UbeCc
Loading…
Add slime skills for rollout, reward, filter, eval config, and CI
#1646
opened Feb 28, 2026 by
yitianlian
Loading…
add npu patch for qwen3-vl-8b grpo multi-turn based on tag v0.2.2
#1637
opened Feb 27, 2026 by
ascend-slime
Loading…
feat: add --lazy-multimodal-load to defer image process to rollout time
#1623
opened Feb 25, 2026 by
yzlnew
Loading…
fix(r3,vlm): remove orphaned RoutingReplay from decoder rebuild.
#1620
opened Feb 24, 2026 by
yxyOo
Loading…
[Feature] Add configurable arguments for rollout manager actor
#1596
opened Feb 18, 2026 by
TSunny007
Loading…
[Feature] Add curriculum learning example with dynamic multi-task training and online prompt filtering
#1594
opened Feb 18, 2026 by
zhangzx-uiuc
Loading…
[Fix] Minor fix for properly finishing / flushing wandb logging metrics at exit
#1592
opened Feb 17, 2026 by
silunw
Loading…
[Feature] Add modular tracking interface with MLflow backend
#1591
opened Feb 17, 2026 by
mouad-hpc
Loading…
4 tasks done
Add retries to the Remote Reward Model, do not fail on connection drops or endpoint instability
#1582
opened Feb 12, 2026 by
joyliu-q
Loading…
[Fix] Add missing space between --save and extra_args in convert_checkpoint
#1580
opened Feb 12, 2026 by
kaysonyu
Loading…
1 task done
fix issue#1558, --load with restoring megatron checkpoint but rollout start with 0
#1571
opened Feb 11, 2026 by
p1k0pan
Loading…
Add quantization support for bridge mode weight update
#1564
opened Feb 9, 2026 by
jairuigou
Loading…
fix: fix nvidia-modelopt version specifier in build_conda.sh
#1554
opened Feb 6, 2026 by
liujiahua123123
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.