-
Notifications
You must be signed in to change notification settings - Fork 514
Pull requests: allenai/open-instruct
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Remove unused empty_cache parameter from weight sync
#1537
opened Mar 18, 2026 by
finbarrtimbers
Loading…
Batch vLLM weight sync broadcasts to fix 32k timeout
#1535
opened Mar 18, 2026 by
finbarrtimbers
Loading…
Offline Distillation via DistillKit (Part Two - Teacher Logit Capture)
#1534
opened Mar 18, 2026 by
wolfecameron
Loading…
Fix GPU_TESTS override regex in detect_gpu_tests_skip.sh
#1531
opened Mar 17, 2026 by
finbarrtimbers
Loading…
Add DeepSpeed universal checkpoint (UCP) support for GRPO
#1517
opened Mar 7, 2026 by
MohdElgaar
Loading…
Migrate to vLLM 0.16.0 native weight transfer API
#1515
opened Mar 6, 2026 by
finbarrtimbers
Loading…
Add SLR-Bench (Scalable Logical Reasoning) verifier and dataset support for RLVR
#1511
opened Mar 6, 2026 by
lukashelff
Loading…
Rename TIS ratio cap, add low bound and hard filter flag
#1503
opened Mar 2, 2026 by
finbarrtimbers
Loading…
Add AppWorld environment integration for GRPO
#1501
opened Feb 27, 2026 by
hamishivi
Loading…
3 tasks done
Fix dataset mixer split validation in combined datasets
#1494
opened Feb 24, 2026 by
MohdElgaar
Loading…
Add SWERLSandboxEnv for per-sample Docker tasks with submit-based evaluation
#1492
opened Feb 24, 2026 by
hamishivi
Loading…
4 tasks done
Remove vllm_num_engines from VLLMConfig; compute inline from cluster resources
#1482
opened Feb 19, 2026 by
finbarrtimbers
Loading…
Require checkpoint on Beaker restarts for DPO and GRPO training
codex
#1469
opened Feb 10, 2026 by
finbarrtimbers
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.