Model Engine
- Switch default to new model engine
- Mark legacy engine as deprecated
- Feature parity between the new and legacy model engines (LoRA/PEFT, etc.): [megatron] feat: Share actor and ref in LoRA #4673, [worker] fix: new engine saves megatron LoRA adapters checkpoints #4866, [worker] feat: New engine share actor and ref for LoRA #4867
Megatron
- Performance optimization
- Megatron dynamic CP [BREAKING][megatron] feat: support dynamic CP #5057
- MoE multi-modal model training
- Long context training: fine-grained activation recomputation/offload
VeOmni
TorchTitan #5306
Rollout Engine
- Improve rollout server profiling: [perf] feat: verl profiler system support Agent Loop scenario and integrate torch.profiler #4320
- New rollout engine: TensorRT-LLM [ray,rollout,trtllm] feat: Adding tensorrt_llm as new rollout engine #4665
- Separate vllm worker from trainer, sync by cuda ipc [BREAKING][worker, rollout, vllm] feat: implement vLLM colocated training-inference rollout with process separation #4280
- Router reply
- AgentLoop
- Refactor tool definition and registration
- Support multiple AgentLoopOutput for one sample: prompt switch, context compression, multi-agent, etc.
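The "multiple AgentLoopOutput for one sample" item above can be sketched roughly as follows. This is an illustrative sketch only, assuming a hypothetical `AgentLoopOutput` shape and a hypothetical `run_agent_loop` helper; it is not verl's actual API. The idea: when a prompt switch or context compression restarts generation, the loop emits a new output segment for the same sample instead of truncating the previous one.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical sketch: names below are illustrative, not verl's actual API.
@dataclass
class AgentLoopOutput:
    prompt_ids: List[int]
    response_ids: List[int]
    response_mask: List[int] = field(default_factory=list)

def run_agent_loop(sample_prompt_ids: List[int], max_len: int) -> List[AgentLoopOutput]:
    """Emit one AgentLoopOutput per context segment, e.g. after a
    context-compression step restarts generation with a fresh prompt."""
    outputs = []
    context = list(sample_prompt_ids)
    for step_response in ([7, 8, 9], [10, 11]):  # stand-in for two generation turns
        if len(context) + len(step_response) > max_len:
            # Compression: start a new segment with a shortened prompt
            # instead of truncating the earlier segment.
            context = context[-2:]
        outputs.append(AgentLoopOutput(
            prompt_ids=list(context),
            response_ids=list(step_response),
            response_mask=[1] * len(step_response),
        ))
        context += step_response
    return outputs
```

A multi-agent or prompt-switch scenario would populate the list the same way: one `AgentLoopOutput` per (prompt, response) segment belonging to the sample.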
Checkpoint Engine
- Add checkpoint engine abstract interface [ckpt] feat: add checkpoint-engine abstraction #4775
- Add NCCL, NIXL transport backends and more: [ckpt] feat: add Hccl ckpt engine backend #4885, [ckpt] feat: add kimi ckpt engine backend #4954
- Add checkpoint engine manager [ckpt] feat: add CheckpointEngineManager #5031
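To show what the checkpoint-engine abstraction above buys, here is a minimal sketch, assuming a hypothetical interface; the method names (`send_weights`/`recv_weights`) and the `InProcessEngine` backend are assumptions for illustration, not the interface added in #4775. The point is that trainer and rollout code program against one interface while backends (NCCL, HCCL, NIXL, etc.) vary.

```python
import abc

# Hypothetical sketch of a checkpoint-engine abstraction; method names are
# assumptions, not verl's actual interface.
class CheckpointEngine(abc.ABC):
    """Moves updated trainer weights to rollout workers over some transport."""

    @abc.abstractmethod
    def send_weights(self, state_dict: dict) -> None:
        ...

    @abc.abstractmethod
    def recv_weights(self) -> dict:
        ...

class InProcessEngine(CheckpointEngine):
    """Trivial backend for tests: the 'transport' is a local buffer. Real
    backends would wrap NCCL/HCCL broadcasts or NIXL RDMA transfers instead."""

    def __init__(self):
        self._buf = None

    def send_weights(self, state_dict: dict) -> None:
        self._buf = dict(state_dict)  # copy to mimic crossing a process boundary

    def recv_weights(self) -> dict:
        return self._buf
```

A manager layer (cf. #5031) would then select and coordinate the concrete backend per deployment.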
Trainer
- Online policy distillation
- New sync trainer with TransferQueue
- RFC: [RFC] PPOTrainer with TransferQueue Integration #5400
- New sync trainer with TransferQueue: [trainer] feat: add new trainer with TranferQueue #5401
- Fully async trainer
- Refactor one-step-off/fully async with model engine and checkpoint engine [fsdp, megatron] feat: refactor fully-async and one-step-off training to support multiple checkpoint engine backends #5029
- Remove PartialAgentLoop [rollout] feat: support auto resume on abort in FullyAsyncLLMServerManager #5430
- Standalone megatron worker group to recompute old_log_prob
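The one-step-off pattern mentioned above can be sketched in a few lines: rollout for step n+1 runs while the trainer consumes step n, bounding staleness to one step. This is a generic illustration with hypothetical names (`one_step_off`, `generate`, `train`), not verl's implementation.

```python
import queue
import threading

# Illustrative sketch of "one-step-off" training: the producer generates the
# batch for step n+1 while the consumer trains on step n.
def one_step_off(num_steps, generate, train):
    batches = queue.Queue(maxsize=1)  # depth 1 => at most one step of staleness

    def producer():
        for step in range(num_steps):
            batches.put(generate(step))  # blocks while one batch is already queued
        batches.put(None)  # sentinel: no more batches

    threading.Thread(target=producer, daemon=True).start()
    results = []
    while (batch := batches.get()) is not None:
        results.append(train(batch))
    return results
```

A fully async trainer generalizes this by decoupling the two sides further (multiple in-flight rollouts, weight sync via a checkpoint engine) rather than a depth-1 queue.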
Speculative Decoding
- Support MTP SFT/RL training [megatron] feat: Using MTP in RL Training and Inference #4936 [megatron] feat: Support MTP training in SFT #4981
- [rfc]:add speculator training scripts and checkpoint support #4947
Ascend NPU
Model Support List