[Feature] Support Decoupled Speculation for SGLang Rollout

@Sparrow612
### Description

As described in https://arxiv.org/pdf/2511.16193, the Decoupled Speculation method can significantly accelerate the rollout phase in LLM post-training (RL). This issue proposes integrating this decoupling mechanism into the framework's rollout phase.

### Implementation Plan

Implement a Decoupled Speculation scheduler and runtime as outlined in the paper. The core change is to decouple the draft and verification steps: 
- allow the drafter and verifier to run asynchronously on separate GPU resources 
- enabling the drafter to proceed with drafting the next window of tokens without waiting for the verification result of the previous window

### Version Info

- veRL v0.7.0
- SGLang (Rollout backend) v0.5.8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Support Decoupled Speculation for SGLang Rollout #5559

Description

Implementation Plan

Version Info

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature] Support Decoupled Speculation for SGLang Rollout #5559

Description

Description

Implementation Plan

Version Info

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions