[Bug]: scheduler issue with PD disaggregation

### Your current environment

<details>
<summary>The output of <code>python collect_env.py</code></summary>

vllm v0.11.0

</details>


### 🐛 Describe the bug

Hello, I got an issue of scheduler when deploying with PD disaggregation:
Since current scheduling strategy doesn't free blocks occupied by requests with WAITING_FOR_REMOTE_KVS state, will the server stuck in certain scenarios? 
For example, in step 4, the secheduler will allocate blocks for request 1 fisrt since it was put back to the front of the waiting queue in step 3. Then request 2 will never get into running queue since it requires more blocks for next token and the scheduler will get stuck in the loop from step 2 to step 4.

<img width="476" height="471" alt="Image" src="https://github.com/user-attachments/assets/c4150ee6-9ce7-40f3-b6d1-b49b30ba278e" />

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bug]: scheduler issue with PD disaggregation #30659

Your current environment

🐛 Describe the bug

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Bug]: scheduler issue with PD disaggregation #30659

Description

Your current environment

🐛 Describe the bug

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions