Skip to content

Initial E/PD extension for kind deployment#648

Open
revit13 wants to merge 3 commits intollm-d:mainfrom
revit13:kind_encoder
Open

Initial E/PD extension for kind deployment#648
revit13 wants to merge 3 commits intollm-d:mainfrom
revit13:kind_encoder

Conversation

@revit13
Copy link

@revit13 revit13 commented Feb 24, 2026

This PR is part of the implementation for #608. It extends the kind deployment to support Encode disaggregation.

This update configures the deployment to spin up a disaggregated architecture consisting of:

  • 1 Encoder pod

  • 1 Prefill/Decode (PD) pod

Notes:

Usage:

export EPD_ENABLED=true
export VLLM_SIMULATOR_IMAGE=ghcr.io/revit13/vllm-cpu-env:latest
export MODEL_NAME=Qwen/Qwen3-VL-2B-Instruct
make env-dev-kind

Signed-off-by: Revital Sur <eres@il.ibm.com>
Signed-off-by: Revital Sur <eres@il.ibm.com>
Signed-off-by: Revital Sur <eres@il.ibm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: No status

Development

Successfully merging this pull request may close these issues.

1 participant