Chunk size in streaming #308

Open

opened

on Jan 21, 2026

We currently send each response token in a separate chunk. We should consider a better approach, probably using VLLM_V1_OUTPUT_PROC_CHUNK_SIZE.

Metadata

Assignees

No one assigned

Labels

No labels

No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests