
fix: simplify determine_available_memory #317

Merged

rebel-jaehwang merged 1 commit into dev-0.12 from avail-mem on Jan 30, 2026

Conversation

@rebel-jaehwang (Contributor) commented on Jan 30, 2026

  • We don't need to compute the number of blocks, since the vLLM
    allocator already does it, properly accounting for different layer
    types.
  • Remove VLLM_RBLN_NPU_NUM_BLOCKS. Users should use the standard
    gpu_memory_utilization config instead.
  • Don't cap the result with the peak memory used by the active
    request, since the prefix cache can utilize the extra memory (see
    the sketch below).
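
To make the intent concrete, here is a minimal sketch of the simplified logic, not the plugin's actual implementation: the helpers `_total_npu_memory_bytes()` and `_model_memory_bytes()` are hypothetical placeholders for device-memory queries, while `cache_config.gpu_memory_utilization` is the standard vLLM option.

```python
# Minimal sketch under the assumptions above (not the real plugin code).

def determine_available_memory(self) -> int:
    """Return the NPU memory, in bytes, that the KV cache may use.

    The number of KV cache blocks is intentionally NOT computed here:
    vLLM's KV cache allocator derives it from this byte budget, properly
    accounting for different layer types (e.g. full vs. sliding-window
    attention).
    """
    total_bytes = self._total_npu_memory_bytes()   # hypothetical helper
    model_bytes = self._model_memory_bytes()       # hypothetical helper

    # Use the standard gpu_memory_utilization knob instead of the removed
    # VLLM_RBLN_NPU_NUM_BLOCKS environment variable.
    budget = int(total_bytes * self.cache_config.gpu_memory_utilization)

    # No extra min() against the peak memory of active requests: any
    # surplus memory is put to use by the prefix cache.
    return max(0, budget - model_bytes)
```

With the environment variable gone, users size the KV cache through the standard option, for example (illustrative value):

```python
from vllm import LLM

llm = LLM(model="<your-model>", gpu_memory_utilization=0.85)
```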

@rebel-wonsubkim (Contributor) left a comment

LGTM, thanks a lot

@rebel-jaehwang merged commit 4a52b54 into dev-0.12 on Jan 30, 2026
1 check passed
@rebel-jaehwang deleted the avail-mem branch on January 30, 2026, 17:06
rebel-jaehwang added a commit that referenced this pull request Jan 30, 2026
rebel-jaehwang added a commit that referenced this pull request Jan 30, 2026
rebel-jaehwang added a commit that referenced this pull request Jan 30, 2026
rebel-jiwoopark pushed a commit that referenced this pull request Feb 4, 2026
