Skip to content

Commit 0969842

Browse files
committed
remove redundant lines
Signed-off-by: wangli <[email protected]>
1 parent 500b573 commit 0969842

File tree

3 files changed

+10
-12
lines changed

3 files changed

+10
-12
lines changed

vllm_ascend/pool/__init__.py

Whitespace-only changes.

vllm_ascend/pool/medatata.py

Lines changed: 0 additions & 11 deletions
This file was deleted.

vllm_ascend/worker/npu_input_batch.py

Lines changed: 10 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,10 +26,19 @@
2626
LogitsProcessors)
2727
from vllm.v1.worker.gpu_input_batch import InputBatch
2828

29-
from vllm_ascend.pool.medatata import PoolingStates
3029
from vllm_ascend.worker.block_table import MultiGroupBlockTable
3130

3231

32+
class PoolingStates:
33+
# NOTE: This should be removed after we drop support of vLLM v0.12.0
34+
def __init__(self):
35+
# for chunked prefill with ALL pooling
36+
self.hidden_states_cache: list[torch.Tensor] = []
37+
38+
def clean(self):
39+
self.hidden_states_cache.clear()
40+
41+
3342
class NPUInputBatch(InputBatch):
3443

3544
def __init__(

0 commit comments

Comments
 (0)