Skip to content

Commit 006aea1

Browse files
authored
[BugFix] Remove incorrect assert in split_decodes_and_prefills (vllm-project#36553)
Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>
1 parent 0836be3 commit 006aea1

File tree

1 file changed

+0
-1
lines changed

1 file changed

+0
-1
lines changed

vllm/v1/attention/backends/utils.py

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -528,7 +528,6 @@ def split_decodes_and_prefills(
528528
# requests may have a query length of 0 but since they are padding its fine
529529
# to treat them as decodes (ensures num_decodes matches the captured size)
530530
if torch.all((query_lens == query_lens[0]) | (query_lens == 0)):
531-
assert num_reqs * query_lens[0] == num_tokens, "tokens not padded correctly"
532531
return num_reqs, 0, num_tokens, 0 # all decodes
533532
is_prefill = query_lens != query_lens[0]
534533
else:

0 commit comments

Comments
 (0)