[TensorRT-LLM] KVCacheManager.add_sequence_with_eviction API compatibility issue #11387
Unanswered
ShaobinChen-AH
asked this question in
Q&A
Replies: 1 comment
-
|
How to solve this ? Should I remove final argument i.e., user_ids_set ? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Description
I'm using
tensorrt_llm.bindings.internal.batch_manager.KVCacheManagerfor KV cache management in a recommendation system. There seems to be an API compatibility issue with theadd_sequence_with_evictionmethod.Environment
Problem
The following code worked in an older version of TensorRT-LLM but now fails:
Beta Was this translation helpful? Give feedback.
All reactions