Skip to content

Commit b720300

Browse files
committed
fix: use BlockSizeTokens param for estimated prefix cache
Signed-off-by: threcc <trecchiu@redhat.com>
1 parent a8a8193 commit b720300

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

tests/model_serving/model_server/llmd/llmd_configs/config_estimated_prefix_cache.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@ def _scheduler_config(cls):
3737
{
3838
"type": "prefix-cache-scorer",
3939
"parameters": {
40-
"blockSize": cls.block_size,
40+
"BlockSizeTokens": cls.block_size,
4141
"maxPrefixBlocksToMatch": 256,
4242
"lruCapacityPerServer": 31250,
4343
},

0 commit comments

Comments
 (0)