Commit bab39ce
compat(dp,kv): drop removed DPMetadata.max_tokens_across_dp_cpu; add apply_admission_cap param
- vLLM 0.22 removed DPMetadata.max_tokens_across_dp_cpu; the max is
derivable from num_tokens_across_dp_cpu and has no consumers here.
- RBLNSlidingWindowManager: accept apply_admission_cap kwarg.
Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
1 parent b4fe0af commit bab39ce
2 files changed
Lines changed: 1 addition & 4 deletions
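The first bullet of the commit message says the removed maximum is derivable from `num_tokens_across_dp_cpu`. A minimal sketch of that derivation follows. It assumes the field holds one token count per data-parallel rank; `DPMetadataSketch` and `max_tokens_across_dp` are hypothetical stand-ins, not vLLM's API, and a plain list replaces the tensor so the example runs without extra dependencies. With a real tensor the same reduction would be a `max` over its elements.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DPMetadataSketch:
    """Hypothetical stand-in that keeps only the per-rank token counts."""

    # Assumption: one entry per data-parallel rank.
    num_tokens_across_dp_cpu: List[int]


def max_tokens_across_dp(meta: DPMetadataSketch) -> int:
    """Recompute on demand the value a cached max field would have held."""
    return max(meta.num_tokens_across_dp_cpu)


meta = DPMetadataSketch(num_tokens_across_dp_cpu=[128, 512, 64, 256])
print(max_tokens_across_dp(meta))  # 512
```

Because the value is a pure function of the per-rank counts, a caller that needs it can compute it locally instead of relying on a stored field.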
| Hunk | Original file lines | Diff lines | Change |
|---|---|---|---|
| 1 | 166-172 | 166-171 | original line 169 deleted |
| 2 | 178-188 | 177-185 | original lines 181 and 185 deleted |
| 3 | 221-227 | 218-223 | original line 224 deleted |
| Hunk | Original file lines | Diff lines | Change |
|---|---|---|---|
| 1 | 53-58 | 53-59 | new line 56 added |
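The second bullet of the commit message adds an `apply_admission_cap` kwarg so the manager keeps working when a caller starts passing it. The sketch below illustrates that kind of signature-compatibility fix in general terms. Everything except the kwarg name is an assumption: the class names, the `blocks_needed` method, the block size of 16, and the choice to accept the flag but ignore it are hypothetical and not taken from vLLM or from this commit.

```python
BLOCK_SIZE = 16  # hypothetical block size, for illustration only


class OldOverride:
    """Override written before the caller started passing the new kwarg."""

    def blocks_needed(self, num_tokens: int) -> int:
        return -(-num_tokens // BLOCK_SIZE)  # ceiling division


class NewOverride:
    """Override that accepts the new kwarg with a default value."""

    def blocks_needed(self, num_tokens: int, apply_admission_cap: bool = False) -> int:
        # The flag is accepted for signature compatibility and ignored in this sketch.
        return -(-num_tokens // BLOCK_SIZE)


def caller(manager) -> int:
    """Stands in for upstream code that now always passes the kwarg."""
    return manager.blocks_needed(40, apply_admission_cap=True)


try:
    caller(OldOverride())
except TypeError as exc:
    print("old override rejects the kwarg:", type(exc).__name__)

print(caller(NewOverride()))  # 3
```

Giving the parameter a default keeps existing call sites that omit it working, so a one-line signature change is enough.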