Commit 0c96763
committed
bump vllm-proxy-rs to 59e42dd6 + plumb CLOUD_API_USAGE_TOKEN
Brings all three compose files onto the latest reproducible
vllm-proxy-rs image (sha256:59e42dd6, :latest as of 2026-05-26
13:33Z — includes nearai/inference-proxy#143's service-token
reporting path plus subsequent main merges) and adds the
CLOUD_API_USAGE_TOKEN env var to every vllm-proxy-* service.
- GLM-5.1.yaml: was already on 59e42dd6 (bumped in a prior commit);
adds the env var to proxy-glm51.
- Qwen3.5-122B.yaml: 05ad3e83 → 59e42dd6 + env var on vllm-proxy-qwen35.
- small-models.yaml: 05ad3e83 → 59e42dd6 (shared anchor) + env var on
all 10 vllm-proxy-* services.
The env var is the inference-proxy half of the usage-endpoint lockdown
(cloud-api#665 added POST /v1/internal/usage; cloud-api side is already
live on staging + prod). When the host .env doesn't define
CLOUD_API_USAGE_TOKEN, Docker Compose interpolates "" → inference-proxy
coerces empty to None → can_use_service_token_path() is false → the
reporter stays on the legacy POST /v1/usage + sk- path. So this is
safe to deploy before the token is set on inference CVMs; the switch
to /v1/internal/usage only flips once the same secret (matching the
cloud-api side) is present in the host env.
Supersedes #46 (which targeted the older c1208db4 digest and predated
main's GLM-5.1 bump).1 parent 18bc809 commit 0c96763
3 files changed
Lines changed: 14 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
86 | 86 | | |
87 | 87 | | |
88 | 88 | | |
| 89 | + | |
89 | 90 | | |
90 | 91 | | |
91 | 92 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
15 | 15 | | |
16 | 16 | | |
17 | 17 | | |
18 | | - | |
| 18 | + | |
19 | 19 | | |
20 | 20 | | |
21 | 21 | | |
| |||
132 | 132 | | |
133 | 133 | | |
134 | 134 | | |
| 135 | + | |
135 | 136 | | |
136 | 137 | | |
137 | 138 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
30 | | - | |
| 30 | + | |
31 | 31 | | |
32 | 32 | | |
33 | 33 | | |
| |||
323 | 323 | | |
324 | 324 | | |
325 | 325 | | |
| 326 | + | |
326 | 327 | | |
327 | 328 | | |
328 | 329 | | |
| |||
379 | 380 | | |
380 | 381 | | |
381 | 382 | | |
| 383 | + | |
382 | 384 | | |
383 | 385 | | |
384 | 386 | | |
| |||
430 | 432 | | |
431 | 433 | | |
432 | 434 | | |
| 435 | + | |
433 | 436 | | |
434 | 437 | | |
435 | 438 | | |
| |||
481 | 484 | | |
482 | 485 | | |
483 | 486 | | |
| 487 | + | |
484 | 488 | | |
485 | 489 | | |
486 | 490 | | |
| |||
552 | 556 | | |
553 | 557 | | |
554 | 558 | | |
| 559 | + | |
555 | 560 | | |
556 | 561 | | |
557 | 562 | | |
| |||
580 | 585 | | |
581 | 586 | | |
582 | 587 | | |
| 588 | + | |
583 | 589 | | |
584 | 590 | | |
585 | 591 | | |
| |||
632 | 638 | | |
633 | 639 | | |
634 | 640 | | |
| 641 | + | |
635 | 642 | | |
636 | 643 | | |
637 | 644 | | |
| |||
675 | 682 | | |
676 | 683 | | |
677 | 684 | | |
| 685 | + | |
678 | 686 | | |
679 | 687 | | |
680 | 688 | | |
| |||
718 | 726 | | |
719 | 727 | | |
720 | 728 | | |
| 729 | + | |
721 | 730 | | |
722 | 731 | | |
723 | 732 | | |
| |||
767 | 776 | | |
768 | 777 | | |
769 | 778 | | |
| 779 | + | |
770 | 780 | | |
771 | 781 | | |
772 | 782 | | |
| |||
0 commit comments