Milestone 3 - Launcher-Based Inference with Sleep/Wake (Test Release #5)
What's Changed
- Null out
serverDat.Sleepingwhen no vLLM instances associated yet by @waltforme in #359 - deps(actions): bump docker/login-action from 3.7.0 to 4.0.0 by @dependabot[bot] in #323
- deps(actions): bump actions/checkout from 4.2.2 to 6.0.2 by @dependabot[bot] in #324
- deps(actions): bump docker/setup-buildx-action from 3.12.0 to 4.0.0 by @dependabot[bot] in #325
- ci: fix actions/checkout version comments and pin by SHA by @MikeSpreitzer in #361
- deps(actions): bump docker/build-push-action from 6.18.0 to 7.0.0 by @dependabot[bot] in #326
- Add deploy_fma.sh and debug workflow for OCP E2E by @diegocastanibm in #357
- Discontinue the usage of LauncherGeneratedBy label by @waltforme in #365
- 🌱 Unify launcher unit testing by @MikeSpreitzer in #368
- Improve launcher logging - Part 2 by @diegocastanibm in #367
- Fix: Add enable-sleep-mode flag to enable sleep mode for vllm server by @aavarghese in #376
- deps(actions): bump actions/setup-go from 6.2.0 to 6.3.0 by @dependabot[bot] in #360
- Include creation parameters inline in launcher instance state replies by @MikeSpreitzer in #369
- 🌱 Hot fix to e2e test on Openshift by @MikeSpreitzer in #382
- Sync unbound launcher-based server-providing pods by @waltforme in #362
- Preserve the final state at the end of the e2e test in kind by @waltforme in #390
- Extract launcher E2E test scenarios into reusable script by @MikeSpreitzer in #386
- Pin ko base image to chainguard/static digest by @MikeSpreitzer in #392
- deps(actions): bump docker/setup-qemu-action from 3.2.0 to 4.0.0 by @dependabot[bot] in #370
- deps(actions): bump actions/cache from 5.0.3 to 5.0.4 by @dependabot[bot] in #371
- deps(actions): bump docker/metadata-action from 5.10.0 to 6.0.0 by @dependabot[bot] in #372
- deps(go): bump the kubernetes group across 1 directory with 3 updates by @dependabot[bot] in #373
- deps: bump code-generator from v0.34.2 to v0.34.6 by @MikeSpreitzer in #393
- Dump logs for every container in e2e test by @waltforme in #394
- Self-annotation on launcher pods to signal hosted instance changes by @waltforme in #391
Full Changelog: v0.5.1-alpha.4...v0.5.1-alpha.5