Milestone 3 - Launcher-Based Inference with Sleep/Wake (Test Release #5)

@aavarghese released this 31 Mar 14:01 · 9 commits to main since this release · 162829c

What's Changed

  • Null out serverDat.Sleeping when no vLLM instances associated yet by @waltforme in #359
  • deps(actions): bump docker/login-action from 3.7.0 to 4.0.0 by @dependabot[bot] in #323
  • deps(actions): bump actions/checkout from 4.2.2 to 6.0.2 by @dependabot[bot] in #324
  • deps(actions): bump docker/setup-buildx-action from 3.12.0 to 4.0.0 by @dependabot[bot] in #325
  • ci: fix actions/checkout version comments and pin by SHA by @MikeSpreitzer in #361
  • deps(actions): bump docker/build-push-action from 6.18.0 to 7.0.0 by @dependabot[bot] in #326
  • Add deploy_fma.sh and debug workflow for OCP E2E by @diegocastanibm in #357
  • Discontinue the usage of LauncherGeneratedBy label by @waltforme in #365
  • 🌱 Unify launcher unit testing by @MikeSpreitzer in #368
  • Improve launcher logging - Part 2 by @diegocastanibm in #367
  • Fix: Add enable-sleep-mode flag to enable sleep mode for vllm server by @aavarghese in #376
  • deps(actions): bump actions/setup-go from 6.2.0 to 6.3.0 by @dependabot[bot] in #360
  • Include creation parameters inline in launcher instance state replies by @MikeSpreitzer in #369
  • 🌱 Hot fix to e2e test on Openshift by @MikeSpreitzer in #382
  • Sync unbound launcher-based server-providing pods by @waltforme in #362
  • Preserve the final state at the end of the e2e test in kind by @waltforme in #390
  • Extract launcher E2E test scenarios into reusable script by @MikeSpreitzer in #386
  • Pin ko base image to chainguard/static digest by @MikeSpreitzer in #392
  • deps(actions): bump docker/setup-qemu-action from 3.2.0 to 4.0.0 by @dependabot[bot] in #370
  • deps(actions): bump actions/cache from 5.0.3 to 5.0.4 by @dependabot[bot] in #371
  • deps(actions): bump docker/metadata-action from 5.10.0 to 6.0.0 by @dependabot[bot] in #372
  • deps(go): bump the kubernetes group across 1 directory with 3 updates by @dependabot[bot] in #373
  • deps: bump code-generator from v0.34.2 to v0.34.6 by @MikeSpreitzer in #393
  • Dump logs for every container in e2e test by @waltforme in #394
  • Self-annotation on launcher pods to signal hosted instance changes by @waltforme in #391

Full Changelog: v0.5.1-alpha.4...v0.5.1-alpha.5