Skip to content

v0.8.0

Latest

Choose a tag to compare

@github-actions github-actions released this 20 Dec 06:36
· 11 commits to main since this release
v0.8.0
4fc5af6

v0.8.0 - 2025-12-20

This release introduces a breaking change such that the inference workload is unified to StatefulSet. The Deployment resources created by existing workspaces will be removed by the controller and new StatefulSet resources will be created instead. No manual operation is required for this migration, and it is expected that the inference server hits a short period of downtime due to the Pod recreation.

Changelog

Breaking Changes 💥

  • 3ab3f3d feat: [BREAKING] use statefulset for all workspace (#1523)

Features 🌈

  • b966484 feat: update gpu-provisioner version to v0.3.8 for kaito (#1698)
  • 91819b9 feat: preset-generator support generic model format and attn arch (#1690)

Bug Fixes 🐞

Code Refactoring 💎

  • 47fcd2e refactor: make sku-calculation a generic preset generator (#1689)

Documentation 📘

  • 318bf01 docs: fix namespace doc issue in keda-kaito-scaler (#1699)
  • 87c9c32 docs: use kaito-workspace in keda install (#1694)
  • eefd2b8 docs: add keda-autoscaler-inference scaling example in doc (#1682)
  • bbe61d7 docs: refine naming in docs and examples (#1681)
  • c78d68b docs: add keda-autoscaler-inference doc (#1679)

Maintenance 🔧

  • 67deec5 chore: bump golang to 1.24.11 (#1695)
  • 89aba34 chore: use pv cleaner from localcsi manager (#1687)
  • 7911b00 chore: fix huggingface_hub version in preset_generator (#1693)
  • 0fabc5c chore: bump ray to 0.25.1 (#1684)
  • 3d33b89 chore: bump js-yaml from 3.14.1 to 3.14.2 in /website (#1647)
  • 601ad7b chore: bump mdast-util-to-hast from 13.2.0 to 13.2.1 in /website (#1657)
  • e1efaa8 chore: e2e tests for pv support in RAG engine service (#1671)

Testing 💚