Skip to content

docs: add autoscale-inference-workloads-with-kaito blog#5507

Merged
sdesai345 merged 21 commits intomasterfrom
autoscale-inference-workloads-with-kaito
Feb 3, 2026
Merged

docs: add autoscale-inference-workloads-with-kaito blog#5507
sdesai345 merged 21 commits intomasterfrom
autoscale-inference-workloads-with-kaito

Conversation

@andyzhangx
Copy link
Copy Markdown
Contributor

No description provided.

Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR adds a new blog post about autoscaling KAITO inference workloads on AKS using KEDA. The post introduces the alpha autoscaling feature released in KAITO v0.8.0 and provides a comprehensive guide for enabling intelligent autoscaling based on service monitoring metrics.

Key Changes

  • New blog post documenting KAITO inference workload autoscaling with KEDA
  • Includes architecture overview, prerequisites, installation steps, and quickstart guide
  • Demonstrates using the new InferenceSet CRD with KEDA's external scaler pattern

Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
@andyzhangx andyzhangx force-pushed the autoscale-inference-workloads-with-kaito branch from 6d99ae0 to 7ba066d Compare December 15, 2025 15:18
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Copy link
Copy Markdown
Contributor

@sdesai345 sdesai345 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added some comments!

@andyzhangx andyzhangx requested a review from Copilot January 8, 2026 13:32
@andyzhangx
Copy link
Copy Markdown
Contributor Author

@pauldotyu @sdesai345 I have addressed all you comments, could you take a look again? thx

Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Copy link
Copy Markdown
Contributor

@pauldotyu pauldotyu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor suggestions mostly around casing of "KEDA KAITO Scaler" throughout the doc.

Comment thread website/blog/2025-12-11-autoscale-inference-workloads-with-kaito/index.md Outdated
Copilot AI review requested due to automatic review settings January 28, 2026 08:38
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 2 out of 3 changed files in this pull request and generated 5 comments.

Comment thread website/blog/2026-02-03-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2026-02-03-autoscale-inference-workloads-with-kaito/index.md Outdated
Copy link
Copy Markdown
Contributor

@sanketbakshi1981 sanketbakshi1981 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@sdesai345
Copy link
Copy Markdown
Contributor

sdesai345 commented Jan 29, 2026

@andyzhangx can you update the folder name to reflect today's date (2026-01-29) before merging?

Copilot AI review requested due to automatic review settings February 2, 2026 07:53
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 2 out of 3 changed files in this pull request and generated 10 comments.

Comment thread website/blog/2026-02-03-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2026-02-03-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2026-02-03-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2026-02-03-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2026-02-03-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2026-02-03-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2026-02-03-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2026-02-03-autoscale-inference-workloads-with-kaito/index.md Outdated
Comment thread website/blog/2026-02-03-autoscale-inference-workloads-with-kaito/index.md Outdated
@andyzhangx andyzhangx removed request for a team, palma21 and seanmck February 3, 2026 01:48
@sdesai345 sdesai345 merged commit 0e247cd into master Feb 3, 2026
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants