Question: How does WVA map a vLLM deployment to its EPP for SLO-based scaling?

## Context

We are integrating WVA with KServe. We have seen mentions of SLO-based scaling being introduced in a future release and want to ensure KServe is ready for it when it lands.

## Questions

1. How does WVA pick EPP metrics for SLO-based scaling? Which metrics are used and where are they fetched from?

2. Can you point us to the logic that maps a vLLM deployment (the scale target) to its corresponding EPP? We want to understand the full chain so we can verify it works correctly in a KServe environment.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question: How does WVA map a vLLM deployment to its EPP for SLO-based scaling? #824

Context

Questions

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Question: How does WVA map a vLLM deployment to its EPP for SLO-based scaling? #824

Description

Context

Questions

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions