agentgateway: test llm-d support #12451

@npolshakova

Description

Goal:
Support kgateway with the agentgateway data plane in llm-d, building on the existing llm-d kgateway instructions for Envoy ext_proc (see the existing docs: https://llm-d.ai/docs/architecture/Components/infra#prerequisites).

At a high level, this will require:

Context / Background:

  • kgateway with the Envoy data plane is already supported as an Inference Gateway in llm-d-infra.
  • The inference extension itself does not currently emit metrics. EPP and vLLM do emit metrics, and these are documented upstream.

Open Questions / Next Steps:

Metadata

Labels

  • Area: Inference (Activities related to Gateway API Inference Extension support.)
  • Area: agentgateway
  • Priority: High (Required in next 3 months to make progress, bugs that affect multiple users, or very bad UX)
