Adds EP-10494: AI APIs enhancement proposal #10495
Conversation
This proposal LGTM
```yaml
spec:
  ai:
    vertexAi:
      model: gemini-1.5-flash-001
```
Should we mention what happens if the URL or the request body contains a model value that differs from the model value in the Upstream?
Yep, I'll also add a sentence about what happens if the model is not provided. We use the Upstream value in the case of a mismatch, right?
```yaml
name: openai-secret
```

Notice that this Upstream does not specify a model, so kgateway will use the model value in the request to determine…
> kgateway will use the model value in the request

What does this mean? Might be worth elaborating here.
I can add a specific request example in a follow-up, but for context, this field isn't strictly required:
```yaml
spec:
  ai:
    openai:
      model: my-model
```
You can specify a model in the request body instead, but if you specify both, the Upstream value wins out. (This note is more of an implementation detail, but it was requested in an earlier review to clarify the behavior.)
Right, but my question is: how is it specified in the request body?
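For context, and purely as an illustrative sketch (the model name and path here are examples, not taken from the proposal): OpenAI-compatible chat completion requests carry the model as a top-level field in the JSON body sent to `/v1/chat/completions`, e.g.:

```json
{
  "model": "gpt-4o-mini",
  "messages": [
    { "role": "user", "content": "What is kgateway?" }
  ]
}
```

If the Upstream also sets `spec.ai.openai.model`, the Upstream value takes precedence, per the behavior described above.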
```yaml
filters:
  - type: ExtensionRef
    extensionRef:
      group: gateway.kgateway.dev/v1alpha1
      kind: RoutePolicy
      name: open-ai-opt
```
Prefer using `targetRef` attachment on the RoutePolicy rather than `ExtensionRef`?
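For illustration only, and with the caveat that the attachment field names below are assumptions rather than kgateway's confirmed RoutePolicy API, a targetRef-style attachment at the route level would look roughly like:

```yaml
apiVersion: gateway.kgateway.dev/v1alpha1
kind: RoutePolicy
metadata:
  name: open-ai-opt
spec:
  # Hypothetical Gateway API style policy attachment; the exact field name
  # (targetRef vs targetRefs) in kgateway's RoutePolicy is an assumption.
  targetRefs:
    - group: gateway.networking.k8s.io
      kind: HTTPRoute
      name: openai-route   # hypothetical route name
```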
See the comment at line 49 in f7cb884:

> Note: For the first implementation, a RoutePolicy must be applied to a specific route in the HTTPRoute using an `extensionRef`.
This is worrying, because ExtensionRef is effectively redundant given named route rules.
I think we'll want to treat this as an important consideration for the plugin system if things are easier to express as ExtensionRefs than via policy attachment.
cc @yuval-k
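To sketch why named route rules could cover the same use case (the rule and resource names here are hypothetical, and the sectionName-based targeting follows the upstream Gateway API policy-attachment pattern rather than a confirmed kgateway API):

```yaml
# Hypothetical HTTPRoute with a named rule (rule names per Gateway API GEP-995).
apiVersion: gateway.networking.k8s.io/v1
kind: HTTPRoute
metadata:
  name: openai-route
spec:
  rules:
    - name: openai            # named rule
      matches:
        - path:
            type: PathPrefix
            value: /openai
---
# Hypothetical RoutePolicy attached to that specific rule via sectionName,
# instead of referencing the policy from the rule with an ExtensionRef filter.
apiVersion: gateway.kgateway.dev/v1alpha1
kind: RoutePolicy
metadata:
  name: open-ai-opt
spec:
  targetRefs:
    - group: gateway.networking.k8s.io
      kind: HTTPRoute
      name: openai-route
      sectionName: openai
```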
These changes make sense to me
Description
Adds Enhancement Proposal 10494, which proposes adding support for AI Gateway APIs.
Supports #10494
Initial APIs can be found in this draft PR: #10493