feat: configure default max_tokens for anthropic translator by herewasmike · Pull Request #1933 · envoyproxy/ai-gateway

herewasmike · 2026-03-09T22:56:32Z

Description

max_tokens and max_completion_tokens are optional in openAI spec
https://developers.openai.com/api/reference/resources/chat/subresources/completions/methods/create

However when I try to send the request from the source I get 422 if my AIServiceBackend has AWSAnthropic schema.
It's impossible to add this field with bodyMutation due to ordering (I've considered mutating body before the translation but it seems to be a whole redesign rather than small fix with providing default)

Special notes for reviewers (if applicable)

Per gen AI policy I disclose that claude did help me with setting up this PR

Signed-off-by: Mikhail Toldov <matoldov@gmail.com>

dosubot · 2026-03-09T22:58:55Z

Related Documentation

3 document(s) may need updating based on files changed in this PR:

Envoy's Space

gcp-vertexai `/ai-gateway/blob/main/site/docs/getting-started/connect-providers/gcp-vertexai.md`

View Suggested Changes

@@ -104,6 +104,10 @@
   $GATEWAY_URL/v1/chat/completions
 ```
 
+:::note
+The `max_completion_tokens` parameter (or `max_tokens`) is optional and defaults to 4096 if not specified. The example above includes it to demonstrate setting an explicit limit.
+:::
+
 Expected output:
 
 ```json

[Accept] [Decline]

gcp-vertexai `/ai-gateway/blob/main/site/versioned_docs/version-0.4/getting-started/connect-providers/gcp-vertexai.md`

View Suggested Changes

@@ -101,6 +101,10 @@
   $GATEWAY_URL/v1/chat/completions
 ```
 
+:::note
+The `max_completion_tokens` parameter is optional and defaults to 4096 if not specified. It's recommended to set it explicitly to control response length and costs.
+:::
+
 Expected output:
 
 ```json
@@ -136,6 +140,10 @@
   }' \
   $GATEWAY_URL/anthropic/v1/messages
 ```
+
+:::note
+The `max_tokens` parameter is optional and defaults to 4096 if not specified. It's recommended to set it explicitly to control response length and costs.
+:::
 
 ## Troubleshooting

[Accept] [Decline]

gcp-vertexai `/ai-gateway/blob/main/site/versioned_docs/version-0.5/getting-started/connect-providers/gcp-vertexai.md`

View Suggested Changes

@@ -104,6 +104,10 @@
   $GATEWAY_URL/v1/chat/completions
 ```
 
+:::note
+The `max_completion_tokens` parameter is optional. If not specified, it defaults to 4096 tokens.
+:::
+
 Expected output:
 
 ```json
@@ -139,6 +143,10 @@
   }' \
   $GATEWAY_URL/anthropic/v1/messages
 ```
+
+:::note
+The `max_tokens` parameter is optional. If not specified, it defaults to 4096 tokens.
+:::
 
 ## Troubleshooting

[Accept] [Decline]

Note: You must be authenticated to accept/decline updates.

^{How did I do? Any feedback?}

Signed-off-by: Mikhail Toldov <matoldov@gmail.com>

herewasmike · 2026-03-09T23:39:39Z

I'm not sure if arbitrary default value + bodyMutator is the best approach.
Though, no idea how anthropic api reacts if supplied with max_tokens value greater than the model supports.
Happy to take suggestions here

codecov-commenter · 2026-03-10T00:29:34Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 84.33%. Comparing base (2d35d43) to head (ef7a290).

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1933      +/-   ##
==========================================
- Coverage   84.33%   84.33%   -0.01%     
==========================================
  Files         130      130              
  Lines       17987    17986       -1     
==========================================
- Hits        15170    15169       -1     
  Misses       1873     1873              
  Partials      944      944

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

nutanix-Hrushikesh · 2026-03-10T10:01:43Z

Though, no idea how anthropic api reacts if supplied with max_tokens value greater than the model supports.

it returns 400 Bad Request

johnugeorge · 2026-03-15T13:09:45Z

This might be a bit confusing if low default is observed by default. Isn't it better for the client to be aware of and use the right values? Are you using a client where it is not configurable?

herewasmike · 2026-03-15T13:39:32Z

Yes, I'm connecting 3rd party code in this particular case so it's not easy to change the client's behavior.
I'll rework this PR following up the conversation on slack, though, to avoid the confusion

Simply allow requests to pass through and fail on the provider side (one would be able to detect it and mutate request body)

Signed-off-by: Mikhail Toldov <matoldov@gmail.com>

herewasmike · 2026-03-31T16:25:39Z

/retest

herewasmike requested a review from a team as a code owner March 9, 2026 22:56

dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Mar 9, 2026

feat: configure default max_tokens for anthropic translator

dbd525e

Signed-off-by: Mikhail Toldov <matoldov@gmail.com>

herewasmike force-pushed the openai_awsanthropic_translation branch from 0247bb7 to dbd525e Compare March 9, 2026 22:56

Merge branch 'main' into openai_awsanthropic_translation

b3c3758

Modify dataplane test to expect default value

e79d6b0

Signed-off-by: Mikhail Toldov <matoldov@gmail.com>

dosubot bot added size:M This PR changes 30-99 lines, ignoring generated files. and removed size:S This PR changes 10-29 lines, ignoring generated files. labels Mar 9, 2026

herewasmike added 2 commits March 18, 2026 23:00

Set default max_tokens as 0 to fail on provider

43a25ce

Signed-off-by: Mikhail Toldov <matoldov@gmail.com>

Merge branch 'main' into openai_awsanthropic_translation

607993e

herewasmike force-pushed the openai_awsanthropic_translation branch from 284e5be to b5efeb0 Compare March 18, 2026 23:02

dosubot bot added size:XXL This PR changes 1000+ lines, ignoring generated files. and removed size:M This PR changes 30-99 lines, ignoring generated files. labels Mar 18, 2026

herewasmike force-pushed the openai_awsanthropic_translation branch from b5efeb0 to 284e5be Compare March 18, 2026 23:04

dosubot bot added size:M This PR changes 30-99 lines, ignoring generated files. and removed size:XXL This PR changes 1000+ lines, ignoring generated files. labels Mar 18, 2026

herewasmike added 2 commits March 19, 2026 00:06

Add aws-anthropic test for v1/chat/completions

5f020f7

Signed-off-by: Mikhail Toldov <matoldov@gmail.com>

Validate max_tokens field in translator unittests

2810ba3

Signed-off-by: Mikhail Toldov <matoldov@gmail.com>

herewasmike force-pushed the openai_awsanthropic_translation branch from 284e5be to 2810ba3 Compare March 18, 2026 23:06

herewasmike added 3 commits March 20, 2026 12:43

Merge branch 'main' into openai_awsanthropic_translation

174ae5d

Merge branch 'main' into openai_awsanthropic_translation

2088589

Merge branch 'main' into openai_awsanthropic_translation

ef7a290

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: configure default max_tokens for anthropic translator#1933

feat: configure default max_tokens for anthropic translator#1933
herewasmike wants to merge 10 commits intoenvoyproxy:mainfrom
herewasmike:openai_awsanthropic_translation

herewasmike commented Mar 9, 2026

Uh oh!

dosubot bot commented Mar 9, 2026

Uh oh!

herewasmike commented Mar 9, 2026

Uh oh!

codecov-commenter commented Mar 10, 2026 •

edited

Loading

Uh oh!

nutanix-Hrushikesh commented Mar 10, 2026

Uh oh!

johnugeorge commented Mar 15, 2026

Uh oh!

herewasmike commented Mar 15, 2026

Uh oh!

herewasmike commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

herewasmike commented Mar 9, 2026

Uh oh!

dosubot bot commented Mar 9, 2026

gcp-vertexai /ai-gateway/blob/main/site/docs/getting-started/connect-providers/gcp-vertexai.md

gcp-vertexai /ai-gateway/blob/main/site/versioned_docs/version-0.4/getting-started/connect-providers/gcp-vertexai.md

gcp-vertexai /ai-gateway/blob/main/site/versioned_docs/version-0.5/getting-started/connect-providers/gcp-vertexai.md

Uh oh!

herewasmike commented Mar 9, 2026

Uh oh!

codecov-commenter commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

nutanix-Hrushikesh commented Mar 10, 2026

Uh oh!

johnugeorge commented Mar 15, 2026

Uh oh!

herewasmike commented Mar 15, 2026

Uh oh!

herewasmike commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

gcp-vertexai `/ai-gateway/blob/main/site/docs/getting-started/connect-providers/gcp-vertexai.md`

gcp-vertexai `/ai-gateway/blob/main/site/versioned_docs/version-0.4/getting-started/connect-providers/gcp-vertexai.md`

gcp-vertexai `/ai-gateway/blob/main/site/versioned_docs/version-0.5/getting-started/connect-providers/gcp-vertexai.md`

codecov-commenter commented Mar 10, 2026 •

edited

Loading