ADD VPC Endpoint Support & Model Whitelisting by weisser-dev · Pull Request #231 · aws-samples/bedrock-access-gateway

weisser-dev · 2026-03-04T14:40:28Z

Motivation

Allow deploying the proxy to environments that require custom Bedrock endpoints (e.g., VPC interface endpoints / PrivateLink) by making Bedrock service endpoints configurable at runtime.
Provide a way to restrict which Bedrock models and cross-region profiles the proxy exposes and accepts so operators can limit available models by family and region.
Allow operators to supply the whitelist via environment (inline JSON) or a JSON file for flexible deployment configurations.
Ensure the /models discovery and request validation respect the configured whitelist so non-allowed models are effectively blocked.
Description

Description

Added BEDROCK_URL and BEDROCK_RUNTIME_URL environment variables in src/api/setting.py to expose endpoint overrides.
Passed the new settings into boto3 client initialization for bedrock and bedrock-runtime using the endpoint_url parameter in src/api/models/bedrock.py.
Updated deployment artifacts to propagate the new env vars: deployment/BedrockProxy.template, deployment/BedrockProxyFargate.template, and docker-compose.yml now include BEDROCK_URL and BEDROCK_RUNTIME_URL entries.
Documented optional env vars in README.md, docs/Usage.md, and docs/Usage_CN.md with examples and comments for VPC interface endpoints.
Added environment settings MODEL_WHITELIST_FILE and MODEL_WHITELIST_JSON in src/api/setting.py to accept whitelist configuration.
Implemented _load_model_whitelist() and _is_allowed_by_whitelist() in src/api/models/bedrock.py to read and evaluate the whitelist.
Applied whitelist filtering in list_bedrock_models() so the discovered model_list is reduced to allowed models (defaults to no filtering when no whitelist is provided).
Updated docs/Usage.md with examples showing how to configure MODEL_WHITELIST_FILE/MODEL_WHITELIST_JSON and a sample model-whitelist.json schema.
Testing

Testing

Ran Python bytecode compilation over the API package with python -m compileall src/api, which completed successfully.
Try it in our K8s Cluster with 4 Replicas and on production - works fine

Add configurable Bedrock endpoint URLs via environment variables

…nfiguration Add configurable Bedrock model whitelist filtering

zxkane

Thanks for the contribution — VPC endpoint support and model whitelisting are both genuinely useful features. The design is clean and the docs are well updated.

However, there is one bug that will cause a service crash in the default deployment configuration, so requesting changes before this can be merged.

…empty-urls

weisser-dev · 2026-03-06T06:22:05Z

Thx @zxkane for the feedback, should be fixed now, since we only use VPC Endpoints I not tried it out what happens without setting them, good points from your side!

zxkane

Thanks for the contribution! The VPC endpoint support and model whitelisting features are valuable additions. However, there are some important issues to address before merging:

Critical:

The whitelist fails open on misconfiguration — invalid JSON or missing file silently falls back to exposing all models. A security control must fail closed (raise at startup).

Important:

No schema validation on whitelist JSON — typos in key names (e.g., famlies instead of families) silently allow all models through.
CloudFormation templates hardcode empty strings instead of using !Ref parameters, inconsistent with the existing pattern.
Inconsistent or None handling between URL settings and whitelist settings.

Suggestions:

Add a warning/error log when the whitelist filters out ALL models (likely misconfiguration).
Consider basic URL validation for endpoint URLs (e.g., must start with https://).

See inline comments for details and suggested fixes.

zxkane · 2026-03-10T05:49:29Z

Hi @weisser-dev, thanks for addressing the first round of feedback — the or None fix, module-level whitelist caching, and simplified startswith check all look good now.

Just a heads-up: there are 5 open review threads from the second review that still need to be addressed before this can be merged:

Critical: _load_model_whitelist() fails open on misconfiguration — invalid JSON or missing file silently allows all models through. A security control should fail closed (raise at startup).
Important: No schema validation on whitelist JSON — typos in key names (e.g., famlies) or wrong value types silently allow all models.
Important: CloudFormation templates hardcode empty strings for BEDROCK_URL/BEDROCK_RUNTIME_URL instead of using !Ref parameters, inconsistent with the existing pattern.
Important: MODEL_WHITELIST_FILE and MODEL_WHITELIST_JSON don't use the or None pattern, inconsistent with the endpoint URL settings.
Suggestion: Log at error/warning level when the whitelist filters out ALL models (likely misconfiguration).

Please check the inline comments for details and suggested fixes. Let me know if you have any questions!

…claude-sonnet-4.6 Fix Claude Sonnet 4.5/4.6 ConverseStream validation failures

…rtant-review-feedback Harden model whitelist handling and parameterize Bedrock endpoint URLs

…ror-for-qwen-model Codex-generated pull request

merge main into our branch

weisser-dev · 2026-03-17T06:59:17Z

@zxkane i also fixed some bugs like: [ERROR] Bedrock validation error for model qwen.qwen3-235b-a22b-2507-v1:0: An error occurred (ValidationException) when calling the ConverseStream operation: This model doesn't support the stopSequences field. Remove stopSequences and try again.
and fixed sonnet bugs, cause in the usage with continue the newest models seems like be broken, so these fixes are here included... I guess this is a big advantage for your users, so maybe you could approve now? it works fine on our side... even if I provide a whitelisted model list, or not.

maybe as an info - we used it with 4 pods, on k8s to have access via vpc endpoints to bedrock models for ai assisted coding... and this is used by many people very very stable...

zxkane

Thanks for addressing all the previous review feedback — the whitelist validation, fail-closed behavior, and _env_or_none pattern all look solid now.

A few new items from the latest push:

Important (2):

_safe_text silently rewrites empty/whitespace messages with proxy-injected text — this changes model behavior without the caller knowing
Whitelist only filters /models discovery but not request-time validation — a user who knows a model ID can bypass the whitelist

Minor (3):

Substring matching in NO_STOP_SEQUENCES_MODELS (low risk but worth noting)
Unrelated bug fixes (prefill models, stop sequences, safe text) bundled with VPC/whitelist feature — consider separating commits
docker-compose.yml uses ${VAR:-} (empty string) instead of ${VAR-} (unset) — subtle coupling with _env_or_none

The VPC endpoint and whitelist core implementation look good and production-ready.

zxkane · 2026-03-17T08:12:10Z

+        def _safe_text(text: str | None) -> str:
+            if text is None:
+                return ""
+            return text if text.strip() else "[empty message omitted by proxy]"
+


Important: _safe_text silently mutates user input — this can change model behavior.

If a user intentionally sends " " or "", the proxy rewrites it to "[empty message omitted by proxy]" — a visible string the model will interpret as actual content. This changes the semantics of the request without the caller knowing.

A few concerns:

The replacement text "[empty message omitted by proxy]" will be treated as a real instruction/content by the model.

text: str | None — TextContent.text should never be None per the OpenAI chat completions schema. If there's a real case causing this, it would be good to document what client/scenario produces it.

Suggestion: either pass the empty string through and let Bedrock return a validation error naturally (so the caller can fix their request), or skip the empty content part entirely instead of injecting replacement text.

def _safe_text(text: str | None) -> str: return text if text else ""

zxkane · 2026-03-17T08:12:12Z

+            # Some models reject stopSequences entirely (ValidationException)
+            if any(no_stop_model in model_lower for no_stop_model in NO_STOP_SEQUENCES_MODELS):
+                if DEBUG:
+                    logger.info(f"Skipped stopSequences for {chat_request.model} (not supported by model)")
+            else:
+                inference_config["stopSequences"] = stop


Nit: Substring matching via in is fragile.

any(no_stop_model in model_lower ...) does a substring match. If a future model ID happens to contain qwen3-235b-a22b-2507-v1:0 as a substring, it would incorrectly match.

The existing patterns elsewhere in the codebase (e.g., TEMPERATURE_TOPP_CONFLICT_MODELS) use the same substring approach, and the full versioned ID with :0 makes accidental collision unlikely — but worth being aware of.

zxkane · 2026-03-17T08:12:13Z

 # For these models, if conversation ends with assistant message (e.g., "continue response"),
 # a user message will be added to ask the model to continue
 NO_ASSISTANT_PREFILL_MODELS = {


Note: These additions are unrelated to VPC endpoint / whitelist features.

Adding claude-sonnet-4-5 and claude-sonnet-4-6 to NO_ASSISTANT_PREFILL_MODELS is a legitimate fix, but bundling unrelated bug fixes with the VPC/whitelist feature makes the PR harder to review and git bisect. Consider splitting into separate commits at minimum.

zxkane · 2026-03-17T08:12:15Z

        # In case stack not updated.
        model_list[DEFAULT_MODEL] = {"modalities": ["TEXT", "IMAGE"]}

+    if whitelist:
+        model_list = {
+            model_id: metadata
+            for model_id, metadata in model_list.items()
+            if _is_allowed_by_whitelist(model_id, whitelist)
+        }
+        if not model_list:
+            logger.error("Model whitelist filtered out ALL models. Check whitelist configuration.")
+        else:


Suggestion: Whitelist only filters /models listing, not request-time validation.

The PR description states "request validation respect the configured whitelist so non-allowed models are effectively blocked", but _parse_request() / chat() in BedrockModel does not check _MODEL_WHITELIST. A user who knows a model ID can still send a chat request to a non-whitelisted model — the whitelist is only enforced on discovery.

If the whitelist is intended as a security control (restricting which models can be invoked), consider also adding a check in _parse_request():

if _MODEL_WHITELIST and not _is_allowed_by_whitelist(chat_request.model, _MODEL_WHITELIST): raise HTTPException(status_code=403, detail=f"Model {chat_request.model} is not allowed by whitelist")

zxkane · 2026-03-17T08:12:17Z

+      - BEDROCK_URL=${BEDROCK_URL:-}
+      - BEDROCK_RUNTIME_URL=${BEDROCK_RUNTIME_URL:-}


Minor: ${BEDROCK_URL:-} sets an empty string when unset, not unset.

This works today because _env_or_none() in setting.py converts "" to None. But it's a subtle coupling — if _env_or_none were ever changed, the docker-compose default would break.

Consider using ${BEDROCK_URL-} (without :) so the env var remains unset inside the container when not defined on the host, rather than being set to an empty string. This removes the dependency on _env_or_none() for correctness.

…rors-in-models Codex-generated pull request

weisser-dev added 4 commits March 4, 2026 08:20

Add configurable Bedrock endpoint URLs via env vars

3ca6f16

Merge pull request #1 from HUK-COBURG/codex/add-vic-endpoint-support

d578baf

Add configurable Bedrock endpoint URLs via environment variables

Add model whitelist configuration for Bedrock model exposure

9fec98b

Merge pull request #2 from HUK-COBURG/codex/create-model-whitelist-co…

5e5fc72

…nfiguration Add configurable Bedrock model whitelist filtering

weisser-dev changed the title ~~ADD VPC Endpoint Support~~ ADD VPC Endpoint Support & Model Whitelisting Mar 5, 2026

zxkane requested changes Mar 6, 2026

View reviewed changes

Comment thread src/api/setting.py Outdated

Comment thread src/api/models/bedrock.py Outdated

Comment thread src/api/models/bedrock.py Outdated

weisser-dev added 2 commits March 6, 2026 07:20

Fix Bedrock endpoint env defaults and cache whitelist loading

a607cc7

Merge pull request #3 from HUK-COBURG/codex/fix-service-crash-due-to-…

13eab9e

…empty-urls

zxkane requested changes Mar 6, 2026

View reviewed changes

Comment thread src/api/models/bedrock.py

Comment thread src/api/models/bedrock.py

Comment thread deployment/BedrockProxy.template Outdated

Comment thread src/api/setting.py Outdated

Comment thread src/api/models/bedrock.py Outdated

weisser-dev added 7 commits March 16, 2026 14:52

Fix Sonnet 4.5/4.6 Bedrock validation errors

80c068a

Merge pull request #4 from HUK-COBURG/codex/investigate-error-400-in-…

3929f94

…claude-sonnet-4.6 Fix Claude Sonnet 4.5/4.6 ConverseStream validation failures

Harden whitelist loading and parameterize Bedrock endpoint URLs

fcf14b6

Merge pull request #5 from HUK-COBURG/codex/address-critical-and-impo…

bfa7203

…rtant-review-feedback Harden model whitelist handling and parameterize Bedrock endpoint URLs

Fix Bedrock Qwen stopSequences ValidationException

82b4fc7

Merge pull request #6 from HUK-COBURG/codex/fix-bedrock-validation-er…

e476d05

…ror-for-qwen-model Codex-generated pull request

Merge pull request #7 from aws-samples/main

b5a73b5

merge main into our branch

zxkane reviewed Mar 17, 2026

View reviewed changes

weisser-dev added 2 commits March 17, 2026 17:03

Harden Bedrock request validation and model compatibility

0f1716e

Merge pull request #8 from HUK-COBURG/codex/fix-bedrock-validation-er…

02f317f

…rors-in-models Codex-generated pull request

		- BEDROCK_URL=${BEDROCK_URL:-}
		- BEDROCK_RUNTIME_URL=${BEDROCK_RUNTIME_URL:-}

Conversation

weisser-dev commented Mar 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Description

Testing

Uh oh!

zxkane left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

weisser-dev commented Mar 6, 2026

Uh oh!

zxkane left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

zxkane commented Mar 10, 2026

Uh oh!

weisser-dev commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zxkane left a comment

Choose a reason for hiding this comment

Uh oh!

zxkane Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

zxkane Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

zxkane Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

zxkane Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

zxkane Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

weisser-dev commented Mar 4, 2026 •

edited

Loading

weisser-dev commented Mar 17, 2026 •

edited

Loading