Skip to content

Commit d47ab5f

Browse files
skamenan7leseb
authored andcommitted
Add Bedrock provider support to inference configuration
Added remote::bedrock provider to both build.yaml and run.yaml with AWS configuration options including credentials, region, retry settings, and connection timeouts.
1 parent 4d50ebe commit d47ab5f

File tree

3 files changed

+15
-0
lines changed

3 files changed

+15
-0
lines changed

distribution/Containerfile

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -8,6 +8,7 @@ RUN pip install sqlalchemy # somehow sqlalchemy[asyncio] is not sufficient
88
RUN pip install \
99
aiosqlite \
1010
autoevals \
11+
boto3 \
1112
chardet \
1213
datasets \
1314
fastapi \

distribution/build.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,7 @@ distribution_spec:
44
providers:
55
inference:
66
- provider_type: remote::vllm
7+
- provider_type: remote::bedrock
78
- provider_type: inline::sentence-transformers
89
vector_io:
910
- provider_type: inline::milvus

distribution/run.yaml

Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,19 @@ providers:
1919
max_tokens: ${env.VLLM_MAX_TOKENS:=4096}
2020
api_token: ${env.VLLM_API_TOKEN:=fake}
2121
tls_verify: ${env.VLLM_TLS_VERIFY:=true}
22+
- provider_id: bedrock-inference
23+
provider_type: remote::bedrock
24+
config:
25+
aws_access_key_id: ${env.AWS_ACCESS_KEY_ID:=}
26+
aws_secret_access_key: ${env.AWS_SECRET_ACCESS_KEY:=}
27+
aws_session_token: ${env.AWS_SESSION_TOKEN:=}
28+
region_name: ${env.AWS_DEFAULT_REGION:=}
29+
profile_name: ${env.AWS_PROFILE:=}
30+
total_max_attempts: ${env.AWS_MAX_ATTEMPTS:=}
31+
retry_mode: ${env.AWS_RETRY_MODE:=}
32+
connect_timeout: ${env.AWS_CONNECT_TIMEOUT:=60}
33+
read_timeout: ${env.AWS_READ_TIMEOUT:=60}
34+
session_ttl: ${env.AWS_SESSION_TTL:=3600}
2235
- provider_id: sentence-transformers
2336
provider_type: inline::sentence-transformers
2437
config: {}

0 commit comments

Comments
 (0)