Commit ff2300a

Release Triton v25.09 (#5398)
Co-authored-by: Andrew Song <songand@amazon.com>
1 parent 50236ab commit ff2300a

2 files changed, +3 -3 lines changed

available_images.md

Lines changed: 1 addition & 1 deletion
@@ -181,7 +181,7 @@ instance_type="ml.g5.12xlarge"
 available_versions = list(image_uris.config_for_framework(triton_framework)['versions'].keys())
 image_uri = image_uris.retrieve(framework=triton_framework, region=region, instance_type=instance_type, version=available_versions[0])
 ```
-3. Available versions: `25.04`, `24.09`, `24.05`, `24.03`, `24.01`, `23.12`
+3. Available versions: `25.09`, `25.04`, `24.09`, `24.05`, `24.03`, `24.01`, `23.12`

 The Sagemaker Triton inference containers are built on top of the NGC containers with SageMaker support. To identify the python version and versions for other packages please refer to the corresponding official release notes for the specific version here: https://docs.nvidia.com/deeplearning/triton-inference-server/release-notes/index.html
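
The context lines in this hunk select `available_versions[0]`, which depends on the order in which the SDK config returns its keys. A minimal sketch of picking the newest tag explicitly, assuming every tag follows the `YY.MM` pattern seen in the list; `parse_version` and `newest_version` are hypothetical helper names, not part of the SageMaker SDK:

```python
def parse_version(tag: str) -> tuple[int, int]:
    """Turn a 'YY.MM' release tag such as '25.09' into (25, 9)."""
    year, month = tag.split(".")
    return int(year), int(month)


def newest_version(versions: list[str]) -> str:
    """Return the most recent tag regardless of the order of the input list."""
    return max(versions, key=parse_version)


# Versions as listed in available_images.md after this commit.
available_versions = ["25.09", "25.04", "24.09", "24.05", "24.03", "24.01", "23.12"]
latest = newest_version(available_versions)
print(latest)  # prints 25.09
```

The result could then be passed as the `version` argument in place of `available_versions[0]`.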

release_images_inference.yml

Lines changed: 2 additions & 2 deletions
@@ -17,13 +17,13 @@ release_images:
 # has already been published. Re-released image will have minor version incremented by 1.
 2:
 framework: "triton"
-version: "25.04"
+version: "25.09"
 arch_type: "x86"
 inference:
 device_types: [ "cpu", "gpu" ]
 python_versions: [ "py312" ]
 os_version: "ubuntu24.04"
-cuda_version: "cu129"
+cuda_version: "cu130"
 example: False # [Default: False] Set to True to denote that this image is an Example image
 disable_sm_tag: True # [Default: False] Set to True to prevent SageMaker Abbreviated Tags from being attached
 # to images being published.
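
Both files in this commit have to name the same release. A rough sketch of a consistency check, assuming the two changed lines are already in memory; `listed_versions`, `doc_line`, and `release_entry` are hypothetical names for illustration, and a real check would read the files from the repository:

```python
import re


def listed_versions(markdown_line: str) -> list[str]:
    """Extract every backtick-quoted 'YY.MM' tag from a markdown line."""
    return re.findall(r"`(\d{2}\.\d{2})`", markdown_line)


# Hypothetical in-memory copies of the lines changed by this commit.
doc_line = "3. Available versions: `25.09`, `25.04`, `24.09`, `24.05`, `24.03`, `24.01`, `23.12`"
release_entry = {"framework": "triton", "version": "25.09", "cuda_version": "cu130"}

versions = listed_versions(doc_line)
is_documented = release_entry["version"] in versions
print(is_documented)  # prints True
```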
