Commit 03e2a7a

chore(runtimes): Bump Torch to 2.9.1 version (kubeflow#3093)
Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>
1 parent 766d970 commit 03e2a7a

6 files changed

Lines changed: 6 additions & 6 deletions

cmd/runtimes/deepspeed/requirements.txt

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 # Keep the same version as for Torch runtime.
-torch==2.7.1
+torch==2.9.1
 # DeepSpeed libraries.
 deepspeed==0.18.4
 mpi4py==4.1.1

examples/pytorch/image-classification/mnist.ipynb

Lines changed: 1 addition & 1 deletion
@@ -50,7 +50,7 @@
 "metadata": {},
 "outputs": [],
 "source": [
-"!pip install torch==2.7.1\n",
+"!pip install torch==2.9.1\n",
 "!pip install torchvision==0.22.1"
 ]
 },

hack/e2e-setup-cluster.sh

Lines changed: 1 addition & 1 deletion
@@ -93,7 +93,7 @@ kubectl apply --server-side -k manifests/overlays/runtimes || (
 )
 
 # TODO (andreyvelich): We should build runtime images before adding them.
-TORCH_RUNTIME_IMAGE=pytorch/pytorch:2.7.1-cuda12.8-cudnn9-runtime
+TORCH_RUNTIME_IMAGE=pytorch/pytorch:2.9.1-cuda12.8-cudnn9-runtime
 DEEPSPEED_RUNTIME_IMAGE=ghcr.io/kubeflow/trainer/deepspeed-runtime:latest
 
 # Load Torch runtime image in KinD

hack/e2e-setup-gpu-cluster.sh

Lines changed: 1 addition & 1 deletion
@@ -160,7 +160,7 @@ kubectl apply --server-side -k "${E2E_RUNTIMES_DIR}" || (
 )
 
 # TODO (andreyvelich): Discuss how we want to pre-load runtime images to the Kind cluster.
-TORCH_RUNTIME_IMAGE=pytorch/pytorch:2.7.1-cuda12.8-cudnn9-runtime
+TORCH_RUNTIME_IMAGE=pytorch/pytorch:2.9.1-cuda12.8-cudnn9-runtime
 ${CONTAINER_RUNTIME} pull ${TORCH_RUNTIME_IMAGE}
 load_image_to_kind ${TORCH_RUNTIME_IMAGE} ${GPU_CLUSTER_NAME}

manifests/base/runtimes/data-cache/torch_distributed_with_cache.yaml

Lines changed: 1 addition & 1 deletion
@@ -46,7 +46,7 @@ spec:
 spec:
 containers:
 - name: node
-image: pytorch/pytorch:2.7.1-cuda12.8-cudnn9-runtime
+image: pytorch/pytorch:2.9.1-cuda12.8-cudnn9-runtime
 env:
 - name: TRAIN_JOB_NAME
 valueFrom:

manifests/base/runtimes/torch_distributed.yaml

Lines changed: 1 addition & 1 deletion
@@ -22,4 +22,4 @@ spec:
 spec:
 containers:
 - name: node
-image: pytorch/pytorch:2.7.1-cuda12.8-cudnn9-runtime
+image: pytorch/pytorch:2.9.1-cuda12.8-cudnn9-runtime
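
This commit updates the same Torch version by hand in six files (a `torch==` pip pin in two of them, a `pytorch/pytorch:` image tag in the other four). A small check along the following lines could flag a file that was missed in a future bump. It is only a sketch: the function name `distinct_torch_versions` and both regular expressions are illustrative assumptions, not part of the repository, and the patterns cover only the two pin formats visible in this diff.

```python
import re

# Hypothetical patterns, written for the two pin styles seen in this diff.
# \b keeps "torch==" from matching inside a longer package name.
PIN_RE = re.compile(r"\btorch==(\d+\.\d+\.\d+)")
IMAGE_RE = re.compile(r"pytorch/pytorch:(\d+\.\d+\.\d+)-")


def distinct_torch_versions(files: dict[str, str]) -> set[str]:
    """Collect every Torch version mentioned across the given file contents.

    `files` maps a file name to its text. A result with more than one
    element means at least one file still carries a different version.
    """
    versions: set[str] = set()
    for text in files.values():
        versions.update(PIN_RE.findall(text))
        versions.update(IMAGE_RE.findall(text))
    return versions


if __name__ == "__main__":
    sample = {
        "requirements.txt": "torch==2.9.1\ndeepspeed==0.18.4\n",
        "torch_distributed.yaml": "image: pytorch/pytorch:2.9.1-cuda12.8-cudnn9-runtime\n",
    }
    found = distinct_torch_versions(sample)
    print("consistent" if len(found) == 1 else f"mismatch: {sorted(found)}")
```

The check deliberately ignores other pins such as `torchvision==0.22.1`; whether companion packages should be bumped together is a separate decision it does not try to make.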
