Add Kubeflow Trainer v2.2 release blog post #194

Merged
google-oss-prow[bot] merged 14 commits into kubeflow:master from xikronz:add-trainer-v2.2-release-post
Mar 24, 2026

Contributor

@xikronz xikronz commented Mar 20, 2026

This blog post (collaborative) covers the Kubeflow Trainer v2.2 release. For the full release tracking issue, see #3116

Member

@andreyvelich andreyvelich left a comment

Thank you for this, @xikronz!

Comment thread _posts/2026-3-20-introducing-kubeflow-trainer-v2.2.md Outdated
Comment thread _posts/2026-3-20-introducing-kubeflow-trainer-v2.2.md
@andreyvelich
Member

@xikronz Please sign your commit too for DCO

@andreyvelich
Member

/assign @astefanutti @jaiakash @varodrig @yashpal2104 @Fiona-Waters @robert-bell @kaisoz @Krishna-kg732 @vsoch @XploY04 @kramaranya

@google-oss-prow
Contributor

@andreyvelich: GitHub didn't allow me to assign the following users: robert-bell, Krishna-kg732, vsoch, XploY04, yashpal2104.

Note that only kubeflow members with read permissions, repo collaborators and people who have commented on this issue/PR can be assigned. Additionally, issues/PRs can only have 10 assignees at the same time.
For more information please see the contributor guide

Comment thread _posts/2026-3-20-introducing-kubeflow-trainer-v2.2.md Outdated
xikronz added 2 commits March 20, 2026 11:02
Signed-off-by: xikron <cc2864@cornell.edu>
Signed-off-by: xikron <cc2864@cornell.edu>
@xikronz xikronz force-pushed the add-trainer-v2.2-release-post branch from 58c2a2f to a99e675 Compare March 20, 2026 15:03
Co-authored-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>
Signed-off-by: Carrie Chen <139071206+xikronz@users.noreply.github.com>
@andreyvelich
Member

/ok-to-test

Co-authored-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>
Signed-off-by: Carrie Chen <139071206+xikronz@users.noreply.github.com>
Member

@andreyvelich andreyvelich left a comment

Comment thread _posts/2026-3-20-introducing-kubeflow-trainer-v2.2.md Outdated
Comment thread _posts/2026-3-20-introducing-kubeflow-trainer-v2.2.md
@yashpal2104
Contributor

yashpal2104 commented Mar 20, 2026

I am not a Kubeflow member, so I am not able to get assigned issues. Can someone make me a member?
@andreyvelich, can you add me as a Kubeflow member?

@andreyvelich
Member

I am not a Kubeflow member, so I am not able to get assigned issues. Can someone make me a member? @andreyvelich, can you add me as a Kubeflow member?

Sure, can you create a PR similar to this one: kubeflow/internal-acls#895

xikronz added 2 commits March 20, 2026 13:34
Signed-off-by: xikron <cc2864@cornell.edu>
Signed-off-by: xikron <cc2864@cornell.edu>
@xikronz xikronz force-pushed the add-trainer-v2.2-release-post branch from e28c5a0 to 94c1974 Compare March 20, 2026 17:35
Comment thread _posts/2026-3-20-introducing-kubeflow-trainer-v2.2.md Outdated
xikronz and others added 5 commits March 20, 2026 14:21
Co-authored-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>
Signed-off-by: Carrie Chen <139071206+xikronz@users.noreply.github.com>
Signed-off-by: xikron <cc2864@cornell.edu>
Signed-off-by: xikron <cc2864@cornell.edu>
Signed-off-by: xikron <cc2864@cornell.edu>
Signed-off-by: xikron <cc2864@cornell.edu>
Member

@andreyvelich andreyvelich left a comment

@google-oss-prow
Contributor

@andreyvelich: GitHub didn't allow me to assign the following users: robert-bell, VassilisVassiliadis.

Note that only kubeflow members with read permissions, repo collaborators and people who have commented on this issue/PR can be assigned. Additionally, issues/PRs can only have 10 assignees at the same time.
For more information please see the contributor guide

In response to this:

Thanks for this work @xikronz!
/lgtm
/assign @astefanutti @VassilisVassiliadis @kramaranya @Fiona-Waters @robert-bell @tenzen-y

Comment thread _posts/2026-3-20-introducing-kubeflow-trainer-v2.2.md Outdated
enabling them to treat GPUs across multiple machines as a single unified memory domain. For
large-scale training, this means significantly faster node-to-node communication compared to
standard network-based primitives and brings forth a new era of configurations that simply
weren't practical before on Kubernetes.
Member

Let's also say this

Suggested change
- weren't practical before on Kubernetes.
+ weren't practical before on Kubernetes. We are working closely with Kubernetes community to introduce first class support for Dynamic Resource Allocation (DRA) in TrainJobs.

cc @Ronkahn21

process, Trainer will choose appropriate resources automatically based on the TrainJob configuration.
This gives teams the power to plan experiments with confidence and trust that jobs use just the right
amount of compute.

Member

Can we also say something about Workload Aware Scheduling please?
kubeflow/trainer#3219

Contributor Author

done

Co-authored-by: Vanessa Sochat <814322+vsoch@users.noreply.github.com>
Co-authored-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>
Signed-off-by: Carrie Chen <139071206+xikronz@users.noreply.github.com>
@google-oss-prow google-oss-prow Bot removed the lgtm label Mar 21, 2026
@google-oss-prow
Contributor

@vsoch: changing LGTM is restricted to collaborators

xikronz added 2 commits March 21, 2026 13:49
Signed-off-by: xikron <cc2864@cornell.edu>
Signed-off-by: xikron <cc2864@cornell.edu>
Member

@andreyvelich andreyvelich left a comment

Awesome work @xikronz!
I am going to announce the Trainer v2.2 release tomorrow.
/lgtm
/approve
/hold in case others want to give more comments.

@google-oss-prow
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: andreyvelich

Member

@jaiakash jaiakash left a comment

Thanks for this 🚀

@andreyvelich
Member

/hold cancel

@google-oss-prow google-oss-prow Bot merged commit 83305b1 into kubeflow:master Mar 24, 2026
7 checks passed