WG Batch: add 2025 annual report#8836
Conversation
d921e24 to
6a52b2a
Compare
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: tenzen-y The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
55e9a18 to
aa7eab4
Compare
Signed-off-by: Yuki Iwai <yuki.iwai.tz@gmail.com>
aa7eab4 to
bec8bf7
Compare
| KJob provides a template-based job execution with built-in SLURM support and kubectl plugin integration. | ||
| The HPC/ML community tend to prefer CLI over YAML so the focus was to provide a templated solution for submitting batch jobs and a smooth transition for Slurm users. | ||
|
|
||
| #### KEPs |
There was a problem hiding this comment.
Should we mention the engagement of WG-Batch with workload aware scheduling?
It started last release but it will continue in 2026.
There was a problem hiding this comment.
SGTM, we might be able to add another cross sigs / wgs collaboration section. wdyt?
| - [Job Managed By](https://github.com/kubernetes/enhancements/issues/4368) | ||
| - Promoted to stable. | ||
|
|
||
| - [Gang Scheduling / Workload API](https://github.com/kubernetes/enhancements/issues/4671) |
There was a problem hiding this comment.
This one is more of wg-batch consulting.
There was a problem hiding this comment.
Does that mean making another section like consulting or collaboration?
There was a problem hiding this comment.
https://github.com/kubernetes/community/pull/8836/changes#r2794073457
I think this minor update is fine.
There was a problem hiding this comment.
+1 on making that clear by a subsection or "(collaboration with sig-scheduling)" in brackets
There was a problem hiding this comment.
lgtm with the "consulted" added
kannon92
left a comment
There was a problem hiding this comment.
LGTM
just one minor nit.
|
LGTM, thank you for driving that 👍 |
andreyvelich
left a comment
There was a problem hiding this comment.
Looks great, thanks @tenzen-y!
I left a few comments.
| - [Mutable Container Resources for Suspended Jobs](https://github.com/kubernetes/enhancements/issues/5440) | ||
| - Introduced as alpha. | ||
|
|
||
| ### Talks |
There was a problem hiding this comment.
@tenzen-y You can also add our talk where we share how JobSet and TrainJob can be used for MPI workloads on k8s: https://youtu.be/Fnb1a5Kaxgo
There was a problem hiding this comment.
Thank you for raising that, yes sure.
|
|
||
| - [Release 0.15](https://github.com/kubernetes-sigs/kueue/releases/tag/v0.15.0) | ||
|
|
||
| In 2025, the Kueue community would like to highlight Topology Aware Scheduling, MultiKueue, Admission Fair Sharing, Elastic Jobs, DRA Integration, v1beta2 API and KueueViz Dashboard. |
There was a problem hiding this comment.
Shall we highlight the Kueue integration with TrainJob and SparkApplication? I think it’s a great example of ecosystem collaboration that we should definitely mention.
There was a problem hiding this comment.
I think that we can add TrainJob, surely. But not able to add SparkApplication since it is still ongoing (not merged).
|
What a year! Thank you everyone for the collaboration and thank you Yuki for driving this effort! |
Co-authored-by: Kevin Hannon <kehannon@redhat.com>
Signed-off-by: Yuki Iwai <yuki.iwai.tz@gmail.com>
Which issue(s) this PR fixes:
Fixes #8785