Skip to content

Conversation

@zren11
Copy link

@zren11 zren11 commented Sep 15, 2025

What this PR does / why we need it:
This is an simple Speech Recognition example for Issue 2040.

Key Features:

  • Complete end-to-end workflow in example.ipynb - from data preparation to distributed training
  • PyTorch DDP integration with Kubeflow Trainer for multi-node speech recognition
  • Transformer-based model (4 layers, 4 attention heads) for audio classification
  • Use Google Speech Command dataset - 35 words, 105K+ 1-second audio files for classification
  • Local to distributed progression - starts with local training, scales to Kubernetes

Details can be found in PR's README.md file.

Which issue(s) this PR fixes (optional, in Fixes #<issue number>, #<issue number>, ... format, will close the issue(s) when PR gets merged):
Fixes #Issue 2040 Issue 2040.

@review-notebook-app
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@google-oss-prow
Copy link

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign andreyvelich for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@zren11 zren11 changed the title Add Speech Recognition with DDP Example chore: Add Speech Recognition with DDP Example Sep 15, 2025
@coveralls
Copy link

coveralls commented Sep 15, 2025

Pull Request Test Coverage Report for Build 17773026722

Warning: This coverage report may be inaccurate.

This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.

Details

  • 0 of 0 changed or added relevant lines in 0 files are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage remained the same at 55.137%

Totals Coverage Status
Change from base Build 17772927799: 0.0%
Covered Lines: 1084
Relevant Lines: 1966

💛 - Coveralls

@zren11 zren11 force-pushed the feat/example/add-speech-recognition-with-ddp-example branch from abad5ad to 277e1b3 Compare September 16, 2025 16:17
@zren11 zren11 force-pushed the feat/example/add-speech-recognition-with-ddp-example branch from 277e1b3 to 7754621 Compare September 16, 2025 16:50
@zren11
Copy link
Author

zren11 commented Sep 19, 2025

@andreyvelich I notice the GPU E2E test failed. Is there anything I can do to fix it? Thanks!

@jaiakash
Copy link
Member

/ok-to-test

@andreyvelich
Copy link
Member

@andreyvelich I notice the GPU E2E test failed. Is there anything I can do to fix it? Thanks!

Can you rebase your PR again please ?

@@ -0,0 +1,5 @@
FROM pytorch/pytorch:2.8.0-cuda12.8-cudnn9-devel
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You can remove all of your files and keep only Jupyter Notebooks.
You can check out this example for knowledge distillation BERT fine-tuning: https://github.com/kubeflow/trainer/blob/master/examples/pytorch/question-answering/fine-tune-distilbert.ipynb
In your Notebook you can reference to the getting started guide which explains how to deploy kind cluster and torch-distributed Runtime.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants