TransducerRecognizer support by rdcm · Pull Request #98 · thewh1teagle/sherpa-rs

rdcm · 2025-05-04T19:37:32Z

Hi! I'm currently migrating our speech-to-text service from Python to Rust. Our service relies on the from_transducer API from the original library, and it would be great to have the same API available in sherpa-rs. I still need to add a test, which might take a bit more time. In the meantime, if you have any comments or suggestions regarding the PR, I'm happy to discuss and make changes as needed.

TODO:
~~- tests~~ (done)

Run tests:

cargo test transducer

Prerequisites:

zipformer2 model

rdcm · 2025-05-05T12:21:10Z

@thewh1teagle test added, pr ready for review

thewh1teagle · 2025-05-09T16:00:14Z

Hi @rdcm
Can you omit the test and add example instead?
Let me know then if the example works :)

thewh1teagle · 2025-05-09T16:00:26Z

Follow the other examples template you can copy paste and change a bit

rdcm · 2025-05-09T16:45:15Z

@thewh1teagle done, example added :)

thewh1teagle · 2025-05-10T23:22:24Z

Two things I noticed:
(1) when filling sherpa structs, instead of filling every inner struct with nulls we can use mem::zero<_>() it's very cool pattern I used in other files, this way we don't need many structs, the downside is that we need to create the main struct in unsafe block, but it's not big deal imo

(2) in the example instead of filling nulls, you can use ..Default::default() like in other examples

thewh1teagle · 2025-05-10T23:22:56Z

Feel free to leave it as is, I'll merge it anyway soon : )

rdcm · 2025-05-11T20:38:16Z

@thewh1teagle

Interesting approach, but not used across whole project, for example in whisper.rs and zipformer.rs nulls passed to structs explicitly - done.
Done.

Example still works fine :)

rdcm · 2025-05-11T21:30:18Z

Also added more detailed example.

thewh1teagle · 2025-05-12T00:35:04Z

Looks great. Thank you!

rdcm · 2025-06-18T20:51:50Z

examples/transducer_vosk.rs

+    }
+
+    let config = TransducerConfig {
+        decoder: "decoder.onnx".to_string(),


@thewh1teagle Hi, do you know if there is any way to pass &[u8] instead of a file path to the model?
Currently, we use a workaround with a memfd_create call to get a file descriptor for the memory and then pass the path to that descriptor.

However, ONNX Runtime provides a more convenient API, for example:

Session::builder() .... .commit_from_memory(model);

rdcm added 5 commits May 4, 2025 21:49

ignore

00aebae

transducer recognizer

07aa3ac

test stub

5768ed9

fix test name

ccc9a41

add test

0c34d7f

add example instead test

7f2a6fc

cleanup .gitignore

8c0aa8c

review fixes

fa17d98

vosk example added

f398854

thewh1teagle merged commit 39b5cb7 into thewh1teagle:main May 12, 2025
3 checks passed

rdcm commented Jun 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TransducerRecognizer support#98

TransducerRecognizer support#98
thewh1teagle merged 9 commits intothewh1teagle:mainfrom
rdcm:from_transducer_example

rdcm commented May 4, 2025 •

edited

Loading

Uh oh!

rdcm commented May 5, 2025

Uh oh!

thewh1teagle commented May 9, 2025

Uh oh!

thewh1teagle commented May 9, 2025

Uh oh!

rdcm commented May 9, 2025

Uh oh!

thewh1teagle commented May 10, 2025

Uh oh!

thewh1teagle commented May 10, 2025

Uh oh!

rdcm commented May 11, 2025

Uh oh!

rdcm commented May 11, 2025

Uh oh!

thewh1teagle commented May 12, 2025

Uh oh!

Uh oh!

rdcm Jun 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rdcm commented May 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rdcm commented May 5, 2025

Uh oh!

thewh1teagle commented May 9, 2025

Uh oh!

thewh1teagle commented May 9, 2025

Uh oh!

rdcm commented May 9, 2025

Uh oh!

thewh1teagle commented May 10, 2025

Uh oh!

thewh1teagle commented May 10, 2025

Uh oh!

rdcm commented May 11, 2025

Uh oh!

rdcm commented May 11, 2025

Uh oh!

thewh1teagle commented May 12, 2025

Uh oh!

Uh oh!

rdcm Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rdcm commented May 4, 2025 •

edited

Loading