Skip to content

feat: update sherpa-onnx to v1.12.23 with new ASR models#127

Open
zonghaoyuan wants to merge 2 commits into
thewh1teagle:mainfrom
zonghaoyuan:update-sherpa-v1.12.23
Open

feat: update sherpa-onnx to v1.12.23 with new ASR models#127
zonghaoyuan wants to merge 2 commits into
thewh1teagle:mainfrom
zonghaoyuan:update-sherpa-v1.12.23

Conversation

@zonghaoyuan
Copy link
Copy Markdown

Summary

Update sherpa-onnx from v1.12.9 to v1.12.23 and add support for three new ASR recognizer models.

Changes

1. Update sherpa-onnx to v1.12.23

  • Update submodule to v1.12.23
  • Update checksums for new release binaries
  • Update ZipVoice TTS field names to match new API:
    • text_modelencoder
    • flow_matching_modeldecoder
    • pinyin_dictlexicon

2. Add new ASR recognizer modules

Module Description Feature Flag
Omnilingual 1600+ language ASR support with auto language detection omnilingual
MedASR Google medical speech recognition medasr
FunASR Nano ASR with LLM support (Qwen3-0.6B based) funasr-nano

3. Files changed

  • crates/sherpa-rs-sys/sherpa-onnx - Submodule updated to v1.12.23
  • crates/sherpa-rs-sys/dist.json - Version updated
  • crates/sherpa-rs-sys/checksum.txt - Checksums updated for new binaries
  • crates/sherpa-rs/src/omnilingual.rs - New Omnilingual recognizer
  • crates/sherpa-rs/src/medasr.rs - New MedASR recognizer
  • crates/sherpa-rs/src/funasr_nano.rs - New FunASR Nano recognizer
  • crates/sherpa-rs/src/tts/zipvoice.rs - Updated field names
  • examples/omnilingual.rs - Example usage
  • examples/medasr.rs - Example usage
  • examples/funasr_nano.rs - Example usage

Testing

All three new models have been tested successfully:

  • FunASR Nano: Japanese audio transcription ✓
  • Omnilingual: Japanese audio transcription ✓
  • Dolphin (existing): Japanese audio transcription ✓

Breaking Changes

  • ZipVoice TTS: Field names changed in ZipVoiceConfig:
    • text_modelencoder
    • flow_matching_modeldecoder
    • pinyin_dictlexicon

🤖 Generated with Claude Code

zonghaoyuan and others added 2 commits January 16, 2026 17:06
- Update submodule to v1.12.23
- Add new ASR model config fields: omnilingual, medasr, funasr_nano
- Update ZipVoice TTS field names: text_model->encoder, flow_matching_model->decoder, pinyin_dict->lexicon
- Update checksums for new release binaries

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Add three new ASR recognizer modules for sherpa-onnx v1.12.23:
- Omnilingual: 1600+ language ASR support
- MedASR: Google medical speech recognition
- FunASR Nano: ASR with LLM support

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant