Releases: istupakov/onnx-asr

onnx-asr v0.11.0

23 Mar 02:30

Highlights

  • Improved preprocessor performance: added a NumPy implementation for CPU (up to 10x speedup) and updated the ONNX implementations with DirectML/WebGPU support.
  • Updated Silero VAD to the current version and added a PyAnnote VAD implementation.
  • Updated the benchmarks, comparisons and installation guide in the docs.
  • Removed PyTorch from build time dependencies.
  • Added Canary 1B Flash model.

Full Changelog: v0.10.2...v0.11.0

onnx-asr v0.10.2

18 Jan 19:55

What's Changed

  • Update build, CI, tests and dependencies by @istupakov in #66, #67, #69
  • Add Docs via Material for MkDocs and GitHub Pages, update docstrings, Readme and CI by @istupakov in #68

Full Changelog: v0.10.1...v0.10.2

onnx-asr v0.10.1

30 Dec 22:17

Full Changelog: v0.10.0...v0.10.1

onnx-asr v0.10.0

26 Dec 04:51

Highlights

  • Improved TensorRT support: 10x speedup on GigaAM v2/v3 CTC and 3x speedup on Parakeet TDT v2/v3!
  • Optimized the default ONNX configuration to automatically exclude unsupported providers.
  • Installation from source has been simplified: you can now just use pip or uv.
  • You can now specify the folder that the model is downloaded to.
  • The cpu_preprocessing option is deprecated and ignored; use preprocessor_config and resampler_config instead.
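
For callers that still pass cpu_preprocessing, the migration described in the last item can be sketched as follows. This is a minimal illustration, not code from onnx-asr: migrate_load_model_kwargs is a hypothetical helper, and the keyword dictionary stands in for whatever arguments a caller previously passed to load_model.

```python
import warnings
from typing import Any


def migrate_load_model_kwargs(kwargs: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of kwargs without the deprecated cpu_preprocessing key.

    Hypothetical helper for illustration; it is not part of onnx-asr.
    """
    migrated = dict(kwargs)  # copy so the caller's dict is left untouched
    if "cpu_preprocessing" in migrated:
        migrated.pop("cpu_preprocessing")
        warnings.warn(
            "cpu_preprocessing is deprecated and ignored; "
            "use preprocessor_config and resampler_config instead",
            DeprecationWarning,
            stacklevel=2,
        )
    return migrated


# Example: keyword arguments written for a release before v0.10.0.
old_kwargs = {"cpu_preprocessing": True, "quantization": "int8"}

# Record the warning instead of printing it, so the example runs quietly.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    new_kwargs = migrate_load_model_kwargs(old_kwargs)
```

The cleaned dictionary can then be extended with preprocessor_config or resampler_config entries as needed.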

Full Changelog: v0.9.1...v0.10.0

onnx-asr v0.9.1

08 Dec 00:01

What's Changed

  • Added separate ONNX options for preprocessor and resampler (preprocessor_config and resampler_config) by @istupakov in #48
  • Added concurrent processing in preprocessor (set max_concurrent_workers in preprocessor_config) by @istupakov in #47, #48
  • Fixed VAD for 8 kHz models (t-tech/t-one) by @istupakov in #46
  • CTC decoding optimization by @istupakov in #47
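
As a rough sketch of the concurrency setting above, the snippet below builds a preprocessor_config dictionary containing max_concurrent_workers. The helper names build_preprocessor_config and load_with_concurrency are hypothetical, and the assumption that preprocessor_config is passed to load_model as a plain dictionary keyword argument is inferred from these notes rather than confirmed by them.

```python
import os
from typing import Any, Optional


def build_preprocessor_config(max_workers: Optional[int] = None) -> dict[str, Any]:
    """Build a preprocessor_config dict (hypothetical helper, not part of onnx-asr).

    Falls back to the number of CPUs when max_workers is not given.
    """
    if max_workers is None:
        max_workers = os.cpu_count() or 1
    if max_workers < 1:
        raise ValueError("max_workers must be at least 1")
    return {"max_concurrent_workers": max_workers}


def load_with_concurrency(model_name: str, max_workers: Optional[int] = None):
    """Load a model with a concurrent preprocessor.

    Assumes load_model accepts preprocessor_config as a keyword argument.
    """
    import onnx_asr  # imported lazily so the helper above works without it

    return onnx_asr.load_model(
        model_name,
        preprocessor_config=build_preprocessor_config(max_workers),
    )
```

For example, load_with_concurrency("nemo-parakeet-tdt-0.6b-v3", 4) would request four preprocessor workers, provided the assumptions above hold for the installed version.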

Full Changelog: v0.9.0...v0.9.1

onnx-asr v0.9.0

04 Dec 18:50

Full Changelog: v0.8.0...v0.9.0

onnx-asr v0.8.0

27 Nov 00:36

Highlights

  • Added support for new GigaAM v3 models, including E2E versions with punctuation and text normalization!

Full Changelog: v0.7.0...v0.8.0

onnx-asr v0.7.0

16 Aug 22:16

Highlights

  • Newly supported model nemo-parakeet-tdt-0.6b-v3 for Nvidia Parakeet TDT 0.6B V3, the new multilingual Parakeet model from Nvidia!
  • Performance optimization for GigaAM v2 RNN-T and WhisperHf models.
  • Added performance benchmarks in Readme (on Arm, Intel, and Nvidia T4).

Full Changelog: v0.6.1...v0.7.0

onnx-asr v0.6.1

24 May 16:18

What's Changed

  • New supported sample rates (24 and 32 kHz)
  • Set cpu_preprocessing to true by default
  • Preprocessors optimization by @istupakov in #10
  • Update Readme by @istupakov in #12

Full Changelog: v0.6.0...v0.6.1

onnx-asr v0.6.0

10 May 17:41

Highlights

  • Newly supported model nemo-parakeet-tdt-0.6b-v2 for Nvidia Parakeet TDT 0.6B V2 (en), the best English model on the open_asr_leaderboard at the time of this release
  • load_model now supports loading from any Hugging Face repository with a compatible config.json
  • All models now have a quantized version (quantization="uint8" for onnx-community models and quantization="int8" for others)
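
The quantization rule in the last item can be sketched as below. The helper names default_quantization and load_quantized are hypothetical, and detecting onnx-community models by the "onnx-community/" prefix of the repository id is an assumption made for illustration; only load_model and its quantization argument come from these notes.

```python
def default_quantization(model: str) -> str:
    """Pick the quantization name for a model (hypothetical helper).

    Assumes onnx-community models are identified by their repository prefix.
    """
    return "uint8" if model.startswith("onnx-community/") else "int8"


def load_quantized(model: str):
    """Load the quantized variant of a model via onnx_asr.load_model."""
    import onnx_asr  # imported lazily so the helper above works without it

    return onnx_asr.load_model(model, quantization=default_quantization(model))
```

With this rule, "onnx-community/whisper-tiny" (used here only as an example repository id) maps to "uint8", while "nemo-parakeet-tdt-0.6b-v2" maps to "int8".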

What's Changed

  • Improved typing, updated CLI by @istupakov in #5
  • Optimized preprocessors on GPU and added cpu_preprocessing option to load_model by @istupakov in #6
  • Added support for Nemo TDT models by @istupakov in #4
  • Added config.json support by @istupakov in #8
  • Improved tests and typings by @istupakov in #9
  • [Breaking Changes] Renamed model type whisper-hf to whisper (for compatibility with config.json from onnx-community)

Full Changelog: v0.5.0...v0.6.0