Highlights
- Improved preprocessor performance: added a NumPy version for CPU (up to 10x speedup) and updated ONNX versions with DirectML/WebGPU support.
- Updated Silero VAD to current version and added PyAnnote VAD implementation.
- Updated benchmarks, comparisons and installation guide in docs.
- Removed PyTorch from build time dependencies.
- Added Canary 1B Flash model.
What's Changed
- Refactoring and updating model loader by @istupakov in #71, #72, #74
- Updated preprocessors, added NumPy preprocessors by @istupakov in #81, #92
- Updated deps and CI by @istupakov in #70, #76, #77, #89, #90, #97, #100, #103
- Add WeSpeaker embeddings model by @istupakov in #86
- Move content from Readme to Docs by @istupakov in #91
- Update benchmarks and comparison with original models by @istupakov in #85, #98, #105
- Update Silero VAD to v6.2 by @istupakov in #107
- Add PyAnnote VAD implementation by @FTWsGit in #96
- Fix missed logprobs in VAD recognize_batch by @istupakov in #102
- Fix Nemo Canary model decoding by @istupakov in #106
- Other small fixes by @istupakov in #83, #87, #94, #108
New Contributors
Full Changelog: v0.10.2...v0.11.0