Skip to content

Commit 88eec90

Browse files
committed
add notes in readme
1 parent fbdb088 commit 88eec90

File tree

1 file changed

+4
-1
lines changed

1 file changed

+4
-1
lines changed

README.md

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,10 +6,11 @@ WhisperS2T is an optimized lightning-fast speech-to-text pipeline tailored for t
66

77
## Benchmark and Technical Report
88

9-
Stay tuned for a technical report comparing WhisperS2T against other whisper pipelines. Meanwhile, check some quick benchmarks on A30 GPU.
9+
Stay tuned for a technical report comparing WhisperS2T against other whisper pipelines. Meanwhile, check some quick benchmarks on A30 GPU. See `scripts/` directory for the benchmarking scripts that I used.
1010

1111
![A30 Benchmark](files/benchmarks.png)
1212

13+
**NOTE:** I ran all the benchmarks with `without_timestamps` parameter as `True`. Setting `without_timestamps` as `False` may improve the WER of HuggingFace pipiline at the expense of additional inference time.
1314

1415
## Features
1516

@@ -77,6 +78,8 @@ print(out[0][0])
7778

7879
Check this [Documentation](docs.md) for more details.
7980

81+
**NOTE:** For first run the model may give slightly slower inference speed. After 1-2 runs it will give better inference speed. This is due to the JIT tracing of the VAD model.
82+
8083

8184
## Acknowledgements
8285
- [**OpenAI Whisper Team**](https://github.com/openai/whisper): Thanks to the OpenAI Whisper Team for open-sourcing the whisper model.

0 commit comments

Comments
 (0)