Real-time audio transcription with OpenAI's Whisper #219

This issue tracks support in the AES67 daemon for real-time transcription of audio streams using [OpenAI's Whisper](https://github.com/openai/whisper), integrated through [whisper.cpp](https://github.com/ggml-org/whisper.cpp), a high-performance C/C++ implementation of Whisper inference.
The transcription feature enables speech-to-text conversion of audio from the daemon's configured Sinks with good robustness and accuracy, making it a valuable addition for multimedia and broadcast applications.
The feature is integrated using a multi-threaded architecture, so the daemon maintains robust performance in multi-sink setups.
See branch [asr-whisper](https://github.com/bondagit/aes67-linux-daemon/tree/asr-whisper)
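
To illustrate what a multi-threaded, per-sink design could look like, here is a minimal sketch in which each Sink gets its own worker thread and queue, so slow inference on one Sink never blocks audio capture or other Sinks. This is not the code from the asr-whisper branch: `SinkTranscriber`, `AudioChunk`, and `TranscribeFn` are hypothetical names, and the transcription step is an injected callback that, in a real build, would presumably wrap a whisper.cpp inference call such as `whisper_full`.

```cpp
#include <cassert>
#include <condition_variable>
#include <functional>
#include <mutex>
#include <queue>
#include <string>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical types for illustration; not taken from the daemon sources.
using AudioChunk = std::vector<float>;  // mono PCM samples
using TranscribeFn = std::function<std::string(const AudioChunk&)>;

// One instance per Sink: owns a queue of audio chunks and a worker thread.
class SinkTranscriber {
 public:
  explicit SinkTranscriber(TranscribeFn transcribe)
      : transcribe_(std::move(transcribe)) {
    assert(transcribe_ && "a transcription callback is required");
    worker_ = std::thread([this] { run(); });
  }

  ~SinkTranscriber() { stop(); }

  // Called from the audio path: only enqueues, never runs inference.
  void push(AudioChunk chunk) {
    {
      std::lock_guard<std::mutex> lock(mutex_);
      pending_.push(std::move(chunk));
    }
    cv_.notify_one();
  }

  // Lets the worker drain queued chunks, then joins it. Safe to call twice.
  void stop() {
    {
      std::lock_guard<std::mutex> lock(mutex_);
      stopping_ = true;
    }
    cv_.notify_one();
    if (worker_.joinable()) worker_.join();
  }

  // Copy of the text produced so far, oldest first.
  std::vector<std::string> results() {
    std::lock_guard<std::mutex> lock(mutex_);
    return results_;
  }

 private:
  void run() {
    std::unique_lock<std::mutex> lock(mutex_);
    for (;;) {
      cv_.wait(lock, [this] { return stopping_ || !pending_.empty(); });
      if (pending_.empty()) return;  // stop requested and queue drained
      AudioChunk chunk = std::move(pending_.front());
      pending_.pop();
      lock.unlock();
      std::string text = transcribe_(chunk);  // slow step, no lock held
      lock.lock();
      results_.push_back(std::move(text));
    }
  }

  TranscribeFn transcribe_;
  std::mutex mutex_;
  std::condition_variable cv_;
  std::queue<AudioChunk> pending_;
  std::vector<std::string> results_;
  bool stopping_ = false;
  std::thread worker_;
};
```

In this sketch the lock is released while the callback runs, so `push` stays cheap even when transcription is slow. A real integration would also need to bound the queue and decide what to drop when inference falls behind real time.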
