Feature Discussion: Audio Transcription Support #192
Replies: 1 comment
Hi there! 👋 I would strongly support this feature request as well.
Use Case: I am currently experimenting with RAG workflows involving scientific lectures and podcasts. Having a native way to ingest audio files alongside text documents within RAG-Anything would be a massive improvement. It would streamline the workflow significantly by removing the need for external transcription tools.
Regarding the dependencies: I agree that keeping the core package lightweight is important. Your proposed solution of using an optional dependency (
If the maintainers are open to it, I would love to see a PR for this feature!
Use Case
I'm building a therapy practice management system and need to process
voice notes alongside documents. Transcribing audio and feeding the text
into RAG enables searching across mixed media (PDFs + voice notes).
Implementation
I've implemented audio support following RAG-Anything's patterns:
Questions
Code is ready if you're interested. Happy to adjust based on feedback.
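For readers following the thread, the optional-dependency idea discussed here could look roughly like the sketch below. It is only an illustration, not the code offered above and not RAG-Anything's actual API: `TextBlock`, `audio_to_text_block`, `load_whisper_transcriber`, and the extension list are hypothetical names, and the choice of the `openai-whisper` package as the transcription backend is an assumption. The import is deferred so the core package stays lightweight and only users who process audio need the extra install.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, Optional

# Hypothetical list of extensions treated as audio in this sketch.
AUDIO_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".ogg"}


@dataclass
class TextBlock:
    """Hypothetical container for transcribed text (not a RAG-Anything type)."""
    source: str
    text: str


def load_whisper_transcriber(model_name: str = "base") -> Callable[[str], str]:
    """Build a transcriber backed by the optional openai-whisper package."""
    try:
        import whisper  # optional dependency, imported only when needed
    except ImportError as exc:
        raise ImportError(
            "Audio support needs the optional 'openai-whisper' package "
            "(pip install openai-whisper)."
        ) from exc
    model = whisper.load_model(model_name)
    # transcribe() returns a dict whose "text" key holds the full transcript.
    return lambda path: model.transcribe(path)["text"].strip()


def audio_to_text_block(
    path: str, transcribe: Optional[Callable[[str], str]] = None
) -> TextBlock:
    """Transcribe one audio file into a text block ready for text indexing."""
    if Path(path).suffix.lower() not in AUDIO_EXTENSIONS:
        raise ValueError(f"Not a supported audio file: {path}")
    if transcribe is None:
        # Fall back to Whisper only if the caller did not inject a transcriber.
        transcribe = load_whisper_transcriber()
    return TextBlock(source=path, text=transcribe(path))
```

Because the transcriber is passed in as a plain callable, a different backend (or a stub in tests) can be swapped in without touching the rest of the pipeline, and the resulting text can be indexed the same way as text extracted from PDFs.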