Built-in Domain-Specific Custom Word Packs for Transcription #360
jorgedanisc
started this conversation in
Ideas
Right now the app is not really sophisticated enough for this. If someone wants to implement the feature, that's great and I welcome PRs, but this is a free app! Post-processing was only just implemented, and the built-in word correction feature is not quite good enough, imo, to support this level of usage.
I recently saw a demo from AquaVoice here. In the video, they mention achieving better accuracy for domain-specific recordings (for example, software engineering discussions with lots of technical terms).
In this project, there is already a "Custom Words" feature where users can manually add terms they want the model to recognize. My suggestion is to extend this with built-in, toggleable word packs for common domains. For example:
Each pack would contain a curated set of relevant terms and could be updated over time as new terminology appears. Users could enable or disable these packs as needed, while still adding their own custom words on top.
This would reduce the need for users to manually build large lists (e.g., hundreds of technical terms) and would help improve accuracy when working with domain-specific recordings.
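To make the idea concrete, here is a minimal sketch of how toggleable packs could be merged with a user's own custom words. It is only an illustration under my own assumptions: the `WordPack` type, the `active_vocabulary` function, and the pack names and terms are all hypothetical and are not taken from the app's codebase.

```python
from dataclasses import dataclass


@dataclass
class WordPack:
    """Hypothetical built-in pack: a named, toggleable list of domain terms."""
    name: str
    words: list[str]
    enabled: bool = False


def active_vocabulary(packs: list[WordPack], custom_words: list[str]) -> list[str]:
    """Merge the user's custom words with the words of every enabled pack.

    Custom words come first so the user's preferred spelling wins when a
    term appears in both places. Duplicates are detected case-insensitively.
    """
    seen: set[str] = set()
    merged: list[str] = []
    pack_words = [word for pack in packs if pack.enabled for word in pack.words]
    for word in [*custom_words, *pack_words]:
        key = word.casefold()
        if key not in seen:
            seen.add(key)
            merged.append(word)
    return merged


# Example data (made up for illustration only).
packs = [
    WordPack("software-engineering", ["Kubernetes", "PostgreSQL"], enabled=True),
    WordPack("medicine", ["tachycardia"], enabled=False),
]
vocabulary = active_vocabulary(packs, ["kubernetes", "Tauri"])
print(vocabulary)
```

With this shape, the merged list could be passed to whatever already consumes the existing "Custom Words" list, so enabling a pack would not need a separate code path.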