
Feature request: use wit.ai speech-to-text and DeepL/OpenAI to translate it #11

@Zhen-Bo

Feature Request

Description of the feature you'd like:

Let users supply their own wit.ai and DeepL API keys for real-time speech-to-text and translation.

Feature Background:

After using it for a while, I found that translation is often delayed (interval=3~5) when using the medium model.
It also frequently produces blank results.

I don't know whether the failed translations are caused by slow speech recognition or by incorrect language detection.

Also, English is not my native language, so after receiving the English output I still need time to convert it into my native language. I would like more target languages to be supported.

Proposed Solution

  • speech-to-text: use wit.ai to convert audio files into text (see the wit.ai docs)

    • Free to use
    • Each API token is tied to a single language chosen by the user, which avoids incorrect language identification.
    • Recognition is fast and accurate.
      (I use it to solve Google reCAPTCHA voice verification, where it is very fast and accurate.)
  • translate: use DeepL or ChatGPT to translate into the user's target language

    • The DeepL free API and GPT-3.5 Turbo are free to use
    • The target language can be set by the user (for me: KO (text from wit.ai) -> ZH)
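
To make the proposal more concrete, here is a rough sketch of the two-step pipeline in Python, using only the standard library. It is not tied to this project's code: the function names, the WAV input, and the KO -> ZH defaults are my own assumptions. It also assumes that the wit.ai /speech endpoint may return several JSON objects in one response body (partial and final transcripts) and that the last non-empty "text" field is the one to keep.

```python
import json
import urllib.parse
import urllib.request

WIT_SPEECH_URL = "https://api.wit.ai/speech"
DEEPL_FREE_URL = "https://api-free.deepl.com/v2/translate"


def build_wit_request(wav_bytes, wit_token):
    """Build the POST request that sends raw WAV audio to wit.ai."""
    return urllib.request.Request(
        WIT_SPEECH_URL,
        data=wav_bytes,
        headers={
            "Authorization": f"Bearer {wit_token}",
            "Content-Type": "audio/wav",
        },
        method="POST",
    )


def last_transcript(body):
    """Return the last non-empty "text" field found in the response body.

    Assumption: the body holds one or more JSON objects back to back,
    so they are decoded one at a time instead of with json.loads().
    """
    decoder = json.JSONDecoder()
    text, pos = "", 0
    while pos < len(body):
        # raw_decode() does not skip leading whitespace, so do it here.
        while pos < len(body) and body[pos].isspace():
            pos += 1
        if pos >= len(body):
            break
        obj, pos = decoder.raw_decode(body, pos)
        if isinstance(obj, dict) and obj.get("text"):
            text = obj["text"]
    return text


def build_deepl_request(text, deepl_key, target_lang, source_lang=None):
    """Build the form-encoded POST request for the DeepL free endpoint."""
    fields = {"text": text, "target_lang": target_lang}
    if source_lang:
        fields["source_lang"] = source_lang
    return urllib.request.Request(
        DEEPL_FREE_URL,
        data=urllib.parse.urlencode(fields).encode("utf-8"),
        headers={"Authorization": f"DeepL-Auth-Key {deepl_key}"},
        method="POST",
    )


def first_translation(body):
    """Extract the translated text from a DeepL JSON response."""
    return json.loads(body)["translations"][0]["text"]


def transcribe_and_translate(wav_bytes, wit_token, deepl_key,
                             target_lang="ZH", source_lang="KO"):
    """Hypothetical glue function: audio bytes in, translated text out."""
    with urllib.request.urlopen(build_wit_request(wav_bytes, wit_token),
                                timeout=30) as resp:
        transcript = last_transcript(resp.read().decode("utf-8"))
    if not transcript:
        return ""  # nothing recognized, so skip the translation call
    request = build_deepl_request(transcript, deepl_key, target_lang, source_lang)
    with urllib.request.urlopen(request, timeout=30) as resp:
        return first_translation(resp.read().decode("utf-8"))
```

Since the wit.ai token already fixes the spoken language, the source language can be passed to DeepL explicitly instead of relying on auto-detection. Swapping DeepL for ChatGPT would only mean replacing build_deepl_request and first_translation.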
