Skip to content

It's possible to finetune with Chinese voice database? #18

Description

@urn333

Due diligence

  • I have done my due diligence in trying to find the answer myself.

Topic

The paper

Question

I'm working on a humanoid robot project and looking into real-time speech models. I have a dataset of Chinese *.wav files, each with a corresponding JSON file. Would this be sufficient to fine-tune a model, Can Mandarin be used during inference?

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions