Skip to content

Small improvements that could be made #8

@axeld5

Description

@axeld5

Hello, I am currently running moshi-finetune for a personal project (goal : checking if small amount of QA samples can enable Moshi to learn latently knowledge about a topic)

After making it work, a few notices:

  • It would be better to more emphasize the need for stereo datasets : it is currently a very small note in the repo, but is necessary to make it work ; perhaps a few additional words in the tutorial could be helpful ; perhaps even add a "check if ready" python script that verifies whether or not the dataset is completely ready? Use soundfile, check if all audios have info.channels = 2 (and if there is also a need for the frequency to be 44.1k hz, relabels the frequency)
  • Having to rm -rf the run_dir after each bugged attempt of launching the train script was tedious : could be interesting to have in command line some sort of "-r" in args that deletes the folder if present? Did I miss it?
  • Having to set-up export Cuda Visible Devices = 0 and Cuda Device Order = PCI_BUS_ID every time I relaunched my VM was also tedious : is there something I missed here? Or perhaps there is a way to set it up easier?

If you think it's interesting, I have also no problem contributing to a PR that could improve all said elements!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions