Description
The README (https://github.com/DigitalPhonetics/VoicePAT?tab=readme-ov-file#anonymization) says that to anonymize my own data I should modify the following fields in the config file:
data_dir: # path to original data in Kaldi-format for anonymization
results_dir: # path to location for all (intermediate) results of the anonymization
models_dir: # path to models location
I am wondering what exactly the Kaldi format is. I guess it refers to a text file in which each line has the format <id> <wav path>, but I just want to double-check.
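To make my guess concrete, here is a minimal sketch of how I would generate such a file. In Kaldi this file is conventionally named wav.scp; the function name write_wav_scp and the use of the file name as the utterance id are my own assumptions, and I do not know whether VoicePAT also expects other Kaldi files (such as utt2spk) in data_dir.

```python
from pathlib import Path


def write_wav_scp(wav_dir, out_path):
    """Write one '<id> <wav path>' line per .wav file and return the lines."""
    wav_dir = Path(wav_dir)
    lines = []
    # Kaldi tools generally expect the file to be sorted by utterance id.
    for wav in sorted(wav_dir.glob("*.wav"), key=lambda p: p.stem):
        # Assumption: the file name without its extension serves as the id.
        lines.append(f"{wav.stem} {wav.resolve()}")
    Path(out_path).write_text("".join(line + "\n" for line in lines))
    return lines
```

For example, write_wav_scp("my_audio", "my_data/wav.scp") would list every .wav file in the hypothetical my_audio folder, one per line.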
The README also says:
Pretrained models for this anonymization can be found at https://github.com/DigitalPhonetics/speaker-anonymization/releases/tag/v2.0 and earlier releases.
But the link contains several zip files to download, and it is unclear what should be done with them.
Would appreciate it if some more details could be provided. I totally understand this toolkit is under construction -- just raising my questions here.