Anonymization dataset and model format

The README (https://github.com/DigitalPhonetics/VoicePAT?tab=readme-ov-file#anonymization) says that to anonymize my own data I should modify the following fields in the config file:
```
data_dir:    # path to original data in Kaldi-format for anonymization
results_dir: # path to location for all (intermediate) results of the anonymization
models_dir:  # path to models location
```
What exactly is the Kaldi format here? My guess is a text file in which each line has the form `<id> <wav path>`, but I want to double-check.
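
To make the question concrete: as far as I know, a Kaldi-style data directory usually contains at least `wav.scp` (`<utterance id> <wav path>`), `utt2spk` (`<utterance id> <speaker id>`) and `spk2utt` (`<speaker id> <utterance ids...>`), each sorted by its first column. Below is a minimal Python sketch of how I would generate them. The function name `write_kaldi_data_dir` and the example IDs and paths are hypothetical, and whether VoicePAT needs exactly these files (or additional ones such as `text` or `spk2gender`) is an assumption I would like confirmed.

```python
from collections import defaultdict
from pathlib import Path


def write_kaldi_data_dir(utterances, out_dir):
    """Write wav.scp, utt2spk and spk2utt for (utt_id, spk_id, wav_path) tuples.

    Hypothetical helper: the set of files VoicePAT requires is an assumption.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    # Kaldi tools expect these files to be sorted by their first column.
    utts = sorted(utterances, key=lambda u: u[0])

    # Group utterance IDs by speaker for spk2utt.
    spk2utt = defaultdict(list)
    for utt_id, spk_id, _ in utts:
        spk2utt[spk_id].append(utt_id)

    wav_lines = [f"{utt_id} {wav_path}" for utt_id, _, wav_path in utts]
    utt2spk_lines = [f"{utt_id} {spk_id}" for utt_id, spk_id, _ in utts]
    spk2utt_lines = [f"{spk} {' '.join(ids)}" for spk, ids in sorted(spk2utt.items())]

    files = [("wav.scp", wav_lines), ("utt2spk", utt2spk_lines), ("spk2utt", spk2utt_lines)]
    for name, lines in files:
        (out / name).write_text("\n".join(lines) + "\n", encoding="utf-8")
    return out


if __name__ == "__main__":
    # Example IDs and paths are placeholders, not real data.
    write_kaldi_data_dir(
        [
            ("spk1-utt1", "spk1", "/path/to/audio/spk1_utt1.wav"),
            ("spk2-utt1", "spk2", "/path/to/audio/spk2_utt1.wav"),
        ],
        "my_data",
    )
```

If this matches what the toolkit expects, I assume `data_dir` would then point at the resulting directory (`my_data` in the example).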
The README also says:

> Pretrained models for this anonymization can be found at [https://github.com/DigitalPhonetics/speaker-anonymization/releases/tag/v2.0](https://github.com/DigitalPhonetics/speaker-anonymization/releases/tag/v2.0) and earlier releases.

However, that release page contains several zip files, and it is unclear which of them I need to download and where they should be placed.

Would appreciate it if some more details could be provided. I totally understand this toolkit is under construction -- just raising my questions here.
