24 kHz or 16 kHz? #134

Open, opened on Jul 26, 2025

The paper says that all audio is resampled to 16 kHz and that the model is trained on Libri-Light. But the VAE works on 24 kHz latents?
