TraceableTTS

Traceable TTS: Toward Watermark-Free TTS with Strong Traceability

Abstract

Recent advances in Text-To-Speech (TTS) technology have enabled synthetic speech to mimic human voices with remarkable realism, raising significant security concerns. This underscores the need for traceable TTS models: systems whose synthesized speech can be traced back to the model that produced it, without compromising quality or security. However, existing methods predominantly rely on explicit watermarking of the speech or of the vocoder, which degrades speech quality and is vulnerable to spoofing. To address these limitations, we propose a novel framework for model attribution. Instead of embedding watermarks, we jointly train the TTS model and a discriminator, which significantly improves traceability generalization while preserving, and even slightly improving, audio quality. This is the first work toward watermark-free TTS with strong traceability.

Paper

arXiv:2507.03887

Acknowledgement

We would like to extend special thanks to the authors of F5-TTS, as our code base is largely borrowed from theirs.

