Change VAE model source in inference.py by zouzhekang · Pull Request #364 · bytedance/LatentSync

zouzhekang · 2026-05-19T05:52:57Z

I fine-tuned the decoder part of the VAE to enable it to produce clearer teeth and lips.

我优化了vae的解码部分，能增强牙齿和嘴唇边缘的清晰度。

I fine-tuned the decoder part of the VAE to enable it to produce clearer teeth and lips.

zouzhekang · 2026-05-19T06:00:37Z

主要是制作数据重训sd-vae-ft-mse的解码器部分，对齐方式和latentsync保持一致，原来用MSE会出现更平滑的结果，我改为CharbonnierLoss，另外对嘴部区域的loss做了局部感知加强，实测最终效果无色差和连续性问题，效果如下（仔细看牙齿和嘴唇边缘会清晰一点）：
→

Change VAE model source in inference.py

7c3902c

I fine-tuned the decoder part of the VAE to enable it to produce clearer teeth and lips.

Provide feedback