langzizhixin/wav2lip384x384


🔥 wav2lip384x384 is our LangXin_V1

This is a project about talking faces. We use 384x384 facial images for training, which can generate 720p, 1080p, 2K, and 4K digital human videos. We have done the following work:

  1. Added video-cutting code.
  2. Added filelist-generation code.
  3. Trained on 1000 speakers, 50 hours of footage, and over 50000 clips of data.
  4. Open-sourced discriminator checkpoints at 150000, 700000, and 1000000 steps, with val_loss of 0.36, 0.33, and 0.28 respectively.
  5. Open-sourced generator checkpoints from 300000 to 800000 steps, with val_loss values from 0.35 to 0.29. They perform very well and are recommended for use; they can also be loaded for further training.
  6. Generators trained for over 500000 steps surpass all open-source projects on the market in direct inference quality and have reached a basic commercial level.
  7. We released our best discriminator checkpoint; load the pre-trained weights to make subsequent training easier. Many people have loaded our color_checkpoints and final_checkpionts for training and achieved good results, especially for profile and occlusion problems: it is only necessary to load a relevant dataset and continue training.
  8. The wav2lip high-definition algorithm series cannot achieve high fidelity of faces and teeth, and training is relatively difficult, so it does not adapt well to current commercial needs. We have therefore moved our commercial digital humans to new algorithms such as diffusion.
  9. Friends who want to train the wav2lip high-definition series, please think carefully before starting.
  10. If you want better inference results, refer to our demo videos for guidance on shooting footage.
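Items 1-2 above mention the added video-cutting and filelist-generation code. As a minimal sketch of what filelist generation can look like for a Wav2Lip-style preprocessed dataset (the `speaker/clip` directory layout, function name, and split ratio are assumptions for illustration, not this repo's actual code):

```python
import os
import random

def write_filelists(data_root, out_dir, val_frac=0.05, seed=0):
    """Collect speaker/clip entries and write train.txt / val.txt splits."""
    clips = sorted(
        f"{spk}/{clip}"
        for spk in os.listdir(data_root)
        if os.path.isdir(os.path.join(data_root, spk))
        for clip in os.listdir(os.path.join(data_root, spk))
    )
    random.Random(seed).shuffle(clips)  # deterministic shuffle for reproducible splits
    n_val = max(1, int(len(clips) * val_frac))
    os.makedirs(out_dir, exist_ok=True)
    for name, subset in (("val", clips[:n_val]), ("train", clips[n_val:])):
        with open(os.path.join(out_dir, f"{name}.txt"), "w") as f:
            f.write("\n".join(subset) + "\n")
    return len(clips) - n_val, n_val
```

Each line in the resulting `train.txt`/`val.txt` is a `speaker/clip` path, which the training data loader can resolve against the preprocessed frames and audio.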

🏗️ wav2lip-384x384 Project status

Video | Project Page | Code

Checkpoints for wav2lip384x384: https://pan.baidu.com/s/1NiSEdrlRVZM_6SD4Igdtlg?pwd=lzzx
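If you load these checkpoints to continue training (item 7 above), note that weights saved from a model wrapped in `nn.DataParallel` carry a `module.` prefix on every parameter name; the original Rudrabha/Wav2Lip code strips this prefix when loading. A minimal sketch of that step (the checkpoint path, key name, and model class in the usage comment are assumptions, not verified against this repo):

```python
def strip_module_prefix(state_dict):
    """Remove the 'module.' prefix that nn.DataParallel adds to parameter names."""
    return {k.replace("module.", "", 1): v for k, v in state_dict.items()}

# Usage sketch (requires torch and this repo's model code; names are hypothetical):
# import torch
# from models import Wav2Lip
# model = Wav2Lip()
# ckpt = torch.load("checkpoints/wav2lip384_gen.pth", map_location="cpu")
# model.load_state_dict(strip_module_prefix(ckpt["state_dict"]))
```

Without this normalization, `load_state_dict` raises key-mismatch errors when a DataParallel-saved checkpoint is loaded into a bare model (or vice versa).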

📊 The following pictures compare results from the generator trained for 500000 steps.

🎬 Demo

| Original video | Lip-synced video |
| --- | --- |
| input-001.mp4 | output-001.mp4 |
| input-002.mp4 | output-002.mp4 |
| input-003.mp4 | output-003.mp4 |
| input-004.mp4 | output-004.mp4 |

📑 Open-source Plan

For the wav2lip series, we will continue to train and release higher-definition weights in the future. The plan is as follows: pre-training checkpoints for wav2lip_288x288 will be released in January 2025; pre-training checkpoints for wav2lip_384x384 will be released in February 2025; pre-training checkpoints for wav2lip_576x576 or 512x512 will be released after June 2025.

  • color_checkpoints
  • final_checkpionts
  • Dataset processing pipeline
  • Training method
  • Advanced inference
  • Real-time inference
  • Higher-definition commercial checkpoints

🙏 Citing

Thank you to the authors of the following three projects for their wonderful work:

https://github.com/primepake/wav2lip_288x288

https://github.com/nghiakvnvsd/wav2lip384

https://github.com/Rudrabha/Wav2Lip

📖 Disclaimers

This repository was made by langzizhixin of Langzizhixin Technology, Chengdu, China, on 2025.1.30. The above code and weights may only be used for personal/research/non-commercial purposes. In particular, for the digital human video models in this repository, commercial use requires authorization from the people depicted in the models. If you need a higher-definition model, please contact us by email at [email protected], [email protected], or [email protected], or add our WeChat for communication: langzizhixinkeji
