Skip to content

Conversation

@frutiemax92
Copy link
Contributor

This script combines the T5 embeddings and VAE features extraction and finetuning. It also supports the usual folder with images and captions inside and also the huggingface dataset with the possibility of adding the embeddings and features and push the dataset to the hub. It uses also Accelerate and the mapping of the dataset uses multiprocessing with multi-gpu capabilities. It features the basic constant learning parameters but this is easily adjustable.

@frutiemax92 frutiemax92 force-pushed the train_pixart_diffusers branch 2 times, most recently from 3d7d91e to 29b8730 Compare July 22, 2024 00:04
@frutiemax92 frutiemax92 force-pushed the train_pixart_diffusers branch from 29b8730 to c6e914e Compare July 22, 2024 00:20
@frutiemax92 frutiemax92 marked this pull request as draft July 22, 2024 01:10
@frutiemax92 frutiemax92 marked this pull request as ready for review July 22, 2024 23:06
@frutiemax92
Copy link
Contributor Author

Tested it with 2 GPUs on a docker container on runpod, it is working after some troublesome docker configuration issues.

@frutiemax92 frutiemax92 force-pushed the train_pixart_diffusers branch 2 times, most recently from 16c1a4c to bcf2194 Compare July 23, 2024 18:52
@frutiemax92 frutiemax92 marked this pull request as draft July 23, 2024 18:53
@frutiemax92 frutiemax92 force-pushed the train_pixart_diffusers branch from bcf2194 to d0cf2a3 Compare July 23, 2024 18:55
@frutiemax92 frutiemax92 marked this pull request as ready for review August 7, 2024 22:59
@frutiemax92
Copy link
Contributor Author

The code is working good now this should be useful for some people.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant