Finetune script using the diffusers and datasets library #137

frutiemax92 · 2024-07-21T23:51:58Z

This script combines T5 embedding extraction, VAE feature extraction, and finetuning. It supports both the usual folder of images with captions and a Hugging Face dataset, with the option of adding the embeddings and features to the dataset and pushing it to the Hub. It also uses Accelerate, and the dataset mapping uses multiprocessing with multi-GPU support. It ships with basic constant learning parameters, but these are easy to adjust.
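To illustrate the multiprocess, multi-GPU mapping idea, here is a minimal standard-library sketch of how worker ranks could be assigned to GPUs and to slices of a dataset. It is not the PR's code: `device_for_rank`, `shard_bounds`, and `plan_workers` are hypothetical names, and the round-robin device assignment and contiguous sharding are assumptions made for the example.

```python
from typing import Dict, List


def device_for_rank(rank: int, num_gpus: int) -> str:
    """Pick a device string for a worker, round-robin over the GPUs.

    Hypothetical helper: falls back to "cpu" when no GPU is available.
    """
    if num_gpus <= 0:
        return "cpu"
    return f"cuda:{rank % num_gpus}"


def shard_bounds(num_rows: int, num_workers: int, rank: int) -> range:
    """Return the contiguous row indices handled by one worker.

    Rows are split as evenly as possible; the first `num_rows % num_workers`
    workers each receive one extra row.
    """
    base, extra = divmod(num_rows, num_workers)
    start = rank * base + min(rank, extra)
    stop = start + base + (1 if rank < extra else 0)
    return range(start, stop)


def plan_workers(num_rows: int, num_workers: int, num_gpus: int) -> List[Dict]:
    """Build one plan entry (rank, device, rows) per worker process."""
    return [
        {
            "rank": rank,
            "device": device_for_rank(rank, num_gpus),
            "rows": shard_bounds(num_rows, num_workers, rank),
        }
        for rank in range(num_workers)
    ]


if __name__ == "__main__":
    # Example: 10 rows mapped by 4 worker processes sharing 2 GPUs.
    for worker in plan_workers(num_rows=10, num_workers=4, num_gpus=2):
        print(worker["rank"], worker["device"], list(worker["rows"]))
```

In a real run, each worker would load the T5 encoder or the VAE onto its assigned device and process only its own rows, so that several processes can share the available GPUs.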

frutiemax92 · 2024-07-22T23:07:27Z

Tested with 2 GPUs in a Docker container on RunPod; it works, after some troublesome Docker configuration issues.

frutiemax92 · 2024-08-07T22:59:48Z

The code is working well now; this should be useful for some people.

frutiemax92 force-pushed the train_pixart_diffusers branch 2 times, most recently from 3d7d91e to 29b8730 Compare July 22, 2024 00:04

Initial commit of finetune script using diffusers and datasets

c6e914e

frutiemax92 force-pushed the train_pixart_diffusers branch from 29b8730 to c6e914e Compare July 22, 2024 00:20

frutiemax92 marked this pull request as draft July 22, 2024 01:10

Multi-gpu support

1e5e124

frutiemax92 marked this pull request as ready for review July 22, 2024 23:06

frutiemax92 added 7 commits July 23, 2024 08:04

Use the same random seed for the bucket sampler

109100c

Optionally push the transformer to the hub

9e8862d

Fix push_transformer_to_hub

e51573c

Use set_start_method from torch

0f04599

Don't destroy process group

9c1dcc1

Debug message

55d1d44

Rank issue

df75ea7

frutiemax92 force-pushed the train_pixart_diffusers branch 2 times, most recently from 16c1a4c to bcf2194 Compare July 23, 2024 18:52

frutiemax92 marked this pull request as draft July 23, 2024 18:53

Use the same scheme for vae as t5

d0cf2a3

frutiemax92 force-pushed the train_pixart_diffusers branch from bcf2194 to d0cf2a3 Compare July 23, 2024 18:55

frutiemax92 added 10 commits July 23, 2024 15:01

Handle dataset split correctly

e382f81

Fix deadlock in t5 mapping

d47af19

Fix bucket sampling to ensure fixed batch size

7e62e1b

Force a batch size for t5 embeds and vae features

f179acd

Fix a very large timeout to let the t5 embeds and vae finish

21ae683

Add option for multiprocess for embeds and features extraction

b6a7706

Split the features extraction into another script

f203723

Fix vae mapping

adb2e31

Embeddings optimization

9ea2d28

Skip corrupted images

4741b09

frutiemax92 added 5 commits August 4, 2024 16:48

Add filtering for invalid images

a5d8dae

More fixes

a31e0f1

Fix with validation image

a0b5ba4

Flush tensors in the training loop to not create OOM

d0c084a

Disable dataset caching

e63cec4

frutiemax92 marked this pull request as ready for review August 7, 2024 22:59
