Skip to content

DALLE trained on FashionGen Dataset RESULTS 💯  #443

Open
@alexriedel1

Description

@alexriedel1

DALLE on FashionGen

  • I trained Dall-E + VQGAN on the FashionGen dataset (https://arxiv.org/abs/1806.08317) on Google Colab and got decent results.
  • Without the VQGAN training on the FashionGen dataset, DALLE is really bad at generating faces which makes clothing generations looking extremely strange.

Text to image generation and re-ranking by CLIP

Best 16 of 48 generations ranked by CLIP

Generations from the training set (Including their Groundtruths)

Download (5)
Download (6)
Download (7)
Download (8)
Download (4)

Generations based on custom prompts (withouttheir Groundtruths)

Download (1)
Download (2)
Download (3)
Download (9)
Download

Model specifications

VAE
Trained VQGAN for 1 epoch on Fashion-Gen dataset
Embeddings: 1024
Batch size: 5

DALLE
Trained DALLE for 1 epoch on Fashion-Gen dataset
dim = 312
text_seq_len = 80
depth = 36
heads = 12
dim_head = 64
reversible = 0
attn_types =('full', 'axial_row', 'axial_col', 'conv_like')

Optimization
Optimizer: Adam
Learning rate: 4.5e-4
Gradient Clipping: 0.5
Batch size: 7

image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions