10 resnet blocks, w/ semantically similar dataset reaches 1.5bpsp #13
Open
Description
i have a private dataset of 300k semantically similar images. with the default model (5 resnet blocks, default settings. trained on gtx1060) i was able to achieve 1.75bpsp.
i also made a model with 10 resnet blocks and a 256px crop size, and it has achieved 1.44bpsp. i think it could go further but my
gpu's are now working on something else.
note: my eval dataset consists of only 6 images.
when trying to compress a large image, it quickly runs out of memory. id like to modifiy this code to chop large images into smaller parts capable of fitting into the GPU.
this project is super cool thanks much!
Metadata
Assignees
Labels
No labels