Added the 4k model #5

Ataraksia · 2025-01-12T05:36:25Z

I noticed the 4K model hadn't been included yet, so I opted to make the necessary changes. A couple of things worth mentioning: I wasn't sure whether the P1 in SanaMS_1600M_P1_D20 represented patch size, pe interpolation, or something else entirely so I just left it as it was in the others. Additionally, my desire to be helpful didn't quite hold out long enough to manually change the resolution scaling numbers, and so I just left it to Claude 3.5 Sonnet. Obviously LLMs are not especially well suited to math, and while a cursory inspection of the numbers looks solid, you should probably double check them.

Edit: Hmm, so actually it would appear that all is not well after all. Before making this PR, I only did a single test generation with the changes, and that particular combination of seed/prompt was apparently just a fluke, because every image I've been able to generate since has ranged from subpar to unholy abomination. I'm still testing some things, and will update back if I have an luck.

Edit x 2: After a bit more testing, while I think it definitely could use some tweaking, a lot of my issues stemmed from poor Sana prompting practices due to my lack of experience with the model. After making some changes to my prompts, I've been able to get results of reasonable quality.

Ataraksia added 2 commits January 12, 2025 00:17

Include Sana_1600M_4Kpx_BF16 in list

1fd2431

Updated to include SanaMS_1600M_P1_D20_4K

a56d42b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added the 4k model #5

Added the 4k model #5

Ataraksia commented Jan 12, 2025 •

edited

Loading

Added the 4k model #5

Are you sure you want to change the base?

Added the 4k model #5

Conversation

Ataraksia commented Jan 12, 2025 • edited Loading

Ataraksia commented Jan 12, 2025 •

edited

Loading