The code that creates the buckets is here: musubi-tuner/src/musubi_tuner/dataset/image_video_dataset.py, lines 469 to 480 at commit fec404c. With a resolution of 1024,1024 and a reso_steps of 16 (common to all models), this code generates a set of buckets. Each image is then resized and cropped to the resolution of the bucket whose aspect ratio is closest to its own.
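To make the bucket-generation step concrete, here is a minimal sketch of how such a bucket list can be derived from a target resolution and step size. This is an illustrative reimplementation, not the actual musubi-tuner code; the function name, `min_size` parameter, and exact rounding are assumptions:

```python
# Hedged sketch: enumerate aspect-ratio buckets whose area does not exceed
# max_width * max_height, with every dimension a multiple of reso_steps.
# Not the actual musubi-tuner implementation; min_size is an assumed bound.
def make_buckets(max_width=1024, max_height=1024, reso_steps=16, min_size=256):
    max_area = max_width * max_height
    buckets = set()
    w = min_size
    while w * min_size <= max_area:
        # Largest height (rounded down to a reso_steps multiple) that keeps
        # the bucket's area within the max area.
        h = (max_area // w) // reso_steps * reso_steps
        if h >= min_size:
            buckets.add((w, h))
            buckets.add((h, w))  # also register the portrait/landscape twin
        w += reso_steps
    return sorted(buckets)
```

With the defaults, this yields buckets such as (1024, 1024), (896, 1152), (1152, 896), and so on, each with area at most 1024×1024 and dimensions divisible by 16.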
@kohya-ss, when a user enables bucketing, is this what gets used automatically?
I am trying to do a full fine-tune of Qwen-Image and am currently working on dataset preparation. I have a captioned dataset of ~5 million images, mostly around 1MP each, but spanning a wide variety of dimensions (e.g. 768x1152, 1024x688, 856x1280, etc.).
I want to pick a set of common aspect-ratio buckets, group these pictures into them, and then crop each image to fit the nearest-sized AR bucket (e.g. one of (1, 1), (4, 3), (3, 4), (16, 9), (9, 16), (5, 4), (4, 5), (3, 2), (2, 3), (7, 5), (5, 7), etc.) to minimize the amount of cropping that occurs.
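The nearest-bucket assignment and cropping described above can be sketched as follows. This is a hypothetical helper, not musubi-tuner's code; the bucket list and function names are illustrative:

```python
# Hypothetical sketch: pick the bucket whose aspect ratio is closest to the
# image's, then compute a center-crop box matching that ratio.
def closest_bucket(width, height, buckets):
    ar = width / height
    return min(buckets, key=lambda b: abs(b[0] / b[1] - ar))

def center_crop_to_ar(width, height, bucket_w, bucket_h):
    """Return a (left, top, right, bottom) crop box with the bucket's aspect
    ratio, keeping as many of the original pixels as possible."""
    target_ar = bucket_w / bucket_h
    if width / height > target_ar:        # image too wide: trim left/right
        new_w = round(height * target_ar)
        left = (width - new_w) // 2
        return (left, 0, left + new_w, height)
    else:                                 # image too tall: trim top/bottom
        new_h = round(width / target_ar)
        top = (height - new_h) // 2
        return (0, top, width, top + new_h)
```

After cropping to the bucket's aspect ratio, the image would still be resized to the bucket's pixel dimensions; this sketch only covers the crop-box computation.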
I am fine to write my own code to sort the images into AR buckets, and do the cropping myself, but I am just trying to figure out how to make musubi-tuner utilize these different buckets when training? What do I set under the "resolution" variable in the TOML file? Does musubi-tuner automatically handle AR bucketing somehow for diverse datasets like mine, or will I need to write custom bucketing/cropping code?
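For reference, here is the shape of dataset TOML I have been assuming; the key names are my best reading of the repo's dataset documentation and should be verified against it, since I may have them wrong:

```toml
# Hypothetical sketch of a musubi-tuner dataset config — key names are
# assumptions; check the repo's dataset config docs before using.
[general]
resolution = [1024, 1024]   # target max area; buckets are derived from this
caption_extension = ".txt"
batch_size = 8
enable_bucket = true        # let the trainer sort images into AR buckets

[[datasets]]
image_directory = "/path/to/images"
cache_directory = "/path/to/cache"
```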
Can someone give me a detailed breakdown of how aspect ratio bucketing works in terms of training and dataset configuration? Thanks!