Skip to content

Add extinfo_mkngff script#75

Open
dominikl wants to merge 2 commits intoIDR:masterfrom
dominikl:extinfo_mkngff
Open

Add extinfo_mkngff script#75
dominikl wants to merge 2 commits intoIDR:masterfrom
dominikl:extinfo_mkngff

Conversation

@dominikl
Copy link
Copy Markdown
Member

Add script which sets the external info to the http:// URL for mkngff datasets.
Tested on:

idr0051 (Project): https://idr-testing.openmicroscopy.org/webclient/?show=project-552

idr0090 (Screen): https://idr-testing.openmicroscopy.org/webclient/?show=screen-2851

(installed ome/omero-zarr-pixel-buffer#13 on idr-testing, needed for http URL support)

(cc @jburel )

@dominikl dominikl requested a review from will-moore March 23, 2026 11:58
path = f"{path}/{pos}"
else:
path = get_filepaths_info(img)
path = f"{path}/0"
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

idr0091 images weren't created with bioformats2raw, so they won't need the /0.
All the other non-HCS images will need /0.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah ok thanks. Guess it needs a flag for "bioformats2raw layout" then. I'll use proper argparse then.

@will-moore
Copy link
Copy Markdown
Member

Since a couple of Screens have 400 or 500 Plates, we may want an option to do a range of plates (0-50, 50-100 etc) and/or to skip ones that have already been processed, so if processing of a Screen fails after 100 Plates, you can simply run it again and it'll skip a Plate if the first image already has externalInfo set?
Although if it fails halfway through a Plate, we'll need to repeat that whole Plate and not skip the part-completed Plate.
Just need some way to handle big Plate counts - as idr0013 and idr0016 could take a LONG time

else:
path = get_filepaths_info(img)
path = f"{path}/0"
if check(path):
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some idr0091 datasets e.g. https://idr.openmicroscopy.org/webclient/?show=dataset-11902 contain a mix of zarr and regular tiff images, so the check will fail with the tif images, which is fine as they can be ignored.

@dominikl
Copy link
Copy Markdown
Member Author

That should handle bioformats layout (/0) and non bf layout images. Also skip option to skip images with existing extinfo (would still have to iterate through all plates and images, but hopefully bit faster).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants