Question
Hello, thanks for the great support on this package! I was trying to figure out how compression operates for the `h5ad` format and also for `zarr`, so that I can compare them equivalently (where possible). In the docs for `AnnData.write_h5ad` I noticed this excerpt:
> Datasets written with hdf5plugin-provided compressors cannot be opened without first loading the hdf5plugin library using import hdf5plugin. When using alternative compression filters such as zstd, consider writing to zarr format instead of h5ad, as the zarr library provides a more transparent compression pipeline.
When I navigated to the docs for `AnnData.write_zarr`, I didn't find any arguments or guidance on compression options for the AnnData Zarr format.
- Would you have any guidance or recommendations for compression options with `AnnData.write_zarr`? Please don't hesitate to point me to a link to learn more if available.
- Could I ask for more clarification on "more transparent compression pipeline" from the `AnnData.write_h5ad` docs? I wasn't sure if this meant we could/should customize compression for Zarr exports outside of AnnData, or if it meant Zarr is somehow "better" than `h5ad` export compression performance (or maybe both).