Skip to content

Guidance on compression options with AnnData.write_zarr #2107

@d33bs

Description

@d33bs

Question

Hello, thanks for the great support on this package! I was trying to figure out how compression operates for the h5ad format and also for zarr to compare them equivalently (where possible). On the docs for AnnData.write_h5ad I noticed this excerpt:

Datasets written with hdf5plugin-provided compressors cannot be opened without first loading the hdf5plugin library using import hdf5plugin. When using alternative compression filters such as zstd, consider writing to zarr format instead of h5ad, as the zarr library provides a more transparent compression pipeline.

When I navigated to the docs for AnnData.write_zarr I didn't find any arguments or guidance which could be used for compression options and the AnnData Zarr format.

  • Would you have any guidance or recommendations for compression options and AnnData.write_zarr? Please don't hesitate to point me to a link to learn more if available.
  • Could I ask for more clarification on "more transparent compression pipeline" from the AnnData.write_h5ad docs? I wasn't sure if this meant we could/should customize compression for Zarr exports outside of AnnData or if it meant "better" somehow than h5ad export compression performance (or maybe both).

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions