[Future] Consider recommendations from recent HDF5 talks/studies #129

@rly

Consider recommendations from recent talks from the HDF Group and community on optimizing HDF5 data in the cloud, separating array data from metadata, and integrating with Zarr. Some of these may be relevant as background for our paper.

  1. Cloud Ready HDF5 – Matt Larson, John Readey, and Aleksandar Jelenak, The HDF Group (HUG24) (August 7, 2024): https://www.youtube.com/watch?v=2Iqv-adMF-U

    • Use larger chunk sizes (4-8 MB/chunk)
    • Use CoHDF5

    They mention that reading chunks with variable-length data can result in many small remote reads, which hurts performance.

  2. Cloud-Optimized HDF5 Files – Aleksandar Jelenak, The HDF Group (HUG23) (September 5, 2023): https://www.youtube.com/watch?v=bDH59YTXpkc
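
To make the chunk size recommendation from the first talk concrete, here is a rough sketch of how a chunk shape near the 4 MB end of the range could be picked for a dataset. The helper name `chunk_shape_for_target` is hypothetical, "MB" is treated as MiB, the size is the uncompressed size, and shrinking the leading axes first assumes reads mostly go along the first dimension. None of these choices come from the talks.

```python
from math import prod


def chunk_shape_for_target(shape, itemsize, target_bytes=4 * 1024**2):
    """Return a chunk shape whose uncompressed size is at most target_bytes.

    Hypothetical helper for illustration only. Leading axes are shrunk
    first, so trailing axes stay whole for as long as possible.
    """
    chunk = list(shape)
    for axis in range(len(chunk)):
        bytes_now = prod(chunk) * itemsize
        if bytes_now <= target_bytes:
            break  # the chunk already fits within the target
        # Bytes taken by a single index along this axis.
        slice_bytes = bytes_now // chunk[axis]
        # Keep as many indices along this axis as fit, but at least one.
        chunk[axis] = max(1, target_bytes // slice_bytes)
    return tuple(chunk)


# Example: a (100000, 512, 512) int16 dataset (2 bytes per element).
example_chunks = chunk_shape_for_target((100000, 512, 512), itemsize=2)
example_bytes = prod(example_chunks) * 2
```

With h5py, the resulting tuple could be passed as the `chunks` argument of `create_dataset`. Whether 4 MB or 8 MB works better would depend on the access pattern and compression, so this is only a starting point.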

And some other interesting talks from recent HDF5 User Group conferences:
