Skip to content

Intake Data Catalog For LOCA2 Datasets #2

@BenGalewsky

Description

@BenGalewsky

As a user of the Illinois CLIMAS Analysis Facility I want an accurate data catalog so I can find the data I need to complete my analysis

Problem

The LOCA2 dataset, downloaded from UCSD servers sits in 27 different models. Each model has four scenarios and each scenario has several member IDs. Assembling a coherent analysis from these zarr stores spread across multiple directories is challenging.

An Intake Catalog (https://intake.readthedocs.io/en/latest/user2.html) makes it much easier to find and collect these stores for an analysis.

Approach

Define a catalog that can represent the LOCA2 dataset and allows for searches that meet some of the most common use cases. Particular priority should be given to queries needed for the Illinois Department of Health science gateway.

If possible, the catalog should dynamically created as part of the LOCA2 Dagster assets.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions