Matt Hipsey edited this page Sep 30, 2025 · 13 revisions

CSIEM environmental data management

The goal of the CSIEM Environmental Information Management framework, as presented herein, is to enable compatibility and interoperability between critical data assets, together with the version control required to develop a comprehensive and integrated modelling platform.

The data repository is based around the following structure:

csiem-data/
171M	./code
18M	./data-governance
31M	./data-mapping
165M	./summary-images
98G	./data-lake
65G	./data-warehouse
TOTAL = 165G	

The repository is built around a framework that brings together three separate steps in the data "federation" process:

  • Data Collation
  • Data Governance & Reporting
  • Data Integration

The relationship between the various initiatives, the CSIEM environmental data management framework, and downstream model applications is outlined schematically in the image below.

CSIEM Environmental Information Management

Data Collation

The aim of the data collation step is to bring data together in a co-ordinated way. Data sourced and collated from various government agencies, researchers and industry groups is stored in a “data lake” in its raw format. Each data provider is assigned a unique agency identifier, and datasets are also grouped by the main program or initiative with which the collection was associated. Raw data is stored in a rigid folder structure based on these two identifiers:

Agency/Program/ < ... data-sets ...>
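The two-identifier folder convention can be illustrated with a minimal sketch. It assumes the Agency/Program folders sit directly beneath csiem-data/data-lake, which the page does not state explicitly. The helper names and the identifiers AGENCY1 and PROGRAM_A are hypothetical and are not taken from the repository.

```python
from pathlib import PurePosixPath

# Assumption: raw data lives directly beneath the data-lake folder of the repository.
DATA_LAKE = PurePosixPath("csiem-data/data-lake")


def dataset_dir(agency: str, program: str) -> PurePosixPath:
    """Build the folder that holds raw datasets for one agency and program."""
    for label, value in (("agency", agency), ("program", program)):
        # Each identifier must be a single, non-empty path component.
        if not value or "/" in value or value in (".", ".."):
            raise ValueError(f"invalid {label} identifier: {value!r}")
    return DATA_LAKE / agency / program


def split_identifiers(path: str) -> tuple[str, str]:
    """Recover (agency, program) from a path inside the data lake."""
    # relative_to raises ValueError if the path is not under DATA_LAKE.
    rel = PurePosixPath(path).relative_to(DATA_LAKE)
    if len(rel.parts) < 2:
        raise ValueError(f"path is not inside an Agency/Program folder: {path!r}")
    return rel.parts[0], rel.parts[1]


# Hypothetical identifiers, used for illustration only.
folder = dataset_dir("AGENCY1", "PROGRAM_A")
agency, program = split_identifiers(str(folder / "example.csv"))
```

Because both identifiers are encoded in the path, the provenance of any raw file can be recovered from its location alone, without consulting a separate index.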

CSIEM Data Collation

For more information, see The Cockburn Sound Integrated Ecosystem Model Manual.

