-
Couldn't load subscription status.
- Fork 1
Home
The goal of the CSIEM Environmental Information Management framework, as presented herein, is to allow compatibility, inter-operability and between crticial data assets, and version control as is required for the development of a comprehensive and integrated modelling platform.
The data repository is based around the following structure:
csiem-data/
171M ./code
18M ./data-governance
31M ./data-mapping
165M ./summary-images
98G ./data-lake
65G ./data-warehouse
TOTAL = 165G
The repository is built around a framework that brings together three separate steps in the data "federation" process:
- Data Collation
- Data Governance & Reporting
- Data Integration
The relationship between the various iniatives, the CSIEM environmental data management framework, and downstream model applications are outlined schematically in the below image.

The aim of the data collation step is to bring data together in a co-ordinated way. Data that is sourced and collated from various government agencies, researchers and industry groups is stored in a “data lake” in their raw format. Each data provider is assigned a unique agency identifier, and datasets are also grouped based on the main programs or iniatives the collection was associated with. Raw data is stored in a rigid folder structure based on these two identifiers :
Agency/Program/ < ... data-sets ...>

For more information, see the The Cockburn Sound Integrated Ecosystem Model Manual
Aquatic EcoDynamics