Catalogue

brendan-busch edited this page Aug 3, 2023 · 12 revisions

CSIEM Data Catalogue

All data entering the system is recorded in the csiem_data_catalogue.xlsx spreadsheet, which can be found in the data-lake directory of the csiem-data GitHub repository. A summary overview diagram is shown below.

[Diagram: CSIEM Data Catalogue summary overview]

A full outline of the latest CSIEM data catalogue is included below, subdivided into priorities.

  • High Priority: Data that has not yet been uploaded into the data-lake folder structure and is required to drive the model.
  • Medium Priority: Data that has not yet been uploaded into the data-lake folder structure and is required to validate the model.
  • Low Priority: Data that has not yet been uploaded into the data-lake folder structure and is required to add context to other data sources.
  • Import Required: Data that has been uploaded into the lake but not yet ingested into the warehouse.
  • Import Finalised: Data that has been ingested into the warehouse and may require review.
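The five categories above can be read as a simple decision rule: warehouse ingestion is checked first, then upload to the lake, and only data that is still outstanding is ranked by what the model needs it for. The following is a minimal sketch of that rule, not the project's actual tooling. The `CatalogueEntry` fields, the `Role` enum, and the `catalogue_status` function are all hypothetical names introduced for illustration and are not columns of csiem_data_catalogue.xlsx.

```python
from dataclasses import dataclass
from enum import Enum


class Role(Enum):
    """What a data source is needed for (hypothetical labels)."""
    DRIVE = "drive"        # required to drive the model
    VALIDATE = "validate"  # required to validate the model
    CONTEXT = "context"    # adds context to other data sources


@dataclass
class CatalogueEntry:
    # Hypothetical fields for illustration only.
    program_code: str
    role: Role
    in_lake: bool = False       # uploaded into the data-lake folder structure
    in_warehouse: bool = False  # ingested into the warehouse


def catalogue_status(entry: CatalogueEntry) -> str:
    """Return the catalogue category for one entry."""
    if entry.in_warehouse:
        return "Import Finalised"
    if entry.in_lake:
        return "Import Required"
    # Not yet uploaded: priority depends on how the data is used.
    return {
        Role.DRIVE: "High Priority",
        Role.VALIDATE: "Medium Priority",
        Role.CONTEXT: "Low Priority",
    }[entry.role]
```

For example, an entry with `role=Role.VALIDATE` that has not been uploaded falls under Medium Priority, and moves to Import Required once `in_lake` is set.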

High Priority

Group | Agency / Organisation | Agency ID | Program | Program Code | Date Range

Medium Priority

Group | Agency / Organisation | Agency ID | Program | Program Code | Date Range

Low Priority

Group | Agency / Organisation | Agency ID | Program | Program Code | Date Range

Import Required

Group | Agency / Organisation | Agency ID | Program | Program Code | Date Range

Import Finalised

Group | Agency / Organisation | Agency ID | Program | Program Code | Date Range
