Skip to content

MarineLebrec/CeNCOOS-to-Sanctuary-CR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

10 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Processing data from CeNCOOS to National Marine Sanctuary Condition Reports

This project converts ecological monitoring data stored in Darwin Core (DwC) CSV format into a standardized JSON format that can be more readily used by Sanctuary staff in their Web Condition Reports (CRs).

The example used in this repository uses data from the Multi-Agency Rocky Intertidal Network (MARINe) which has been processed into Darwin Core and published to GBIF/OBIS, and the CalOOS Data Portal.

The workflow takes user inputs about the data (e.g. the dataset name, indicator species), cleans and aggregates data, and outputs JSONs that align with reporting needs such as mean values, standard deviations, and station counts across years.

πŸ“‚ Project Structure

CeNCOOS-to-Sanctuary-CR/
β”‚
β”œβ”€β”€ helpers_new.py # Helper functions (data loading, processing, output writing)
β”œβ”€β”€ main.py # Driver script that prompts user for inputs and runs workflow
β”œβ”€β”€ JSON_outputs/ # Folder for generated JSON outputs for each indicator, separated out by Sanctuary
β”œβ”€β”€ JSON_inputs/ # Folder for input JSON files that contain information about what stations to query for a given Sanctuary
└── README.md # This file

βš™οΈ Workflow

  1. User selects input parameters
    When you run main.py, the script prompts you to provide:

    • Stations file (JSON describing stations for a sanctuary)
    • Source dataset (MARINe Transects, MARINe Photoplots, or MARINe Seastars)
    • Target assemblage (e.g., Mytilus)
    • Indicator species (e.g., Mytilus californianus)
  2. CSV ingestion
    The corresponding CSV file (Darwin Core format) is read and filtered to only include relevant assemblages/species.

  3. Data processing
    The pipeline:

    • Groups data by year and station
    • Builds pivot tables of values
    • Calculates mean, standard deviation, and station counts across stations per year
    • Assembles results into a JSON with a metadata block
  4. JSON output
    Results are saved to the JSON_outputs/ folder, with filenames based on sanctuary name, dataset, and assemblage.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published