Welcome to the FOAG Metadata repository! Here, you'll find the metadata files that describe various datasets of ours, the Python automation script responsible for merging and validating our metadata files, as well as the JSON Schemas that ensure each record follows a consistent format. Under data/processed, we store the combined metadata ready for use in the FOAG Data Catalog. This setup helps maintain high-quality, well-structured metadata that is compatible with both I14Y and opendata.swiss.
The metadata you find here is displayed in a more user-friendly way by our data catalog web application.
This repository includes a Quality Assurance Dashboard built with Streamlit and deployed statically via Stlite (Python in the browser). It allows data stewards to audit datasets against the dataset.json schema, check for broken links, and calculate a "FAIRC" quality score.
The dashboard runs entirely in the client's browser (Serverless).
- Builder: GitHub Actions runs
src/audit.pyto validate links and calculate scores. - Snapshot: The results are saved to
dashboard/data_snapshot.json. - Viewer: The
dashboard/folder is published to GitHub Pages.index.htmlloads the Stlite engine, which executesapp.pyusing the JSON snapshot.
Ensure you have Python 3.12+ installed.
-
Clone the repository:
git clone https://github.com/blw-ofag-ufag/metadata.git cd metadata -
Install dependencies: It is highly recommended to use a virtual environment.
# Create virtual env (optional but recommended) python -m venv venv source venv/bin/activate # On Windows use: venv\Scripts\activate # Install packages pip install -r requirements.txt
Before running the dashboard, you must generate the data. This script processes raw JSON, performs async URL health checks, and creates the static JSON snapshot.
# Run the audit pipeline from the project root
# Generates: dashboard/data_snapshot.json
python -m src.auditstreamlit run dashboard/app.py