Demo Data Product

A reference implementation demonstrating best practices for building data products in the UiTwisselingsplatform (UWP) data mesh. This dataproduct serves as a learning resource and onboarding example for new data product developers.

Overview

This demo dataproduct transforms CSV data into RDF (Resource Description Framework) linked data and publishes it to a Fuseki SPARQL endpoint. It demonstrates the standard transformation pipeline pattern used across data products in the platform, including:

Incremental data processing: Only processes new or modified data based on timestamps
Multi-entity type support: Processes multiple entity types (locations, activities, participants, etc.) in a data-driven configuration
Data quality validation: Validates generated RDF against SHACL shapes before loading
Graph isolation: Stores each entity type in its own named graph for better organization
Python version compatibility: Validates Python version against Paketo buildpack requirements
Structured logging: Provides clear, timestamped logs for monitoring and debugging

What This Dataproduct Does

Reads CSV data from input files for different entity types (locations, activities, participants, etc.)
Transforms CSV to RDF using YARRRML mappings that define how tabular data maps to RDF triples
Validates the RDF against SHACL shapes to ensure data quality
Loads the RDF into Fuseki, a SPARQL server, making the data queryable via SPARQL
Tracks data freshness by comparing input data timestamps with what's already in Fuseki, enabling incremental updates

Key Features

✅ Production-ready pattern: Demonstrates the standard transformation pipeline used in production dataproducts
✅ Well-documented: Comprehensive documentation for both transformation code and output exploration
✅ Best practices: Includes Python version checking, structured logging, error handling, and incremental processing
✅ Educational: Designed as a learning resource with clear code structure and extensive comments

Documentation

This dataproduct includes detailed documentation for different aspects:

Transformation Code Documentation: Comprehensive guide to understanding and modifying the transformation pipeline, including:
- Main script architecture and patterns
- Python version compatibility checking
- Fuseki client usage
- RDF generation and validation
- CSV utilities and date comparison
- Configuration and logging
Linked Data Output Documentation: Guide to exploring and querying the RDF data stored in Fuseki, including:
- SPARQL query examples
- Schema discovery queries
- Graph management operations
- Best practices for querying linked data

Getting Started

This dataproduct is used in the Data Product Developer Onboarding Documentation of the UiTwisselingsplatform.

Note: The Confluence documentation links below refer to documentation in Dutch. The code and inline documentation in this repository are in English.

For detailed information on:

Running the transformation code: See Running Transformation Code
Writing transformation code: See Writing Transformation Code

Repository Information

This repository is maintained by publiq in Bitbucket.

Repository Maintenance

Syncing Private and Public Repositories

Changes to this repository must be manually synced with the public GitHub repository.

Technical documentation about this sync process is available in the private Confluence documentation (publiq internal access required).

Name		Name	Last commit message	Last commit date
Latest commit History 123 Commits
transformer		transformer
.gitignore		.gitignore
README.md		README.md
data-product.yaml		data-product.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Demo Data Product

Overview

What This Dataproduct Does

Key Features

Documentation

Getting Started

Repository Information

Repository Maintenance

Syncing Private and Public Repositories

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

cultuurnet/uwp-demo-dataproduct

Folders and files

Latest commit

History

Repository files navigation

Demo Data Product

Overview

What This Dataproduct Does

Key Features

Documentation

Getting Started

Repository Information

Repository Maintenance

Syncing Private and Public Repositories

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages