🚄 Railway Trains: Databricks Performance Pipeline

A comprehensive data engineering project analyzing passenger train performance across the Dutch railway network. This pipeline processes historical stop and service data to generate actionable insights into delays, cancellations, and platform changes.

🏗️ Project Structure

data/: Documentation on data sources and a detailed data dictionary.
pipeline_code/: The core Databricks logic, organized into a Medallion architecture.
visuals/: Screenshots and a demo video of the final performance dashboard.

⚙️ Data Architecture (Medallion)

We use a multi-layered approach to transform raw data into insights:

Bronze (Raw): Raw CSV ingestion with schema evolution and basic sanitization.
Silver (Cleaned): Data typing, cleaning, and enrichment. Includes derived on-time flags (threshold <= 5 min) and performance classification.
Gold (Business): Optimized dimensional models (fact_stops, dim_station) and daily performance aggregations for reporting.

📊 Insights & Dashboards

The pipeline feeds a dashboard that tracks KPIs like:

Arrival/Departure On-Time %
Cancellation Rates
Platform Change Severity
Peak Hour Performance (Morning vs. Evening Rush)

Check out the visuals folder for more breakdowns.

🔗 Data Source

Data is curated from the NS API by Rijden de Treinen. You can find more details in how_to_get_data.md.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
data		data
pipeline_code		pipeline_code
visuals		visuals
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🚄 Railway Trains: Databricks Performance Pipeline

🏗️ Project Structure

⚙️ Data Architecture (Medallion)

📊 Insights & Dashboards

🔗 Data Source

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🚄 Railway Trains: Databricks Performance Pipeline

🏗️ Project Structure

⚙️ Data Architecture (Medallion)

📊 Insights & Dashboards

🔗 Data Source

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages