A modern data stack demonstration showcasing orchestration, transformation, and cloud data warehousing integration.
This project demonstrates a complete data pipeline using industry-standard tools:
- 🎯 Dagster: Orchestrates data workflows and manages asset dependencies
- 🔄 dbt: Transforms raw data into analytics-ready models using SQL
- ❄️ Snowflake: Provides scalable cloud data warehouse for storage and compute
```
┌─────────────────┐     ┌─────────────┐     ┌─────────────┐
│     Dagster     │────▶│     dbt     │────▶│  Snowflake  │
│ (Orchestration) │     │ (Transform) │     │ (Warehouse) │
│                 │     │             │     │             │
│ - Asset mgmt    │     │ - SQL models│     │ - Storage   │
│ - Scheduling    │     │ - Tests     │     │ - Compute   │
│ - Monitoring    │     │ - Docs      │     │ - Scaling   │
└─────────────────┘     └─────────────┘     └─────────────┘
```
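The bridge between the first two boxes is the dagster-dbt integration, which loads each dbt model as a Dagster asset. Below is a minimal sketch of that wiring, not this repo's actual code; it assumes the dbt project has already been compiled so that `target/manifest.json` exists, and the paths and names are illustrative:

```python
from pathlib import Path

from dagster import AssetExecutionContext, Definitions
from dagster_dbt import DbtCliResource, dbt_assets

# Illustrative path to the dbt project shown in the tree below;
# adjust it to wherever Dagster is launched from.
DBT_PROJECT_DIR = Path("dbt_demo")

@dbt_assets(manifest=DBT_PROJECT_DIR / "target" / "manifest.json")
def dbt_demo_models(context: AssetExecutionContext, dbt: DbtCliResource):
    # `dbt build` runs models and tests; each model surfaces as a Dagster asset.
    yield from dbt.cli(["build"], context=context).stream()

defs = Definitions(
    assets=[dbt_demo_models],
    resources={"dbt": DbtCliResource(project_dir=str(DBT_PROJECT_DIR))},
)
```

With this in place, every dbt model shows up in Dagster's asset graph with its upstream and downstream dependencies intact.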
🚀 One-click development environment: open this repository in GitHub Codespaces for zero local setup.
See GH_CODESPACES.md for complete Codespaces documentation.
```
dagster-snowflake-dbt-demo/
├── dagster-demo/          # Orchestration layer
│   └── dagster_demo/      # Dagster assets and jobs
├── dbt_demo/              # Data transformation layer
│   ├── models/            # SQL transformation models
│   └── dbt_project.yml    # dbt configuration
└── venv/                  # Python virtual environment
```
- Set Up the Environment: Activate the Python virtual environment (`venv/` in the tree above)
- Configure Connections: Provide Snowflake credentials, for example as environment variables (see the sketch after this list)
- Run the Pipeline: Execute data workflows through the Dagster UI or CLI
- View Results: Monitor pipeline execution and inspect the transformed data
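As a hedged sketch of the connection step, assuming credentials are supplied through environment variables (the variable names below are illustrative, not mandated by this repo):

```python
from dagster import Definitions, EnvVar
from dagster_snowflake import SnowflakeResource

# EnvVar defers reading the value until runtime, so secrets
# stay out of code and out of logs.
snowflake = SnowflakeResource(
    account=EnvVar("SNOWFLAKE_ACCOUNT"),
    user=EnvVar("SNOWFLAKE_USER"),
    password=EnvVar("SNOWFLAKE_PASSWORD"),
    warehouse=EnvVar("SNOWFLAKE_WAREHOUSE"),
    database=EnvVar("SNOWFLAKE_DATABASE"),
)

defs = Definitions(resources={"snowflake": snowflake})
```

Keeping credentials in environment variables means the same definitions work unchanged locally, in Codespaces, and in CI.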
- Unified Orchestration: Dagster manages the entire data pipeline lifecycle
- SQL-First Transformations: dbt enables analytics engineers to build reliable data models
- Cloud-Native: Leverages Snowflake's elastic compute and storage
- Data Quality: Built-in testing and validation at every step (see the asset-check sketch after this list)
- Observability: Comprehensive monitoring and lineage tracking
- Automated CI/CD: Dagger-powered continuous integration with comprehensive testing
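As one concrete example of the data-quality point, Dagster asset checks attach validations directly to assets. This is a minimal sketch with a made-up `orders` asset, not code from this repo:

```python
from dagster import AssetCheckResult, asset, asset_check

@asset
def orders():
    # Stand-in for a real load; in this project the data would land in Snowflake.
    return [{"order_id": 1}, {"order_id": 2}]

@asset_check(asset=orders)
def orders_not_empty(orders) -> AssetCheckResult:
    # Runs after the asset materializes; the result appears next to the asset in the UI.
    return AssetCheckResult(passed=len(orders) > 0)
```

dbt's own schema tests cover the SQL models; asset checks extend the same idea to the Python side of the pipeline.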
This project includes a robust CI/CD pipeline powered by Dagger that runs on every push and pull request:
- 🔍 Code Linting: Black, Ruff, and isort for code quality
- 🧪 Testing: Pytest for unit and integration tests
- ✅ Dagster Validation: Ensures all assets and definitions load correctly
- 🔨 dbt Validation: Validates SQL models and compilation
- 🛡️ Security Scanning: Safety and Bandit for dependency and code security
The CI pipeline validates dbt against an in-memory DuckDB database, so no external warehouse connection or credentials are required.
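The actual pipeline definition lives in the repo's CI code; purely as a flavor of the pattern, here is a minimal sketch of a lint step using the Dagger Python SDK (the image, excludes, and commands are illustrative assumptions):

```python
import anyio
import dagger

async def lint() -> None:
    # Each CI step runs in an ephemeral container with the repo mounted in.
    async with dagger.Connection(dagger.Config()) as client:
        src = client.host().directory(".", exclude=["venv/"])
        output = await (
            client.container()
            .from_("python:3.13-slim")
            .with_directory("/src", src)
            .with_workdir("/src")
            .with_exec(["pip", "install", "ruff"])
            .with_exec(["ruff", "check", "."])
            .stdout()
        )
        print(output)

anyio.run(lint)
```

Because every step is containerized, the same pipeline runs identically on a laptop and in CI.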
This pattern is ideal for:
- Analytics Engineering: Building reliable data models for BI and reporting
- Data Pipeline Automation: Scheduling and monitoring data workflows
- Data Quality Assurance: Implementing tests and checks throughout the pipeline
- Team Collaboration: Enabling data teams to work with familiar SQL-based tools
- Python 3.13+: Runtime environment
- Dagster: Data orchestration platform
- dbt: Data transformation framework
- Snowflake: Cloud data platform
- SQL: Primary transformation language
This demo showcases modern data engineering practices using open-source orchestration with enterprise-grade data infrastructure.