This repository is home to presentations and demo projects created by the Dagster team.
If you have any questions, you are welcome to contact us on Slack, or through GitHub discussions.
- Deep Dive - Dagster Running Dagster: Compass
- Best Practices for LLM Development
- Small Data - Composable Data Workflows: Building Pipelines That Just Work
- All Things Open - Enabling community education
- Deep Dive - Dagster Running Dagster: Streaming
- Deep Dive - dltHub
- Deep Dive - Breaking Biology's Data Wall
- Deep Dive - Cooking with gas: Building a data platform at US Foods
- Deep Dive - Building Breakthrough AI Applications with Not Diamond
- Deep Dive - Shifting Left and Moving Forward
- Deep Dive - Orchestrating AI Agents
- Deep Dive - The Evolution of Data Catalogs
- Deep Dive - Dagster Modal Demo
- Building a True Data Platform: Beyond the Modern Data Stack
- Dagster, SDF, & the Evolution of the Data Platform
- Deep Dive - Data Quality
- Motherduck, Dagster, Evidence Demo
- Deep Dive - Enable Data Mesh
- Deep Dive - Thinking in Partitions
- Deep Dive - Resources & Configurations
- Deep Dive - Automations
|
π Resources: Slides β’ Video Learn how Dagster Labs uses Dagster to power its own internal data platform with Compass. |
|
π
Date: January 27, 2026 π₯ Authors: Alex Noonan, Dennis Hume, Colton Padden π Resources: Slides Learn best practices for integrating LLMs into Dagster pipelines, covering the development lifecycle, common pitfalls to avoid, and how to leverage the official Claude plugin for AI-powered data orchestration. |
|
π
Date: November 4, 2025 π₯ Authors: Dennis Hume π Resources: Slides β’ Source Code Learn how to build a zero-cost GitHub issue deduplication system using Dagster and DuckDB, covering data ingestion, embeddings generation, and composable workflow automation. |
|
π
Date: October 14, 2025 π₯ Authors: Colton Padden π Resources: Slides See how the team at Dagster creates educational content for the community with a little help from AI. |
|
π
Date: August 26, 2025 π₯ Authors: Nicholas Roach π Resources: Slides Learn how Dagster Labs uses Dagster to power its own internal data platform, including implementing streaming event ingestion for Dagster+ and testing experimental features in production. |
|
π
Date: July 8, 2025 π₯ Authors: Alena Astrakhantseva, Aashish Nair, Colton Padden, Alex Noonan π Resources: Slides Rapidly developing data pipelines with dltHub and Dagster |
|
π
Date: July 1, 2025 π₯ Authors: Keith Kam π Resources: Slides Learn how Basecamp Research scaled a multi-disciplinary data platform to power BioAI foundation models and overcome the unique challenges of managing biological data at scale. |
|
π
Date: April 1, 2025 π₯ Authors: Lee Littlejohn, Alex Noonan π Resources: Slides Discover how US Foods transformed their fragmented data infrastructure into a scalable, organized data platform using Dagster, Snowflake, and AWS with infrastructure as code, achieving improved visibility, faster onboarding, and team self-service. |
|
π
Date: February 11, 2025 π₯ Authors: Alejandro Companioni, Tomas Kofman, Colton Padden π Resources: Slides Learn how to use Not Diamond's intelligent LLM routing with Dagster to automatically select the best model for each task, optimizing for quality, cost, and latency in your AI pipelines. |
|
π
Date: January 14, 2025 π₯ Authors: Alex Noonan, Colton Padden, Jacob Matson π Resources: Slides β’ Source Code Learn how to build modern data pipelines with Dagster and MotherDuck through a hands-on Bluesky data platform demo that demonstrates shifting left in data engineering. |
|
π
Date: October 31, 2024 π₯ Authors: Olivier Dupuis, Izzy Miller, Colton Padden π Resources: Slides β’ Video Learn how to build and orchestrate AI agent prototypes within a data platform using LangChain, Hex, and Dagster to enrich unstructured climate narratives data. |
|
π
Date: October 15, 2024 π₯ Authors: Alex Noonan π Resources: Slides β’ Video Learn how data catalogs have evolved to enable data discovery through metadata collection and indexing, and explore Dagster's approach to addressing the limitations of traditional catalog solutions. |
|
π
Date: September 24, 2024 π₯ Authors: Charles Frye, Colton Padden π Resources: Slides β’ Source Code β’ Video Learn how to orchestrate scalable ML workloads by combining Dagster's state management and observability with Modal's auto-scaling infrastructure and GPU provisioning. |
|
π
Date: September 6, 2024 π₯ Authors: Pedram Navid π Resources: Slides β’ Video Explore the shortcomings of the Modern Data Stack and discover how to build a unified data platform using Dagster that delivers on the promises of scalability, observability, and maintainability. |
|
π
Date: August 22, 2024 π₯ Authors: Lukas Schulte, Pedram Navid π Resources: Slides β’ Video Learn how Dagster and SDF combine to bridge the gap between orchestration and transformation, enabling local SQL development, enhanced data quality, and seamless integration between data platforms. |
|
π
Date: August 6, 2024 π₯ Authors: Colton Padden π Resources: Slides β’ Video Learn how to build reliable data platforms using Dagster's data quality features and explore the six key dimensions of data quality: timeliness, completeness, accuracy, validity, consistency, and uniqueness. |
|
π
Date: April 18, 2024 π₯ Authors: Colton Padden, Alex Monahan π Resources: Slides Learn how to seamlessly transition data pipelines from local development to production using MotherDuck's hybrid compute architecture with Dagster orchestration, dbt transformations, and Evidence dashboards. |
|
π
Date: April 2, 2024 π₯ Authors: Tim Castillo π Resources: Slides β’ Source Code β’ Video Learn how to enable a data mesh paradigm with Dagster by implementing the four core principles of domain-driven ownership, data-as-a-product, federated governance, and self-serve platforms. |
|
π
Date: March 5, 2024 π₯ Authors: Tim Castillo π Resources: Slides β’ Video Learn how to use Dagster partitions to break the linear relationship between growing data volumes and pipeline resource consumption. |
|
π
Date: February 20, 2024 π₯ Authors: Colton Padden π Resources: Slides β’ Video Learn how to use Dagster's configurations and resources to build reusable, flexible data pipelines that can be tested locally and deployed to production with confidence. |
|
π
Date: February 13, 2024 π₯ Authors: Pedram Navid π Resources: Slides β’ Video Learn how to automate your Dagster data pipelines using schedules, sensors, and auto-materialize policies to eliminate manual execution and simplify complex workflows. |