panditrahulsharma/dbt_databrick_demo

🛠️ DBT Installation Guide

Install DBT (Core + Databricks Adapter)

python3 -m pip install dbt-core==1.8 dbt-databricks

Confirm Installation

dbt --version

🚀 Initialize a New DBT Project

dbt init

Update profiles.yml Location

Move the auto-generated profile into your project directory. dbt checks the current working directory for profiles.yml before falling back to ~/.dbt/:

mv /Users/abc/.dbt/profiles.yml ./profiles.yml
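
A project-local profile can hold one output per environment. The snippet below is a minimal sketch, not this repository's actual profile: the profile name `project_1`, the catalog and schema names, and the `DATABRICKS_*` environment variable names are all assumptions. It writes to `profiles.example.yml` so an existing `profiles.yml` is left untouched.

```shell
# Sketch: write an example multi-environment profile for the Databricks adapter.
# Assumptions (not taken from this repo): profile name, catalog/schema names,
# and the DATABRICKS_* environment variable names.
cat > profiles.example.yml <<'EOF'
project_1:                     # must match the `profile:` key in dbt_project.yml
  target: dev                  # default target when --target is not passed
  outputs:
    dev:
      type: databricks
      catalog: dev_catalog     # hypothetical Unity Catalog name
      schema: analytics        # hypothetical schema name
      host: "{{ env_var('DATABRICKS_HOST') }}"
      http_path: "{{ env_var('DATABRICKS_HTTP_PATH') }}"
      token: "{{ env_var('DATABRICKS_TOKEN') }}"
      threads: 4
    qa:
      type: databricks
      catalog: qa_catalog
      schema: analytics
      host: "{{ env_var('DATABRICKS_HOST') }}"
      http_path: "{{ env_var('DATABRICKS_HTTP_PATH') }}"
      token: "{{ env_var('DATABRICKS_TOKEN') }}"
      threads: 4
    prod:
      type: databricks
      catalog: prod_catalog
      schema: analytics
      host: "{{ env_var('DATABRICKS_HOST') }}"
      http_path: "{{ env_var('DATABRICKS_HTTP_PATH') }}"
      token: "{{ env_var('DATABRICKS_TOKEN') }}"
      threads: 4
EOF
```

Reading the host, HTTP path, and token through `env_var` keeps credentials out of version control, which matters once the profile is committed alongside the project.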

🔧 Test DBT Setup

dbt debug

📥 Load Seed Data into Databricks

dbt seed

🏆 Run Gold Layer Models

dbt run --select project_1
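
The same selection can be pointed at a specific environment with `--target`. The wrapper below is a rough sketch of how the seed, run, and test steps might be chained per environment. It assumes targets named `dev`, `qa`, and `prod` exist in profiles.yml; `run_layer`, `run_step`, and `DRY_RUN` are hypothetical names, not dbt features. By default it only prints the commands.

```shell
# Sketch: seed, run, and test one environment in sequence.
# Assumptions: targets dev/qa/prod are defined in profiles.yml;
# run_layer, run_step, and DRY_RUN are illustrative names only.
DRY_RUN="${DRY_RUN:-1}"   # 1 = print the commands (default), 0 = execute them

run_step() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

run_layer() {
  target="$1"
  # Reject anything that is not one of the expected environments.
  case "$target" in
    dev|qa|prod) ;;
    *) echo "unknown target: $target" >&2; return 1 ;;
  esac
  # Stop at the first failing step.
  run_step dbt seed --target "$target" &&
  run_step dbt run --select project_1 --target "$target" &&
  run_step dbt test --select project_1 --target "$target"
}

run_layer dev
```

Setting `DRY_RUN=0` before calling `run_layer prod` would execute the three commands against the `prod` target instead of printing them.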

📘 DBT Documentation Site

dbt docs generate
dbt docs serve

DBT Setup & Resources

📚 Helpful Articles

Building a Production-Ready dbt CI/CD Pipeline

🎓 Course Guide


📌 Project Phase 1 (P1) Required Steps

A strong DBT project should include the following:

  • Test Cases (generic + custom + unit testing): Implemented
  • Column-level tagging and descriptions
  • Freshness tests
  • Table Descriptions & Documentation
  • Generic Test Cases (unique, not null, relationships, accepted values): Implemented
  • Deployment Steps (Dev → QA → Prod): Implemented
  • Incremental Models & Snapshots
  • Multi‑environment Profiles (profiles.yml): Implemented
  • Schema Documentation Example: https://github.com/reshmavarghese15/data-pipeline-dbt-aws-demo/blob/main/data_pipeline_dbt_aws_demo/models/example/schema.yml
  • Freshness Checks (a freshness: block under sources:)
  • Data Quality Checks
  • CI/CD Pipeline Setup (GitHub Actions / Azure DevOps / GitLab CI)
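
Several of the items above (descriptions, column-level tags, generic tests, and source freshness) live in the same kind of YAML properties file. The following is a minimal sketch of what such a file could look like. Every source, model, and column name in it is hypothetical and not taken from this repository. It uses the `data_tests:` key, which dbt 1.8 accepts alongside the older `tests:` key, and writes to `schema.example.yml` rather than into `models/`.

```shell
# Sketch: write an example properties file covering descriptions, tags,
# generic data tests, and a source freshness block.
# All source, model, and column names are hypothetical.
cat > schema.example.yml <<'EOF'
version: 2

sources:
  - name: raw                       # hypothetical source
    schema: raw
    loaded_at_field: _loaded_at     # hypothetical load-timestamp column
    freshness:
      warn_after: {count: 12, period: hour}
      error_after: {count: 24, period: hour}
    tables:
      - name: orders

models:
  - name: dim_customers
    description: "One row per customer."
    columns:
      - name: customer_id
        description: "Primary key of the customer dimension."
        tags: ["pii"]
        data_tests:
          - unique
          - not_null

  - name: fct_orders
    description: "One row per order."
    columns:
      - name: order_id
        description: "Primary key of the order fact."
        data_tests:
          - unique
          - not_null
      - name: customer_id
        description: "Foreign key to dim_customers."
        data_tests:
          - relationships:
              to: ref('dim_customers')
              field: customer_id
      - name: status
        description: "Current order status."
        data_tests:
          - accepted_values:
              values: ['placed', 'shipped', 'returned']
EOF
```

With a file like this under `models/`, `dbt source freshness` evaluates the freshness block and `dbt test` runs the data tests.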

About

Building DataLake with DBT and Databricks