Skip to content

redpheonixx/Gym_Summary_Stream_Processing_in_Lakehouse

Repository files navigation

Gym Summary Stream Processing in Lakehouse

Data engineering pipeline for gym data processing leveraging Pyspark, Databricks, and Azure ADLS.

Data Flow Diagram

PR

Overview

  • Developed a robust data engineering pipeline for gym data processing leveraging Pyspark, Databricks, and Azure ADLS.
  • Orchestrated ingestion from diverse sources including CSV, JSON files, and Kafka topics for comprehensive data acquisition.
  • Implemented efficient data processing workflows utilizing Databricks Unity Catalog for streamlined data management and accessibility.
  • Implemented Medallion architecture to strategically structure data within the lakehouse environment.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published