etl-pipelines

This project demonstrates a comprehensive data warehousing and analytics solution, from building a data warehouse to generating actionable insights. Designed as a portfolio project, it highlights industry best practices in data engineering and analytics.

data-modeling data-architecture etl-pipelines analytics-and-reporting

Updated Feb 8, 2026
TSQL

ragztigadi / BigData-ETL-Pipelines-Ecommerce

Star

Big Data ETL pipeline for Brazilian e-commerce data. Implements data ingestion, transformation, and storage using Apache Spark, Hadoop, and SQL. Designed for scalable data processing and analytics.

mysql sql mongodb python3 powerbi azure-databricks azure-devops etl-pipelines

Updated Apr 1, 2025
HTML

Willie-Conway / IBM-Relational-Database-Administrator-with-GenAI-Portfolio

Star

🗄️ IBM Relational Database Administrator with GenAI Certificate Portfolio – A comprehensive collection of projects, labs, and assignments showcasing expertise in relational database administration, 🏘️data warehousing, 🔁ETL pipelines, and 🤖Generative AI integration for modern database management.

Updated Feb 22, 2026
PLpgSQL

Willie-Conway / IBM-Data-Engineering-Portfolio

Star

🚀 A comprehensive showcase of projects and skills from the IBM Data Engineering Professional Certificate! 📚 Features include: 🔄 ETL pipelines, 🗄️ data warehousing, ⚡ big data processing with Spark/Hadoop, 🛠️ database administration, and 📈 business intelligence dashboards. Built with 🦾 to demonstrate real-world data engineering capabilities!

Updated Mar 1, 2026
PLpgSQL

edugmenes / azure-data-engineering

Star

This repository contains my first end-to-end Data Engineering project, built using Microsoft Azure Cloud and Azure Databricks with PySpark.

data cloud spark azure pyspark data-structures data-engineering databricks microsoft-azure delta-lake etl-pipelines lakehouse data-lakehouse medallion-architecture lakehouse-architectures

Updated Jan 29, 2026
Jupyter Notebook

Ratnesh-181998 / AWS-Services-For-Data-Engineering-With-Projects

Star

Master the AWS Data Stack! 🚀 This repository features 15+ Industrial Data Engineering Projects covering Serverless ETL, Real-Time Streaming, & Data Warehousing. Hands-on labs for S3, Lambda, Spark, Airflow, Snowflake, Redshift, Kinesis, & Glue. Includes production-grade CICD pipelines. A complete roadmap to becoming a top Data Professional.

aws aws-lambda snowflake pyspark data-engineering amazon-kinesis amazon-dynamodb cicd amazon-redshift real-time-streaming amazon-s3 amazon-athena apache-airflow aws-step-functions aws-glue github-actions delta-lake etl-pipelines

Updated Mar 6, 2026

siddharthgada / Udacity-Data-Engineering-with-AWS-Nanodegree

Star

Complete portfolio of data engineering projects from Udacity's Data Engineering with AWS Nanodegree.

apache-spark relational-databases nosql-database airflow-dags etl-pipelines lakehouse-architectures automated-workflows-using-aws-services

Updated Aug 7, 2025
Jupyter Notebook

Guilherme-B / baboon

Star

JSON-driven ETL pipeline framework prototype

json dag bonobo etl-pipelines

Updated Mar 25, 2020
Python

siddarthaThentu / Disaster-Response-Pipeline

Star

A deployed machine learning model that has the capability to automatically classify the incoming disaster messages into related 36 categories. Project developed as a part of Udacity's Data Science Nanodegree program.

bootstrap flask machine-learning plotly python3 data-analytics hyperparameter-optimization feature-engineering ensemble-models ml-pipelines etl-pipelines

Updated Jun 10, 2021
Python

Improve this page

Add a description, image, and links to the etl-pipelines topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the etl-pipelines topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

etl-pipelines

Here are 35 public repositories matching this topic...

Zipstack / unstract

yobix-ai / extractous

Burla-Cloud / burla

patterns-app / patterns-devkit

level-vc / useful

datacompose / datacompose

Chek0rrdn / DataEngineer_ETL

abrahamkoloboe27 / Airflow-Pipeline-Dashboard-Compagnie-Aerienne

EmmanuelEzenwere / DataSift

angelxd84130 / Airflow-ETL

prneidhardt / Apache-Data-Pipeline

chetnarathore10 / data_warehouse_project

ragztigadi / BigData-ETL-Pipelines-Ecommerce

Willie-Conway / IBM-Relational-Database-Administrator-with-GenAI-Portfolio

Willie-Conway / IBM-Data-Engineering-Portfolio

edugmenes / azure-data-engineering

Ratnesh-181998 / AWS-Services-For-Data-Engineering-With-Projects

siddharthgada / Udacity-Data-Engineering-with-AWS-Nanodegree

Guilherme-B / baboon

siddarthaThentu / Disaster-Response-Pipeline

Improve this page

Add this topic to your repo