Skip to content

James-Muguro/James-Muguro

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

118 Commits
 
 
 
 

Repository files navigation

Data Engineer | Pipelines · Warehousing · Cloud · Orchestration

Building reliable data infrastructure that scales.


👨‍💻 About Me

I'm a Data Engineer focused on designing and building the data infrastructure that organizations depend on. I specialize in engineering scalable pipelines, well-modeled warehouses, and automated workflows that make data clean, reliable, and ready for use at scale.

I write production-grade code, care deeply about data quality, and build systems that are easy to maintain and built to last.


🛠️ Tech Stack

Languages

Python SQL Bash

Pipelines & Orchestration

Apache Airflow Apache Kafka Apache Spark dbt

Databases & Warehouses

BigQuery PostgreSQL MySQL Snowflake

Dev & Collaboration

Docker Git GitHub Actions


🚀 What I Build

Area Details
🔄 ETL/ELT Pipelines Batch and streaming pipelines built for reliability and scale
🏗️ Data Warehousing Dimensional models and schemas optimized for downstream use
⚙️ Orchestration Automated, monitored workflows with Airflow and similar tools
🧹 Data Quality Testing frameworks, validation layers, and governance standards
📦 Data Transformation Clean, version-controlled transformations using dbt and SQL

⏱️ Coding Activity

Wakatime


🌱 Currently Exploring

  • Advanced streaming architectures with Kafka and Spark
  • Data lakehouse patterns with Delta Lake and Iceberg
  • Pipeline testing and observability best practices
  • dbt advanced features and package ecosystem

💡 "Good data engineering is invisible — systems just work, data just flows, and teams just trust it."

Profile Views

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors