Hi, Iβm Nikhilesh β an aspiring Data Engineer focused on building scalable data pipelines and data-driven systems.
I have hands-on experience with Azure Data Factory, Databricks, PySpark, and SQL, building end-to-end ETL pipelines processing millions of records. My work includes implementing Medallion Architecture (Bronze, Silver, Gold) and optimizing query performance.
I also have a strong foundation in Python (Django) and data warehousing concepts like Star Schema, fact and dimension modeling.
πΉ Currently focused on:
- Building real-world data engineering projects
- Strengthening PySpark & Azure skills
- Optimizing large-scale data processing
π‘ I enjoy solving data problems and turning raw data into structured, actionable insights.
- Built end-to-end ETL pipeline using ADF, Databricks, ADLS & Synapse
- Processed 5M+ records
- Implemented Medallion Architecture
- Improved pipeline performance by 30%
- Designed Star Schema (Fact & Dimension tables)
- Optimized SQL queries improving performance by 40%
- Built scalable data transformation pipelines
- Performed large-scale aggregations and optimizations
- Improved query efficiency using indexing & execution plans
- Compared raw vs optimized queries
Data Engineering: PySpark, Databricks, Azure Data Factory, ADLS, Synapse
Programming: Python, SQL
Databases: MySQL, SQL Server
Tools: Git, Power BI, Jupyter Notebook
Web (Basic): Django, HTML, CSS