markiv25/README.md

Hi there, I'm Vikram Parmar 👋

🧑‍💻 Data Engineer @ Anuvu | M.S. Information Technology & Analytics, RIT | DBA candidate @ Westcliff University

I build robust data pipelines and architect scalable data platforms for inflight entertainment systems serving airlines like Southwest, Turkish Airlines, and Air France. I'm passionate about modernizing legacy infrastructure and turning raw flight data into reliable, actionable insights.

  • 🔭 Currently working on Kafka streaming pipelines, Power BI dashboards, and legacy system modernization
  • 🌱 Deepening expertise in data architecture, distributed systems, and real-time ingestion
  • 💬 Ask me about Python, Spark, SQL, ClickHouse, or anything data engineering
  • 📫 Reach me at parmar.vik25@gmail.com or LinkedIn
  • ⚡ Fun fact: I debug pipelines faster than I cook

🚀 Featured Projects & Initiatives

🗄️ MariaDB → Distributed Database Migration
Led a high-stakes migration from a standalone MariaDB instance to a distributed database architecture after the legacy database became a critical bottleneck for the entire pipeline. The work involved deep query refactoring across the codebase, latency optimization, extensive testing, and ACID compliance validation. Outcome: horizontal scalability, capacity for higher transaction volumes, and no single point of failure.

⚡ MariaDB RDS → ClickHouse Aggregation Engine
Designed and implemented the migration of the aggregation layer from MariaDB RDS (temp-table-based) to ClickHouse to handle ~20,000 daily jobs. Rebuilt the aggregation logic to leverage ClickHouse's columnar storage, significantly improving analytical query performance and compression, while continuing to load final results into production MariaDB.

🐍 Python 2 → Python 3 Pipeline Modernization
Leading a full codebase migration from Python 2 to Python 3 across Anuvu's data pipeline infrastructure, integrating Apache Spark and Airflow to replace legacy tooling. Focused on improving pipeline efficiency, maintainability, and performance for post-flight data ingestion, extraction, and storage workflows.

📊 Post-Flight Data Pipeline & SLA Reporting
Maintain and optimize end-to-end data pipelines for ingesting, processing, and storing post-flight data. Generate insights and automated reports for product managers, and support invoicing and SLA compliance across airline clients.


🛠️ Languages & Tools

Core Stack

Python, SQL, Apache Spark, Apache Kafka, Apache Airflow

Databases & Storage

ClickHouse, MariaDB, MongoDB, MySQL

Visualization & BI

Power BI, Tableau

Other Tools

Git, JavaScript, React


🎓 Education

🏆 M.S. Information Technology & Analytics - Rochester Institute of Technology (RIT)
📚 DBA, Information Technology & Management - Westcliff University (In Progress)
🎓 B.E. Electronics & Telecommunication - SIES GST, Mumbai University



Pinned

  1. 612_project_Chatbot (Java)
  2. DW_ISTE724
  3. Iot_Anomaly- (Jupyter Notebook)
  4. Laser_Tag (C++)
  5. mongo (JavaScript)
  6. React_Project (JavaScript)