Skip to content

Commit 5b05db4

Browse files
committed
refactor(resume): streamline technology labels and update margins
- Consolidate technology labels for clarity and conciseness - Update margins and font sizes for improved layout - Adjust section title and entry spacing for better readability - Enhance highlight formatting for consistency These changes improve the resume's readability and presentation by simplifying technology categories and optimizing layout settings.
1 parent d81c57c commit 5b05db4

1 file changed

Lines changed: 22 additions & 31 deletions

File tree

resume.yaml

Lines changed: 22 additions & 31 deletions
Original file line numberDiff line numberDiff line change
@@ -18,39 +18,29 @@ cv:
1818
- "Proven track record of exceptional performance optimization—including reducing runtime from 27.5-hours to under 5 seconds, and building scalable data architectures through event-driven design, and production Agentic LLM systems (>95% accuracy) on H100 clusters."
1919
- "Delivered measurable business outcomes: data infrastructure enabling a startup's $6M Series A, GDPR-compliant extracts for Advent International investor review, enterprise compliance for a Fortune 500 healthcare client, and real-time data pipelines saving 10-15 FTEs."
2020
technologies:
21-
- label: "Core Languages & Tools"
22-
details: "Python, SQL, BASH, JavaScript, Git"
23-
- label: "Data Engineering & Orchestration"
24-
details: "ETL/ELT/ETLT Pipelines, Data Modeling, Data Warehousing, Data Lakes, PySpark, Pandas, NumPy, DuckDB, DBT, Airflow, GCP Workflows, Kafka, RabbitMQ, Databricks, Microsoft Fabric, Azure Data Factory (ADF), Treasure Data"
25-
- label: "AI & LLM Engineering"
26-
details: "OpenAI, Gemini, LangChain, Agentic Architecture (MCP, A2A), Retrieval Augmented Generation (RAG), Axolotl, LLM Fine-tuning (SFT, DPO, RLHF), Prompt Engineering, Semantic Deduplication, TensorFlow, Keras, SerpAPI"
27-
- label: "Cloud & DevOps"
28-
details: "AWS, Azure, GCP, S3, Docker, Kubernetes, Terraform, Pulumi, ARM Templates, CI/CD Pipelines, Jenkins, Azure DevOps, NGINX, Linux, Cloud Cost Optimization"
29-
- label: "Application & API Development"
30-
details: "FastAPI, Django, DRF, Flask, REST APIs, Streamlit, CLI Tools (Typer, Textual)"
31-
- label: "Databases & Vector Stores"
32-
details: "PostgreSQL, MongoDB, BigQuery, MS SQL Server, MySQL, Neo4j, Chroma, FAISS"
33-
- label: "Security & Quality Assurance"
34-
details: "Zero-Trust Architecture, Cryptography (E2EE, AES-GCM, RSA), IAM, OAuth2/JWT, SSO/SAML, RBAC, Secret Management, PII Data Handling, Pytest, Unittest, Pydantic, Pandera, Great Expectations"
35-
- label: "Specialized & Visualization"
36-
details: "System Design, Solution Architecture, CDPs, SFMC, Web Scraping, Looker Studio, Power BI, Matplotlib, Plotly, GeoPandas, PostGIS, Realtime Monitoring Systems, Automations, Agile (Scrum/Kanban), Code Reviews, Technical Consultation"
21+
- label: "Data Engineering"
22+
details: "SQL, Databricks, BigQuery, MS Fabric, PySpark, DuckDB, DBT, Airflow, Kafka, ADF, Delta Lake, Pandas, Presto/Trino"
23+
- label: "AI & LLM Ops"
24+
details: "OpenAI, Gemini, LangChain, RAG, Agentic AI (MCP, A2A), Fine-tuning (SFT/DPO), Vector DBs (Chroma/FAISS), Neo4j"
25+
- label: "Backend, DevOps & Viz"
26+
details: "Python, BASH, Git, FastAPI, PostgreSQL, MongoDB, AWS, Azure, GCP, Docker, K8s, Terraform, CI/CD, Power BI"
3727
experience:
3828
- company: "FiftyFive Technologies"
3929
position: "Data Engineer"
4030
location: "Gurugram, HR (Remote)"
4131
start_date: "2022-06"
4232
summary: "Delivered end-to-end data and AI solutions for 10+ enterprise clients across healthcare, finance, logistics, and tech sectors. Responsibilities extended beyond technical delivery: conducted 100+ technical interviews, led client onboarding and scoping calls, created PoCs/demos, mentored teammates, and drove adoption of emerging technologies (DuckDB, MCP) across the organization.\\ **Client Engagements:**"
4333
highlights:
44-
- "**SBGC Group (Retail Analytics)**: Took ownership of a multi-region Azure/Databricks platform integrating SAP/Shopify/Klaviyo/GA4 to feed Hightouch CDP. Unblocked Advent International due diligence by reverse-engineering Shopify's logic to cut revenue variance from ~10% to \\<0.22%. Root-caused a critical legacy data explosion (slashing volume by 99%) and delivered an inventory dashboard worth ~$250k."
34+
- "**SBGC Group (Retail Data Platform)**: Took ownership of a multi-region Azure/Databricks platform integrating SAP/Shopify/Klaviyo/GA4. Unblocked Advent International due diligence by reverse-engineering Shopify's logic to cut revenue variance from ~10% to \\<0.22%. Root-caused a critical legacy data explosion (slashing volume by 99%) and delivered an inventory dashboard worth ~$250k."
4535
- "**AGY Logistics (IoT & Automation)**: Sole architect of a GDP-compliant cold chain system on GCP (BigQuery, Cloud Run). Automated complex manual workflows previously requiring 10-15 US-based operations staff, resulting in massive OpEx savings. Engineered smart time-series merging and statistical analysis on a 4-layer medallion architecture (including a 'Diamond Layer' for sub-second alerts) to reduce false positives by 70%."
46-
- "**Johnson & Johnson (Healthcare CDP)**: Architected scalable Treasure Data pipelines for regulated healthcare markets (JP/ANZ). Engineered complex transformation logic, including a 12-scenario SQL truth table to resolve hierarchical consent conflicts and a custom Python engine to overcome SFMC platform limitations, ensuring strict GDPR/PII compliance."
47-
- "**Wade Insight (Cloud Migration)**: Led enterprise migration from ADF to Microsoft Fabric Data Factory, managing ARM template adaptation. Enhanced the SaaS platform with advanced continue-on-failure orchestration and automated health-check processes, reducing manual troubleshooting by 80%."
48-
- "**Prospexs (GenAI Product Engineering)**: Built an AI outreach platform (Python/FastAPI/MongoDB) featuring a complex 4-entity personalization engine (Sender/Receiver × Human/Company). Integrated OpenAI/Perplexity APIs to generate bilingual, context-aware communications, improving response rates by 45%."
49-
- "**QxLab (LLM Infrastructure)**: Led a 4-person team fine-tuning Llama/Mistral models using Axolotl on H100 clusters to achieve \\>95% accuracy in production agentic system. Separately, architected a Terabyte-scale cascading deduplication pipeline (MinHash LSH + FAISS) to standardize datasets (SFT/DPO/RLHF), solving critical bottlenecks. Later released as the open-source project (DatasetPipeline)."
36+
- "**Johnson & Johnson (Healthcare CDP)**: Architected scalable Treasure Data pipelines for regulated healthcare markets (JP/ANZ). Engineered complex transformation logic, including a 12-scenario SQL truth table (validated via TDD) to resolve hierarchical consent conflicts and a custom Python engine to overcome SFMC platform limitations, ensuring strict GDPR/PII compliance."
37+
- "**Wade Insight (Cloud Migration)**: Led enterprise migration from Azure Data Factory to Microsoft Fabric Data Factory, managing ARM template adaptation. Enhanced the SaaS platform with advanced continue-on-failure orchestration and automated health-check processes, reducing manual troubleshooting by 80%."
38+
- "**Prospexs (GenAI Product Engineering)**: Built an AI outreach platform (Python/FastAPI/MongoDB) backed by a 700M+ contact database. Engineered a complex 4-entity personalization engine (Sender/Receiver × Human/Company) using OpenAI/Perplexity to generate bilingual, context-aware communications, improving response rates by 45%."
39+
- "**QxLab (LLM Infrastructure)**: Led a 4-person team fine-tuning Llama/Mistral models using Axolotl on H100 clusters to achieve \\>95% accuracy in a production DAG-based agentic system. Separately, architected a Terabyte-scale cascading deduplication pipeline (MinHash LSH + FAISS) to standardize datasets (SFT/DPO/RLHF), solving critical bottlenecks. Later open-sourced as DatasetPipeline."
5040
- "**CV Advisors (Performance Engineering)**: Replaced a 1900-line legacy SQL procedure (iterative cursors) into vectorized logic (DuckDB/Pandas), processing 80M+ records across 150 clients. Reduced runtime from 27.5h to \\<5s (\\>99.9% gain). Engineered a future-proof architecture capable of scaling beyond RAM limits via memory-optimized categorical typing and DuckDB’s out-of-core processing."
5141
# - "**Logical Contract (Legal Tech)**: Implemented an AI-powered legal tech system for generating tailored employment agreements and a legal chatbot for startup inquiries."
5242
- "**SlideNinja (GenAI RAG Architecture)**: Developed a GenAI RAG platform (LangChain/ChromaDB) for a McKinsey partner during the early GenAI boom, featuring a 'Self-Healing' AI orchestration layer and proprietary 'Geometric Layout Analysis' to map unstructured content into rigid corporate PowerPoint templates."
53-
- "**LoopKitchen (Real-time Data Ingestion)**: Built core data architecture (GCP/BigQuery/FastAPI) enabling a $6M Series A. Migrated fragile legacy scrapers to robust official API integrations (UberEats/DoorDash/Grubhub) for millions of orders and engineered an automated dispute resolution system to directly recover lost revenue."
43+
- "**LoopKitchen (Real-time Data Ingestion)**: Built core data architecture (GCP/BigQuery/FastAPI/GCP Workflows) enabling a $6M Series A. Migrated fragile legacy scrapers to robust official API integrations (UberEats/DoorDash/Grubhub) for millions of orders and engineered an automated dispute resolution system to directly recover lost revenue."
5444

5545
- company: "FiftyFive Technologies"
5646
position: "Software Engineer Intern"
@@ -91,6 +81,7 @@ cv:
9181
- "Architected a GDP-compliant cold chain monitoring system on GCP using a Medallion Architecture (Bronze/Silver/Gold/Diamond). The 'Diamond Layer' pre-aggregated alert candidates, enabling the Alerting Engine to dispatch notifications with sub-second latency."
9282
- "Engineered 'Smart Merge' logic using SQL window functions (`LAST_VALUE`) to unify asynchronous data streams (sporadic TMS updates vs. high-frequency IoT telemetry) into a continuous, context-aware timeline."
9383
- "Designed an internal 'Phone Book' architecture to decouple sensor mapping from external TMS errors, ensuring data integrity even when manual entry errors occurred in the source system."
84+
- "Automated complex manual workflows previously requiring 10-15 US-based operations staff, resulting in massive OpEx savings."
9485
- "Slashed cloud compute costs by deprecating expensive BQML (Machine Learning) models in favor of optimized SQL-based statistical analysis (rolling windows/standard deviations), reducing false positive alerts by 70%."
9586
- name: "Johnson & Johnson - Healthcare Data Platform (CDP)"
9687
date: "Feb 2025 – Jun 2025"
@@ -247,10 +238,10 @@ design:
247238
theme: engineeringresumes
248239
page:
249240
size: a4
250-
top_margin: 1cm
251-
bottom_margin: 1cm
252-
left_margin: 1cm
253-
right_margin: 1cm
241+
top_margin: 1.05cm
242+
bottom_margin: 1.05cm
243+
left_margin: 1.05cm
244+
right_margin: 1.05cm
254245
show_page_numbering: false
255246
show_last_updated_date: false
256247
colors:
@@ -287,7 +278,7 @@ design:
287278
# Use this if applying to traditional banks, law firms, or PhD research roles.
288279
# EB Garamond: The "Classy" choice. Old-style serif. Looks like a book. High prestige, low "tech" vibe.
289280
font_family: Inter
290-
font_size: 9.5pt
281+
font_size: 9.6pt
291282
alignment: justified
292283
date_and_location_column_alignment: right
293284
leading: "0.65em"
@@ -308,7 +299,7 @@ design:
308299
section_titles:
309300
type: with-full-line
310301
font_family: Inter
311-
font_size: 1.2em
302+
font_size: 1.3em
312303
bold: true
313304
small_caps: true
314305
vertical_space_above: 0.4cm
@@ -317,17 +308,17 @@ design:
317308
date_and_location_width: 4cm
318309
left_and_right_margin: 0cm
319310
horizontal_space_between_columns: 0.1cm
320-
vertical_space_between_entries: 0.8em
311+
vertical_space_between_entries: 1em
321312
allow_page_break_in_sections: true
322313
allow_page_break_in_entries: true
323314
short_second_row: false
324315
show_time_spans_in: []
325316
highlights:
326317
bullet:
327318
nested_bullet:
328-
top_margin: 0.2cm
319+
top_margin: 0.22cm
329320
left_margin: 0.3cm
330-
vertical_space_between_highlights: 0.2cm
321+
vertical_space_between_highlights: 0.22cm
331322
horizontal_space_between_bullet_and_highlight: 0.4em
332323
summary_left_margin: 0.2cm
333324
entry_types:

0 commit comments

Comments
 (0)