- ๐ง Open Data Lakehouse: Architecting native Google Cloud Lakehouses migrating from Databricks DLT to Apache Iceberg, Apache Spark (Dataproc Serverless), Biglake Metastore Iceberg Catalog, and BigQuery.
- โ๏ธ Reactive Orchestration: Designing event-driven, asset-driven data pipelines using Airflow 3 on GKE Autopilot, managing 800+ dynamic pipelines for 500+ users.
- ๐ค Agentic Data Engineering: Building and integrating local AI agent environments using Model Context Protocol (MCP), plugins, connectors, and custom agent skills to automate SQLX/Dataform, testing, and Git workflows.
- ๐ก๏ธ IaC & Data Governance: Provisioning multi-project cloud topologies (Hub, Dev, Prod, Gov) via Terraform and implementing metadata-driven Data quality profiling & business-glossary-as-code.
- ๐ข Senior Data Engineer at Logical Position
- โก๏ธ Daily stack:
.py,.sqlx(Dataform),.tf,.sh,.go,.json,.yaml - ๐ Active contributor within the Python & Modern Data Stack communities
- ๐ฑ Learning all about Agentic Orchestration, OpenTelemetry (OTel) tracing for pipeline observability, and Iceberg catalog maintenance at scale.
- ๐ฌ Ping me about data lakehouses, Apache Iceberg, Airflow 3, AI Agents & MCP, and cloud automation.
- ๐ซ Reach me: twitter.com/litanpdx




