I am a German student, who is passionate about Platform & Cloud Engineering as well as Data Engineering and the intersection of both. Currently focusing on cloud system design and event-streaming.
Cloud / Data Platform 🏗
- Distributed System on Aws streaming earthquakes in real-time
- Custom Terraform provider extending the AWS ECR provider package
- Sample Data Lakehouse architecture, deployed in containers
Open source contributions 💡
Click to expand
Project | Added | Link |
---|---|---|
Apache Airflow | Functionality and respective unit tests to export and import roles including permissions using the Airflow CLI | Merged Pull-Request |
Apache Airflow | Changed the Airflow docker-compose to easily ingest custom config files and added relevant documentation | Merged Pull-Request |
PM4PY | Functionality to filter for a maximum coverage percentage of graph variants | Merged Pull-Request |
Apache Airflow | Added missing documentation for an Operator | Merged Pull-Request |
Apache Airflow | Changed the Kubernetes JobOperator to solve an existing race condition | Closed Pull-Request |
Apache Airflow | Changed internals of db export-archived command to write table rows in batches and not run into OOM issues | Merged Pull-Request |
Apache Airflow | Fixed Regression with the RDS Operator | Merged Pull-Request |
Apache Airflow | Cleaned various AWS operator's Constructors and added tests to avoid regression | Merged Pull-Request |