fiorentinjoao/README.md

João Fiorentin

Data Engineering


About me

Data engineer focused on scalable pipelines, modern data architecture, and software engineering best practices. Currently at Motorista PX, building data solutions that impact operations nationwide.


Stack


Featured projects

| Project | Description | Stack |
| --- | --- | --- |
| brazillian-ecommerce-lakehouse | End-to-end batch pipeline with Medallion architecture using real Olist data | Airflow · PySpark · dbt · Delta Lake · MinIO |
| realtime-fraud-detection-pipeline | Real-time fraud detection on simulated financial transactions | Kafka · Spark Streaming · PostgreSQL |
| dbt-nyc-taxi-analytics | dbt project with DuckDB analyzing New York City taxi data, no cloud required | dbt · DuckDB · GitHub Actions |
| data-ingestion-cli | Python CLI for ingesting data from public APIs, with clean architecture and tests | Python · Typer · SQLAlchemy · pytest |
| airflow-best-practices | 5 DAGs demonstrating modern Airflow 2.x patterns | Airflow · TaskFlow · Sensors · SQL Checks |

Popular repositories

  1. cdd (Public)

    Compression Driven Development — spec → plan → build → understanding visual

    Python 1

  2. fiorentinjoao (Public)

  3. brazillian-ecommerce-lakehouse (Public)

    End-to-end batch data pipeline with Airflow, PySpark, dbt and Delta Lake using Olist dataset

    Python

  4. realtime-fraud-detection-pipeline (Public)

    Real-time fraud detection with Kafka, Spark Structured Streaming and PostgreSQL

    Python

  5. dbt-nyc-taxi-analytics (Public)

    dbt project analyzing NYC Taxi data with DuckDB - runs 100% locally, no cloud needed

  6. data-ingestion-cli (Public)

    Python CLI tool for ingesting data from public APIs (Open-Meteo, IBGE) into PostgreSQL

    Python