Skip to content

Latest commit

 

History

History
22 lines (16 loc) · 622 Bytes

File metadata and controls

22 lines (16 loc) · 622 Bytes

v2 Roadmap (Enterprise)

Planned order (low risk → high impact):

  1. EventBridge + Step Functions

    • Scheduled backfill/replay
    • Orchestrated runs with retries/timeouts
    • Manual approval + notifications
  2. Glue (Catalog + Crawler + Job)

    • Catalog tables for Silver Parquet
    • Partition management
    • Batch compaction / repartition
  3. Data Quality (Great Expectations)

    • Validate Silver outputs as a gate
    • Store validation results in S3 + metrics
  4. EMR (or EMR Serverless)

    • Large-scale joins/aggregations/historical recompute
    • Workloads too heavy for Lambda/Glue alone