Description
This issue tracks the native execution gaps in Delta Lake MoR (Merge-on-Read) read and DML paths on the Velox backend, focusing on deletion-vector semantics, fallback reduction, and performance.
The goal is to move Delta MoR in Gluten from partial prototype status to stable native execution.
This tracker is intended to organize the work and align with Delta Lake’s MoR design, deletion vector protocol, and Gluten’s lakehouse integration.
Related issues:
Related context:
Current status
Current PoC work has already made progress in the following areas:
- Delta DV/MoR read foundation has been prototyped in Gluten + Velox.
- Native DV scan/read path is partially working.
- Fallback still exists in some control-plane and non-hot-path operations.
- Native DELETE path has been explored, but MoR DML is not yet complete.
- UPDATE / MERGE are not yet fully native.
Scope
This issue tracks Delta MoR work in the Velox backend, including:
- native reads for Delta tables with deletion vectors
- native DML paths that generate or update deletion vectors
- protocol correctness for DV descriptors and action handling
- performance and fallback reduction for MoR workloads
Out of scope:
- generic Delta CoW improvements unless directly required by MoR
- non-Velox backend work
- unrelated lakehouse features
Priority
P0
P1
P2
Work areas
1. MoR read path
2. MoR write path
3. Delta protocol alignment
4. Performance
5. Testing and validation
Success criteria
- Supported Delta MoR read queries execute natively with zero or near-zero fallback
- Native DV read results match vanilla Spark / Delta semantics
- DELETE path is stable and mostly native
- Gluten/Velox MoR performance is competitive or improved over vanilla Spark on representative workloads
Gluten version
main branch
Description
This issue tracks the native execution gaps in Delta Lake MoR (Merge-on-Read) read and DML paths on the Velox backend, focusing on deletion-vector semantics, fallback reduction, and performance.
The goal is to move Delta MoR in Gluten from partial prototype status to stable native execution.
This tracker is intended to organize the work and align with Delta Lake’s MoR design, deletion vector protocol, and Gluten’s lakehouse integration.
Related issues:
Related context:
Current status
Current PoC work has already made progress in the following areas:
Scope
This issue tracks Delta MoR work in the Velox backend, including:
Out of scope:
Priority
P0
P1
(e.g. Delta helper queries, JSON/log handling, histogram aggregation)
P2
Work areas
1. MoR read path
2. MoR write path
3. Delta protocol alignment
u/p/iDV descriptors(path, deletionVector.uniqueId)4. Performance
5. Testing and validation
Success criteria
Gluten version
main branch