Migrate from Spark's typesystem to Substrait's typesystem (SparkSchema, SparkPrimitive).  This will likely require providing a translation layer for serialized types.

## Goals

#### Phase 0: Rebase off of Substrait-Scala
- [ ] Rip out Mimir for the time being
- [ ] Migrate to substrait-scala

#### Phase 1a: ExecutionContext
- [ ] Modify `ExecutionContext` to support the creation of Substrait based artifacts.  Substrait's standard protobuf-based encoder should work for storage, and we can use a new MIME type to distinguish Substrait-based Datasets.  
- [ ] Modify `ExecutionContext` and `Artifact` to allow spark dataframe methods to work with Substrait-based plans.
- [ ] Modify `ExecutionContext` to allow DataframeConstructor-based artifacts to be retrieved as Substrait plans

#### Phase 1b: QueryExecutor
- [ ] Add a new `QueryExecutor` trait / object that accepts a Substrait plan and executes it, producing results in some standard format (Array of Row?)

#### Phase 2: Migration
- [ ] Rewrite existing Vizier Commands to use substrait-based `ExecutionContext` operations
- [ ] Rewrite the Vizier spreadsheet to be based on Substrait
- [ ] Rewrite any remaining Vizier code to replace SparkPrimitive/SparkSchema references with the corresponding Substrait types

#### Phase 3: Extract Logic to Plugins
- [ ] Factor the Spark-specific code out into a plugin
- [ ] Update `ExecutionContext`, `Artifact`, and any other code to remove all Spark-specific operations
- [ ] Add a default executor based on SQLite (or DuckDB?)
- [ ] Factor the Mimir-specific code out into a plugin

## Visualizations

<img width="846" height="777" alt="Image" src="https://github.com/user-attachments/assets/64cf4512-1927-4bd3-bea4-565377ce16f0" />

<img width="1271" height="1078" alt="Image" src="https://github.com/user-attachments/assets/1339051a-4963-463a-ab2e-467f4f1288d3" />



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Migrate from Spark's typesystem to Substrait's typesystem (SparkSchema, SparkPrimitive). This will likely require providing a translation layer for serialized types. #325

Goals

Phase 0: Rebase off of Substrait-Scala

Phase 1a: ExecutionContext

Phase 1b: QueryExecutor

Phase 2: Migration

Phase 3: Extract Logic to Plugins

Visualizations

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Migrate from Spark's typesystem to Substrait's typesystem (SparkSchema, SparkPrimitive). This will likely require providing a translation layer for serialized types. #325

Description

Goals

Phase 0: Rebase off of Substrait-Scala

Phase 1a: ExecutionContext

Phase 1b: QueryExecutor

Phase 2: Migration

Phase 3: Extract Logic to Plugins

Visualizations

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions