[FEATURE] Internal Pydantic schemas for ExpectationValidationResult.result (validation-result-schemas) by joshua-stauffer · Pull Request #11869 · great-expectations/great_expectations

joshua-stauffer · 2026-05-07T18:24:47Z

Summary

This PR adds an internal typed layer over ExpectationValidationResult.result, providing:

Pydantic v1 schema families (MapResult, AggregateResult, per-expectation overrides) covering the (ResultFormat × engine × core_expectation) divergence space for all ~61 core expectations
Additive EVR.as_typed(engine_hint=None) accessor that parses the result dict into the matching schema variant without mutating anything
Deterministic matrix runner (test_validation_result_schemas_matrix.py) that walks every (core_expectation × engine × ResultFormat) cell against canonical fixtures, asserts conformance, and emits a structured JSON findings artifact
CI artifact upload via actions/upload-artifact@v4 so findings from all backends are retrievable by gh run download
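
The accessor idea can be sketched roughly as follows. This is only an illustration: it uses stdlib dataclasses in place of the PR's Pydantic v1 models so that it runs without dependencies, and the names `MapResultSketch`, `AggregateResultSketch`, `FAMILY`, `ParseErrorSketch`, and `as_typed_sketch` are hypothetical, as is the choice of fields shown.

```python
from dataclasses import dataclass
from typing import Any, Dict


@dataclass(frozen=True)
class MapResultSketch:
    # Hypothetical subset of map-style result fields.
    element_count: int
    unexpected_count: int


@dataclass(frozen=True)
class AggregateResultSketch:
    # Hypothetical subset of aggregate-style result fields.
    observed_value: Any


# Hypothetical family table: expectation type -> schema family.
FAMILY = {
    "expect_column_values_to_not_be_null": MapResultSketch,
    "expect_column_mean_to_be_between": AggregateResultSketch,
}


class ParseErrorSketch(ValueError):
    """Raised when a result dict does not match its schema family."""


def as_typed_sketch(expectation_type: str, result: Dict[str, Any]):
    """Return a typed view of `result` without modifying the dict."""
    schema = FAMILY[expectation_type]
    fields = schema.__dataclass_fields__
    missing = [name for name in fields if name not in result]
    if missing:
        raise ParseErrorSketch(f"{expectation_type}: missing {missing}")
    # Only read from the input; the typed object is built from selected keys.
    return schema(**{name: result[name] for name in fields})
```

Because the function only reads from `result`, callers that never opt in see the same dict as before.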

This is internal-only work on v1. No @public_api symbols added, no marshmallow change, no serialization change, no great_expectations/__init__.py re-export. Existing consumers of .result are unaffected.

Non-breaking guarantees

EVR.result continues to be Dict[str, Any] — no change for callers that don't opt in to as_typed
to_json_dict() output is byte-identical before and after calling as_typed()
ExpectationValidationResultSchema (marshmallow) is untouched
No new top-level dependencies
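
One way to check the non-mutation guarantee is to serialize the result before and after parsing and compare the bytes. The helper below is a sketch under that assumption; `assert_parse_is_non_mutating` and its `parse` argument are hypothetical stand-ins, not the PR's test code.

```python
import copy
import json
from typing import Any, Callable, Dict


def assert_parse_is_non_mutating(
    result: Dict[str, Any], parse: Callable[[Dict[str, Any]], Any]
) -> None:
    """Fail if calling `parse` changes `result` or its serialized form."""
    before = json.dumps(result, sort_keys=True).encode("utf-8")
    snapshot = copy.deepcopy(result)
    parse(result)  # the typed value is discarded; only side effects matter here
    after = json.dumps(result, sort_keys=True).encode("utf-8")
    assert before == after, "serialized output changed after parsing"
    assert result == snapshot, "result dict was mutated by parsing"
```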

Matrix runner findings

The pandas-only run (440 cells: 61 cases × 4 ResultFormats × 2 pandas datasources) passes with 0 failures after schema gap fixes. The full matrix (2880 cells) runs on all 12 configured backends; cloud-credentialed backends are skipped locally but CI shards cover them.

Known divergences surfaced by the matrix (queued for v2 reconciliation):

expect_column_distinct_values_to_be_in_set / expect_column_distinct_values_to_equal_set: classified as aggregate but emit map-style fields (unexpected_count, partial_unexpected_list, etc.)
expect_column_values_to_be_of_type / expect_column_values_to_be_in_type_list: classified as map but only return observed_value on SQL/Spark

These 4 expectations represent intentional cross-engine divergences that v2 will reconcile. They are logged in the findings JSON as status=failed entries — exactly the intended output of this spec.
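
A curator could group the failed entries along these lines. The sketch assumes each finding is a dict with the keys used by the matrix runner (`expectation_type`, `engine`, `status`); the `summarize_findings` helper and any status value other than `failed` are assumptions for illustration.

```python
from collections import Counter, defaultdict
from typing import Any, Dict, Iterable, List


def summarize_findings(findings: Iterable[Dict[str, Any]]) -> Dict[str, Any]:
    """Count findings per status and list the engines that failed per expectation."""
    findings = list(findings)
    by_status = Counter(f["status"] for f in findings)
    failed: Dict[str, List[str]] = defaultdict(list)
    for f in findings:
        if f["status"] == "failed":
            failed[f["expectation_type"]].append(f["engine"])
    return {
        "by_status": dict(by_status),
        # Sorted and de-duplicated so repeated runs give the same summary.
        "failed_engines": {k: sorted(set(v)) for k, v in sorted(failed.items())},
    }
```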

Findings artifact

Each CI shard uploads its findings to validation-result-schemas-findings (via .github/actions/upload-validation-result-schemas-findings/action.yml). The curator retrieves them with:

gh run download <run-id> -R great-expectations/great_expectations \
  -n validation-result-schemas-findings \
  -D artifacts/validation_result_schemas/findings/<run-id>/

See docs/spec-v2/validation-result-schemas-handoff-template.md in the gx_maintainer repo for the full curation workflow.
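
After downloading, the findings could be collected with something like the sketch below. The file layout is an assumption: it treats every `*.json` file under the download directory as either one finding object or a list of them, and `load_findings` is a hypothetical helper rather than part of the curation workflow.

```python
import json
from pathlib import Path
from typing import Any, Dict, List


def load_findings(root: Path) -> List[Dict[str, Any]]:
    """Collect findings from every *.json file under `root`."""
    findings: List[Dict[str, Any]] = []
    for path in sorted(root.rglob("*.json")):  # sorted for a stable order
        payload = json.loads(path.read_text(encoding="utf-8"))
        # Accept either a single object or a list of objects (assumed layout).
        findings.extend(payload if isinstance(payload, list) else [payload])
    return findings
```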

Codecov Report

❌ Patch coverage is 98.75519% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 81.20%. Comparing base (b77da1f) to head (f1f8847).
✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
...core/validation_result_schemas/findings_emitter.py	95.12%	2 Missing ⚠️
...expectations/core/expectation_validation_result.py	90.90%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop   #11869      +/-   ##
===========================================
- Coverage    84.79%   81.20%   -3.60%     
===========================================
  Files          471      481      +10     
  Lines        39171    39412     +241     
===========================================
- Hits         33217    32005    -1212     
- Misses        5954     7407    +1453

Flag	Coverage Δ
3.10	`73.97% <98.75%> (+0.40%)`	⬆️
3.10 athena	`?`
3.10 aws_deps	`?`
3.10 big	`?`
3.10 clickhouse	`?`
3.10 filesystem	`?`
3.10 mysql	`?`
3.10 openpyxl or pyarrow or project or sqlite or aws_creds	`?`
3.10 postgresql	`?`
3.10 singlestore	`?`
3.10 spark_connect	`?`
3.10 sql_server	`?`
3.11	`74.01% <98.75%> (+0.40%)`	⬆️
3.11 athena	`?`
3.11 aws_deps	`?`
3.11 big	`?`
3.11 clickhouse	`?`
3.11 filesystem	`?`
3.11 mysql	`?`
3.11 openpyxl or pyarrow or project or sqlite or aws_creds	`?`
3.11 postgresql	`?`
3.11 singlestore	`?`
3.11 spark_connect	`?`
3.11 sql_server	`?`
3.12	`74.02% <98.75%> (+0.40%)`	⬆️
3.12 athena	`?`
3.12 aws_deps	`?`
3.12 big	`?`
3.12 filesystem	`?`
3.12 mysql	`?`
3.12 openpyxl or pyarrow or project or sqlite or aws_creds	`?`
3.12 postgresql	`?`
3.12 singlestore	`?`
3.12 spark_connect	`?`
3.12 sql_server	`?`
3.13	`74.02% <98.75%> (+0.40%)`	⬆️
3.13 athena	`42.02% <66.39%> (+0.16%)`	⬆️
3.13 aws_deps	`45.26% <66.39%> (+0.14%)`	⬆️
3.13 big	`55.27% <66.39%> (+0.07%)`	⬆️
3.13 bigquery	`?`
3.13 clickhouse	`42.03% <66.39%> (+0.16%)`	⬆️
3.13 databricks	`?`
3.13 filesystem	`64.65% <84.23%> (+0.35%)`	⬆️
3.13 gx-redshift	`?`
3.13 mysql	`52.10% <85.89%> (+0.37%)`	⬆️
3.13 openpyxl or pyarrow or project or sqlite or aws_creds	`60.11% <85.89%> (+0.20%)`	⬆️
3.13 postgresql	`?`
3.13 singlestore	`47.15% <66.39%> (+0.13%)`	⬆️
3.13 snowflake 1/3	`?`
3.13 snowflake 2/3	`?`
3.13 snowflake 3/3	`?`
3.13 spark	`?`
3.13 spark_connect	`46.91% <66.39%> (+0.13%)`	⬆️
3.13 sql_server	`53.53% <84.23%> (+0.37%)`	⬆️
3.13 trino	`48.78% <66.39%> (+0.12%)`	⬆️
cloud	`0.00% <0.00%> (ø)`
docs-basic	`?`
docs-creds-needed	`?`
docs-spark	`?`


Copilot

Pull request overview

This PR introduces an internal typed “schema layer” over ExpectationValidationResult.result using Pydantic v1 models, plus a dispatcher-based ExpectationValidationResult.as_typed() accessor and a matrix-style integration runner that emits a structured findings JSON artifact (uploaded in CI).

Changes:

Add internal schema families for map-style and aggregate-style result payloads, plus per-expectation overrides and a dispatcher to select the correct variant.
Add ExpectationValidationResult.as_typed(engine_hint=None) to parse (without mutating) .result into the appropriate typed model.
Add unit/integration tests and a CI artifact upload step for matrix-run findings.

Reviewed changes

Copilot reviewed 28 out of 32 changed files in this pull request and generated 7 comments.

Show a summary per file

File	Description
tests/unit/core/validation_result_schemas/test_schemas_overrides.py	Unit tests for per-expectation schema override behavior.
tests/unit/core/validation_result_schemas/test_schemas_map.py	Unit tests for map-family schema variants and validators.
tests/unit/core/validation_result_schemas/test_schemas_aggregate.py	Unit tests for aggregate-family schema variants and observed_value/details shapes.
tests/unit/core/validation_result_schemas/test_runner_helpers.py	Unit tests for matrix-runner helper utilities (coverage assertion, summarization, engine normalization).
tests/unit/core/validation_result_schemas/test_format_config.py	Unit tests for internal ResultFormatConfig TypedDict expectations around `parse_result_format()`.
tests/unit/core/validation_result_schemas/test_findings_emitter.py	Unit tests for findings JSON emission (determinism, env var, atomic writes).
tests/unit/core/validation_result_schemas/test_field_validators.py	Unit tests for shared validator functions (runtime type classification, root validator behavior).
tests/unit/core/validation_result_schemas/test_dispatcher.py	Unit tests validating dispatcher routing (family, formats, overrides, ParseError behavior).
tests/unit/core/validation_result_schemas/test_cases_table.py	Unit tests asserting EXPECTATION_CASES integrity vs core expectations.
tests/unit/core/validation_result_schemas/test_as_typed.py	Unit tests for EVR.as_typed() correctness and non-mutation guarantees.
tests/unit/core/validation_result_schemas/__init__.py	Package init to support the new unit-test module layout.
tests/unit/core/__init__.py	Package init to support unit-test module imports.
tests/unit/__init__.py	Package init to support unit-test module imports.
tests/integration/data_sources_and_expectations/expectations/test_validation_result_schemas_matrix.py	Integration matrix runner that validates schema parsing across expectations/engines/formats and writes findings.
tests/integration/data_sources_and_expectations/expectations/_validation_result_schemas_helpers.py	Shared helper functions for the matrix runner (engine normalization, coverage checks, summarization).
tests/integration/data_sources_and_expectations/expectations/_validation_result_schemas_cases.py	Canonical case table (one entry per core expectation) feeding the matrix runner.
tests/conftest.py	Adds `--vrs-run-id` CLI option for naming findings output.
pyproject.toml	Registers `no_xdist` marker and updates mypy excludes for selected new tests.
great_expectations/core/validation_result_schemas/types.py	Adds internal enums and TypedDicts for findings metadata/types.
great_expectations/core/validation_result_schemas/schemas/per_expectation_overrides.py	Adds override schema(s) for engine-specific divergences.
great_expectations/core/validation_result_schemas/schemas/map_result.py	Adds map-family Pydantic models and validator wiring.
great_expectations/core/validation_result_schemas/schemas/aggregate_result.py	Adds aggregate-family Pydantic models.
great_expectations/core/validation_result_schemas/schemas/__init__.py	Re-exports schema models for internal consumption.
great_expectations/core/validation_result_schemas/format_config.py	Adds ResultFormatConfig TypedDict used by internal dispatch logic.
great_expectations/core/validation_result_schemas/findings_emitter.py	Adds deterministic, atomic JSON findings writer with env/dir resolution.
great_expectations/core/validation_result_schemas/field_validators.py	Adds reusable validator helpers shared across schema families.
great_expectations/core/validation_result_schemas/dispatcher.py	Adds `as_typed()` dispatcher, family table, override table, and ParseError wrapping.
great_expectations/core/validation_result_schemas/__init__.py	Re-exports dispatcher entrypoints for the internal package.
great_expectations/core/expectation_validation_result.py	Adds `ExpectationValidationResult.as_typed(engine_hint=None)`.
.gitignore	Ignores `tests/_artifacts/` findings output directory.
.github/workflows/ci.yml	Uploads findings artifact in CI jobs (always()).
.github/actions/upload-validation-result-schemas-findings/action.yml	Composite action to upload `tests/_artifacts/validation_result_schemas/findings/` as a CI artifact.
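
The "deterministic, atomic" writer described for findings_emitter.py can be approximated with a temp-file-then-rename pattern. The following is a minimal sketch of that general technique, not the PR's FindingsWriter; the function name `write_findings_atomically` and the formatting choices are assumptions.

```python
import json
import os
import tempfile
from pathlib import Path
from typing import Any, Dict, List


def write_findings_atomically(path: Path, findings: List[Dict[str, Any]]) -> None:
    """Write findings as JSON so readers never observe a partial file."""
    path.parent.mkdir(parents=True, exist_ok=True)
    # Deterministic bytes: stable key order and fixed formatting.
    payload = json.dumps(findings, sort_keys=True, indent=2) + "\n"
    # The temp file lives in the target directory so the rename stays on one filesystem.
    fd, tmp_name = tempfile.mkstemp(dir=str(path.parent), suffix=".tmp")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as handle:
            handle.write(payload)
        os.replace(tmp_name, path)  # atomic replacement of the destination
    except BaseException:
        # Remove the temp file if the write or rename did not complete.
        if os.path.exists(tmp_name):
            os.unlink(tmp_name)
        raise
```

Writing to a sibling temp file and renaming it means a crash mid-write leaves the previous findings file intact.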


+      1. environment variable GX_VALIDATION_FINDINGS_DIR if set
+      2. else _DEFAULT_DIR (gitignored in the gx repo)


+    # We can't easily test the true default without writing to the actual filesystem,
+    # so we verify that FindingsWriter resolves to _DEFAULT_DIR by checking
+    # the resolved path stored on the instance.
+    with patch("os.makedirs"):  # prevent actual dir creation


+    def as_typed(self, *, engine_hint: Optional[str] = None):
+        """Return a typed view of self.result without mutating anything.
+
+        Lazy-imports the dispatcher to avoid an import cycle at module load.
+        Reads expectation_type from self.expectation_config.type and ResultFormat
+        from self.expectation_config.kwargs.get('result_format', DEFAULT_RESULT_FORMAT).
+        Returns the parsed model. Raises ParseError on validation failure.
+
+        engine_hint: optional 'pandas' | 'spark' | 'sql'. When supplied, the
+            dispatcher uses it directly. When None, the dispatcher sniffs from the
+            result dict shape.


+cannot be validated; they produce ``status=failed`` findings and the corresponding
+test cells are marked as failures — this is expected and documented here.


+def _generate_run_id() -> str:
+    """Generate a time-stamped run ID when ``--vrs-run-id`` is not supplied."""
+    ts = datetime.datetime.now(datetime.timezone.utc).strftime("%Y-%m-%dT%H-%M-%SZ")
+    suffix = "".join(random.choices(string.ascii_lowercase + string.digits, k=6))
+    return f"{ts}-{suffix}"


+    try:
+        raw_evr = batch_for_datasource.validate(
+            case.expectation,
+            result_format=result_format,  # type: ignore[arg-type]
+        )
+    except Exception as exc:
+        _findings_writer.write_finding(
+            {
+                "expectation_type": expectation_type,
+                "result_format": result_format.value,
+                "engine": engine_hint,
+                "datasource_test_id": datasource_test_id,
+                "status": Status.FAILED.value,
+                "error_summary": f"batch.validate raised: {type(exc).__name__}: {exc}",
+            }
+        )
+        pytest.fail(
+            f"[{case.id}][{result_format.value}][{engine_hint}]: "
+            f"batch.validate raised {type(exc).__name__}: {exc}"
+        )


+    # values dict during root validation.  ``exclude=True`` is not used here
+    # because pydantic v1's per-field exclude is Config-based; callers that want
+    # to omit this field from .dict() output should call .dict(exclude={"engine_hint"}).


joshua-stauffer added 18 commits May 7, 2026 16:45

chore: gitignore tests/_artifacts/ to keep validation findings out of git

ee0cbde

Satisfies requirement 4.3: the default findings path for validation result schema runs (tests/_artifacts/validation_result_schemas/findings/) must not be committed to the repository as test output.

feat(validation-result-schemas): create package skeleton (task 1.3)

ee6858c

feat(validation-result-schemas): ResultFormatConfig TypedDict + unit tests (task 2.1)

8c8ce6b

feat(validation-result-schemas): Status, RuntimeTypeName, CellCoordinates, Finding types (task 2.2)

e2a3b01

feat(validation-result-schemas): field_validators + unit tests (task 3.1)

be26072

feat(validation-result-schemas): FindingsWriter + unit tests (task 3.2)

24f802e

test(validation-result-schemas): MapResult family unit tests — RED phase (task 4.1)

f14afa4

feat(validation-result-schemas): MapResult family implementation (task 4.1)

ac233e2

feat(validation-result-schemas): AggregateResult family + unit tests (task 4.2)

f0e2f9f

feat(validation-result-schemas): per_expectation_overrides + schemas __init__ re-exports (task 4.3)

a4ba92c

feat(validation-result-schemas): as_typed dispatcher + family_for + unit tests (task 5.1)

847e9f1

feat(validation-result-schemas): EVR.as_typed method + unit tests (task 6.1)

fa17489

feat(validation-result-schemas): EXPECTATION_CASES table + unit tests (task 7.1)

f8560f1

feat(validation-result-schemas): matrix runner helpers + unit tests (task 7.2)

78b0a43

feat(validation-result-schemas): matrix runner pandas slice + dispatcher fix (task 7.3)

994d5af

feat(validation-result-schemas): expand matrix runner to ALL_DATA_SOURCES (task 8.1)

f6e7cd6

feat(validation-result-schemas): CI artifact upload for findings (task 9.1)

3e6ec4e

fix(validation-result-schemas): schema gap fixes from matrix run — unexpected_count, observed_value fields (task 11.1)

68a88e6

[pre-commit.ci] auto fixes from pre-commit.com hooks

23a5e45

for more information, see https://pre-commit.ci

fix(validation-result-schemas): remove unused type: ignore in emitter; add test files to mypy exclude

f1f8847

joshua-stauffer marked this pull request as ready for review May 7, 2026 19:03

Copilot AI review requested due to automatic review settings May 7, 2026 19:03

Copilot started reviewing on behalf of joshua-stauffer May 7, 2026 19:04

Copilot AI reviewed May 7, 2026

joshua-stauffer wants to merge 20 commits into develop from f/v2/validation-result-schemas

netlify Bot commented May 7, 2026 (edited)

✅ Deploy Preview for niobium-lead-7998 canceled.
