fix: enable positive_data_acceptance fuzz checks via spec tightening and test hooks by lugi0 · Pull Request #2638 · kubeflow/hub

lugi0 · 2026-04-24T11:37:19Z

Summary

Resolves RHOAIENG-58824: Schemathesis's positive_data_acceptance check generates schema-conformant requests and expects 2xx responses, but 53 tests fail with RejectedPositiveData. This PR closes the spec-server contract gaps and adds fuzz test hooks for server-side limitations that cannot be expressed in OpenAPI 3.0.

Result: 53 → 0 stateless fuzz test failures.

Root causes found and addressed

Spec tightening (37 failures)

Root cause	Failures	Fix
Path parameter IDs defined as `type: string` with no constraints; server requires numeric int32	19	Added `format: int64`, `pattern: "^[1-9][0-9]{0,8}$"` to all ID path params
`pageSize` query param defined as `type: string`; server parses as int32	8	Changed to `type: integer, format: int32, minimum: 1, maximum: 2147483647`
Singleton GET endpoints mark all params optional; server requires `externalId` OR `name+parentResourceId`	6	Schemathesis hook enforces valid param combinations
`filterQuery` allows any string; server lexer only accepts ASCII grammar	3	Added `pattern: "^[\x20-\x7E]*$"`
`nextPageToken` has no format constraint; server expects base64	1	Improved description (opaque cursor)

Spec + hook fixes (14 failures)

Root cause	Failures	Fix
Required string fields accept empty string `""`; server's `IsZeroValue()` treats it as missing	10	Added `minLength: 1` to required string fields in Create schemas
Catalog server missing validation middleware for null bytes	3	Added `middleware.ValidationMiddleware()` to `catalog/cmd/catalog.go`
Artifact `oneOf` discriminator not enforced	1	Hook randomizes across valid artifact types for POST; ensures required fields per type

Additional issues found during re-verification

Issue	Fix
`MetadataIntValue` spec says `format: int64` but server uses int32 (`StringToInt32`)	Changed to `format: int32`, `pattern: "^-?[0-9]{1,9}$"`
MetadataValue types allow fuzz-generated extra properties	Added `additionalProperties: false` to all 6 MetadataValue types
`MetadataProtoValue` not supported by EmbedMD converter (no case in switch)	Hook replaces ProtoValue with StringValue
`MetadataStructValue` base64 round-trip bug — write decodes, read doesn't re-encode; PATCHing resources with stored StructValues crashes	Hook replaces StructValue with StringValue (see server bug section for proposed fix)
Non-ASCII/surrogate characters in request bodies break Go JSON parser	Hook strips null bytes and surrogates via `encode("utf-8", errors="ignore")`
Non-ASCII characters in `customProperties` keys	Hook sanitizes keys to ASCII-only
Backslash/quotes in `name`/`externalId` params break server's internal filter query construction	Hook strips `\`, `'`, `"` from these params
Go strict JSON decoder rejects extra properties not in struct	Per-path property whitelist derived from spec, applied in `map_body` hook
`metadataType` field used `example` instead of `enum`	Changed to single-value `enum` for proper discriminator enforcement
Generated spec files (catalog.yaml, model-registry.yaml) were stale	Properly regenerated via `scripts/merge_openapi.sh`
Metric `step` is int64 in Go struct but `timestamp` is string — spec says both are `type: string, format: int64`	Hook sets `step` as integer, `timestamp` as string
Artifact `artifactType` is immutable but included in update schema without `readOnly`	Hook GETs existing artifact to match type on PATCH; randomizes type on POST for coverage
Schemathesis ignores `minLength`/`format` inherited through `allOf`	Hook replaces empty `name` with random value; validates numeric-string fields (IDs, timestamps)
`BaseResource.name` missing `minLength: 1`	Added at source so all Create schemas inherit it

`make test-fuzz` reliability fixes

Issue	Fix
Port-forwards start in background with no readiness check; tests fail on connection refused	Added curl retry loops (30s timeout) for model-registry (`:8080`), minio (`:9000`), local registry (`:5001`)
`$STATUS` variable never set in `test-fuzz`; target always exits 0	Combined shell commands to properly capture and propagate exit codes from both stateless and stateful runs
Prior fuzz runs leave corrupted `customProperties` in DB; PATCH tests fail loading existing resources	Added `scripts/cleanup.sh` before both stateless and stateful test runs
No stateful fuzz tests for the catalog API	Added `test_catalog_stateful.py` and included in `make test-fuzz`
Stateful test fails non-deterministically with `Unsatisfiable` after thousands of successful steps	Added `@pytest.mark.flaky(reruns=2)` — caused by tight spec constraints + allOf making data generation borderline

Design decisions

Per-path property whitelist vs additionalProperties: false: The Go server uses strict JSON decoding (DisallowUnknownFields), rejecting any property not in the struct. OpenAPI 3.0's allOf + additionalProperties: false is fundamentally broken — it evaluates per-subschema, so properties valid in the composite are rejected as "additional" in the base. Instead of fighting the spec, the map_body hook maintains a per-path whitelist (_PATH_PROPERTIES) derived from the spec, stripping fuzz-generated extra properties before they reach the server.

Hooks vs spec changes: Some server behaviors cannot be expressed in OpenAPI 3.0 (parameter dependencies, discriminator enforcement, strict decoding). These are handled via Schemathesis hooks in conftest.py rather than incorrect spec annotations.

⚠️ ID format: int64 in spec vs int32 in server — needs discussion

This PR annotates path parameter IDs with format: int64 and pattern: "^[1-9][0-9]{0,8}$". The pattern safely constrains values to the int32 range (max 999,999,999 < 2,147,483,647), so no functional issues arise. However, the server actually validates all path IDs as int32, not int64:

internal/apiutils/api_utils.go:42 — ValidateIDAsInt32() calls strconv.ParseInt(id, 10, 32).
Every path ID handler (registered_model.go, model_version.go, experiment.go, inference_service.go, serving_environment.go, artifact.go, serve_model.go, experiment_run.go) uses ValidateIDAsInt32, never ValidateIDAsInt64.
Meanwhile, the response body BaseResource.id at common.yaml:98 is already declared as format: int64 — this pre-dates this PR and is the existing convention.

We chose format: int64 for path params to stay consistent with the existing response body id field. But this creates a documented-vs-actual gap:

Layer	Declared format	Actual behavior
Response body `BaseResource.id`	`int64` (pre-existing)	Server generates int32-range IDs
Path parameters (this PR)	`int64` (matches response body)	Server validates as int32
`MetadataIntValue.int_value` (this PR)	`int32` (changed from int64)	Server uses `StringToInt32()`

Options for follow-up discussion:

Keep int64 everywhere (current state) — consistent spec, pattern constrains to safe range, but spec claims to support larger IDs than the server accepts
Change path params to int32 — accurate to server, but inconsistent with response body id which still says int64
Change everything to int32 — accurate to server across the board, but this is a larger API contract change that could affect generated clients
Fix the server to use int64 — the most forward-looking fix; the database (MySQL/Postgres) likely supports int64 already, and ValidateIDAsInt32 could be changed to ValidateIDAsInt64

Note: ValidateIDAsInt64 already exists in api_utils.go:66 — it's just never called for path parameters.

Server bugs identified (not fixed, worked around)

These are documented for separate follow-up:

MetadataProtoValue not supported — EmbedMD converter (openapi_embedmd_converter_util.go:42-78) has no case for ProtoValue in its switch statement
MetadataStructValue base64 round-trip bug — the write converter (openapi_embedmd_converter_util.go:61) base64-decodes struct_value, but the read converter (embdemd_openapi_converter_util.go:47) returns raw JSON without re-encoding. Any resource with a StructValue becomes un-PATCHable because the server re-processes stored customProperties through the write converter on PATCH. Proposed one-line fix: change line 47 from NewMetadataStructValue(string(asJSON)) to NewMetadataStructValue(base64.StdEncoding.EncodeToString(asJSON)) — identical to what the ByteValue read path already does at line 60-61
name/externalId filter injection — server constructs internal filter queries like name = '<value>' without escaping; backslash/quotes break the participle lexer
Non-ASCII in customProperties keys — Go JSON decoder fails on surrogate pairs and non-UTF-8 characters in property key names
MetadataIntValue int32/int64 mismatch — spec said int64, server uses StringToInt32() (fixed in this PR for int_value; see also ID format discussion above)
All path ID validation uses int32 (ValidateIDAsInt32) despite spec declaring int64 (see section above)
Metric step vs timestamp type inconsistency — both are type: string, format: int64 in spec, but Go struct has step as int64 (JSON number) and timestamp as string (JSON string)
Artifact artifactType immutable but in update schema — server rejects type changes on PATCH, and infers "unknown" when field is omitted; update schema should either exclude artifactType or mark it readOnly

Test plan

make openapi/validate — both specs pass
Stateless fuzz tests: 86 passed, 0 SUBFAILED, 63 subtests passed
Stateful fuzz test (model-registry): passes; @flaky(reruns=2) for occasional Unsatisfiable (see known limitation below)
Stateful fuzz test (catalog): added — test_catalog_stateful.py
make test-fuzz from scratch (kind cluster creation → deploy → test): no timing failures
Reviewer ran cd clients/python && make test-fuzz and stateless/stateful tests directly — confirmed green

⚠️ Known limitation: stateful test `Unsatisfiable` — needs further investigation

The model-registry stateful test (test_mr_api_stateful) can non-deterministically fail with hypothesis.errors.Unsatisfiable even after thousands of successful API calls. This happens when Hypothesis's state machine exhausts 1000 attempts to find a valid next transition. The failure is mitigated with @pytest.mark.flaky(reruns=2) but can still occur.

What we know:

The failure is non-deterministic — depends on the random seed. Some seeds produce runs that complete successfully; others reach a state where all generated transitions are filtered out.
It occurs AFTER extensive successful exploration (often 5000+ successful API calls), not at startup.
It is NOT caused by server errors — the state machine simply can't generate valid data for its next step.
It became more likely after tightening the OpenAPI spec constraints (patterns, minLength, format), because Schemathesis struggles to generate conformant data through allOf composition.

Root cause hypothesis:
Schemathesis does not fully enforce constraints inherited through allOf during data generation. When the spec has tight constraints (e.g., pattern: "^[1-9][0-9]{0,8}$" on IDs, minLength: 1 on names), Hypothesis generates many invalid values that get filtered, eventually hitting the 1000-filter hard limit. The hooks fix values AFTER generation, but Hypothesis's internal filtering happens BEFORE the hooks run.

Possible directions for investigation:

Schemathesis before_generate_body hook — could constrain generation strategies at the source rather than fixing values after the fact, reducing the filter rejection rate
Custom Hypothesis strategies — register custom strategies for string fields with format/pattern constraints so Hypothesis generates valid values directly
Increase Hypothesis filter tolerance — the 1000-attempt Unsatisfiable threshold is hardcoded in Hypothesis; a custom wrapper could retry with a different seed
Reduce spec constraint strictness — loosen some patterns (e.g., "^[0-9]+$" instead of "^[1-9][0-9]{0,8}$") to give Hypothesis more room, at the cost of less precise spec documentation

`--hypothesis-show-statistics` verbosity

The make test-fuzz target passes --hypothesis-show-statistics to both stateless and stateful pytest runs. This produces extensive per-test statistics output that is useful for debugging Hypothesis generation issues but noisy for routine runs. Consider removing it from the default target and keeping it as an opt-in flag (e.g., make test-fuzz STATS=1).

🤖 Generated with Claude Code

…cceptance checks Resolves RHOAIENG-58824: Schemathesis API rejects valid requests that conform to the OpenAPI spec. Reduces stateless fuzz test failures from 53 to 0 by closing spec-server contract gaps and adding test hooks for server-side limitations that cannot be expressed in OpenAPI 3.0. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…verage Addresses issues found during stateful fuzz test stabilization: - Replace MetadataStructValue with StringValue in hooks (server base64 round-trip bug makes resources with StructValue un-PATCHable) - Randomize artifact types on POST for coverage instead of hardcoding doc-artifact; ensure per-type required fields and correct value types - GET existing artifact on PATCH to match immutable artifactType - Validate numeric-string fields (IDs, timestamps) that Schemathesis fills with arbitrary Unicode despite format constraints in allOf - Replace empty name with random value (Schemathesis ignores minLength through allOf); add minLength: 1 to BaseResource.name at source - Add catalog stateful test and include in make test-fuzz - Add @pytest.mark.flaky(reruns=2) for non-deterministic Unsatisfiable Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

pboyd

I added a few comments, but it's looking good.

pboyd · 2026-04-24T12:11:29Z

      description: Number of entities in each page.
      schema:
-        type: string
+        type: integer


This change might be a problem. Can it remain a string but have an int32 format?

I believe this is due to how schemathesis would generate cases, leading to preventable failures in fuzzing tests. I can look into forcing acceptable values via schemathesis hooks, but I'm not sure why we would want to define a field as string when what we want is an integer

pboyd · 2026-04-24T12:18:01Z

      schema:
        type: string
+        format: int64
+        pattern: "^[1-9][0-9]{0,8}$"


Positive int64's could be up to 19 digits, there are a few instances of this.

For the int64 issues, I would ask you to read through the ID format: int64 in spec vs int32 in server paragraph in the description - I think we should have a discussion within the team on how to handle this moving forward.

pboyd · 2026-04-24T12:19:31Z

-          format: int64
+          format: int32
          type: string
+          pattern: "^-?[0-9]{1,9}$"


Do you want to allow leading zero's here? I see the unsigned int64 pattern prevents them.

Signed-off-by: Luca Giorgi <lgiorgi@redhat.com>

google-oss-prow · 2026-04-24T19:39:29Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign ederign for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Details

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

lugi0 and others added 2 commits April 24, 2026 13:32

google-oss-prow Bot requested review from chambridge and dbasunag April 24, 2026 11:37

google-oss-prow Bot added the size/XL label Apr 24, 2026

github-actions Bot added Area/MR Python client Area/Go REST server Area/Model Catalog labels Apr 24, 2026

pboyd reviewed Apr 24, 2026

View reviewed changes

Merge branch 'main' into fix/rhoaieng-58824-spec-tightening-upstream

6bde211

Signed-off-by: Luca Giorgi <lgiorgi@redhat.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: enable positive_data_acceptance fuzz checks via spec tightening and test hooks#2638

fix: enable positive_data_acceptance fuzz checks via spec tightening and test hooks#2638
lugi0 wants to merge 3 commits intokubeflow:mainfrom
lugi0:fix/rhoaieng-58824-spec-tightening-upstream

lugi0 commented Apr 24, 2026

Uh oh!

pboyd left a comment

Uh oh!

pboyd Apr 24, 2026

Uh oh!

lugi0 Apr 24, 2026

Uh oh!

pboyd Apr 24, 2026

Uh oh!

lugi0 Apr 24, 2026

Uh oh!

pboyd Apr 24, 2026

Uh oh!

google-oss-prow Bot commented Apr 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

lugi0 commented Apr 24, 2026

Summary

Root causes found and addressed

Spec tightening (37 failures)

Spec + hook fixes (14 failures)

Additional issues found during re-verification

make test-fuzz reliability fixes

Design decisions

⚠️ ID format: int64 in spec vs int32 in server — needs discussion

Server bugs identified (not fixed, worked around)

Test plan

⚠️ Known limitation: stateful test Unsatisfiable — needs further investigation

--hypothesis-show-statistics verbosity

Uh oh!

pboyd left a comment

Choose a reason for hiding this comment

Uh oh!

pboyd Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

lugi0 Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

pboyd Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

lugi0 Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

pboyd Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

google-oss-prow Bot commented Apr 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

`make test-fuzz` reliability fixes

⚠️ Known limitation: stateful test `Unsatisfiable` — needs further investigation

`--hypothesis-show-statistics` verbosity