Add failure persistence support for regression testing #215

BowTiedRadone · 2025-12-31T19:14:00Z

This PR implements failure persistence and regression testing for Rendezvous. When tests fail, the failing seeds and configuration used are now automatically saved to .rendezvous-regressions/, allowing users to replay them on subsequent runs to prevent regressions.

Key additions

New --regr CLI flag to control test execution: if present, run only the regression tests
Persistent failure storage grouped by test type (invariant/property) with automatic replay
Failures sorted by timestamp for chronological tracking

Sample failure file
.rendezvous-regressions/ST1PQHQKV0RJXZFY1DGX8MNSNYVE3VGZJSRTPGZGM.counter.json:

{
  "invariant": [
    {
      "seed": 901717247,
      "dial": "example/sip010.cjs",
      "numRuns": 15,
      "timestamp": 1767886534833
    },
    {
      "seed": -1374686468,
      "numRuns": 9,
      "timestamp": 1767886531457
    },
    {
      "seed": 1298457354,
      "dial": "example/sip010.cjs",
      "numRuns": 20,
      "timestamp": 1767883389583
    }
  ],
  "test": [
    {
      "seed": 1656313995,
      "numRuns": 6,
      "timestamp": 1767886553125
    },
    {
      "seed": 64830639,
      "numRuns": 11,
      "timestamp": 1767886546477
    },
    {
      "seed": 1593583466,
      "numRuns": 3,
      "timestamp": 1767886542907
    }
  ]
}
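
For readers who want to consume this file programmatically, the layout above can be sketched in TypeScript. The `FailureRecord` name appears in this PR's commit list, but its fields here are inferred from the sample only, and `FailureStore` and `newestFirst` are hypothetical names used for illustration rather than the actual Rendezvous code.

```typescript
// Hypothetical shape of one persisted failure, inferred from the sample file.
interface FailureRecord {
  seed: number;
  dial?: string; // only some entries in the sample carry a dial path
  numRuns: number;
  timestamp: number; // milliseconds since the Unix epoch
}

// Hypothetical top-level layout: one list of failures per test type.
interface FailureStore {
  invariant: FailureRecord[];
  test: FailureRecord[];
}

// Returns a sorted copy, newest first, which is the order seen in the sample.
function newestFirst(records: FailureRecord[]): FailureRecord[] {
  return [...records].sort((a, b) => b.timestamp - a.timestamp);
}

const sample: FailureStore = JSON.parse(`{
  "invariant": [
    { "seed": 1298457354, "dial": "example/sip010.cjs", "numRuns": 20, "timestamp": 1767883389583 },
    { "seed": 901717247, "dial": "example/sip010.cjs", "numRuns": 15, "timestamp": 1767886534833 }
  ],
  "test": []
}`);

// Print the invariant seeds in replay order (newest failure first).
console.log(newestFirst(sample.invariant).map((r) => r.seed));
```

Sorting a copy rather than the stored array keeps the on-disk order untouched until the file is explicitly rewritten.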

This brings Rendezvous closer to production-grade fuzzing by ensuring discovered bugs stay caught, while improving internal code organization and maintainability.

Closes #130.

BowTiedRadone · 2026-01-08T16:01:08Z

Since --mode is now binary and --mode=new has no effect, a --regr option would be more suitable. I will update the PR accordingly.

app.tests.ts

moodmosaic · 2026-01-12T14:28:15Z

property.ts

+        // If the number of runs that failed is less than 100, set it to the
+        // default value of 100. If more runs were needed to reproduce the
+        // failure, use the number of runs that failed.
+        runs: regression.numRuns < 100 ? 100 : regression.numRuns,


If a failure occurred at run 50, why force 100 runs? This could mask issues where failures only occur early in the sequence.

This is an extra safety mechanism. In practice, a historical failure might have occurred after only 5 runs, but the same seed can be considered passing if, for a given user configuration (seed, dial, etc.), it passes the default number of runs used by Rendezvous (100). That said, you raise a great point: this behavior should be documented more explicitly.
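
To make the rule concrete, the clamp from the diff can be restated as a small helper. This is only a sketch: `replayRuns` and `DEFAULT_RUNS` are names invented here, and the value 100 is taken from the code comment in the diff.

```typescript
// Default number of runs, as stated in the comment in the diff.
const DEFAULT_RUNS = 100;

// Hypothetical helper: a regression is never replayed with fewer runs than
// the default, but keeps its recorded count when more runs were needed.
function replayRuns(recordedNumRuns: number): number {
  return Math.max(recordedNumRuns, DEFAULT_RUNS);
}

console.log(replayRuns(5)); // a failure recorded after 5 runs is replayed with 100
console.log(replayRuns(250)); // a failure that needed 250 runs keeps 250
```

`Math.max` is equivalent to the ternary in the diff (`numRuns < 100 ? 100 : numRuns`).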

Let me know what you think, but please also keep in mind that only unique seeds are stored per test type (invariant/test). If a new failure happens for the same seed but a different number of runs, it won't be persisted.
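
The uniqueness rule can be illustrated with a short sketch. It assumes an in-memory store keyed by test type; `recordFailure`, `FailureStore`, and the field layout are hypothetical names for illustration, not the persistence module from this PR.

```typescript
type TestType = "invariant" | "test";

// Hypothetical record shape, mirroring the fields in the sample failure file.
interface FailureRecord {
  seed: number;
  dial?: string;
  numRuns: number;
  timestamp: number;
}

type FailureStore = Record<TestType, FailureRecord[]>;

// Adds a failure unless the same seed is already stored for that test type.
// Returns true when the record was added and false when it was skipped.
function recordFailure(
  store: FailureStore,
  type: TestType,
  record: FailureRecord
): boolean {
  if (store[type].some((existing) => existing.seed === record.seed)) {
    return false; // same seed already persisted; the earlier entry is kept
  }
  store[type].push(record);
  return true;
}

const store: FailureStore = { invariant: [], test: [] };
recordFailure(store, "test", { seed: 64830639, numRuns: 11, timestamp: 1767886546477 });
// Same seed with a different numRuns: skipped.
recordFailure(store, "test", { seed: 64830639, numRuns: 3, timestamp: 1767886546999 });
// Same seed under the other test type: stored, since uniqueness is per type.
recordFailure(store, "invariant", { seed: 64830639, numRuns: 3, timestamp: 1767886547000 });

console.log(store.test.length, store.invariant.length);
```

Under this rule the stored `numRuns` is whatever the first failure for a seed recorded, which is one reason the replay uses at least the default number of runs.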

BowTiedRadone added 13 commits December 31, 2025 20:54

Fix Clarity error reporting and update reporter for consistency

e0fdb18

Persist failed seeds for regression testing

7b01545

Make property testing routine asynchronous

21a8bed

Add new mode option for regression testing

b3ebaa4

Update help message to include new option and refine it

85bfd40

Make jest runner silent to suppress misleading test stdout/stderr

bd31ed8

Update regression folder name

d77adba

Make TestMode binary

acdc168

Update .gitignore

b82b3d5

Implement regression checking inside property testing routine

8235153

Improve logging

0105a3e

Update command-line args handling tests

10268f9

Update property to use loadFailures and create persistence testfile

0fa8979

BowTiedRadone force-pushed the feat/failure-persistence branch from 97dbfad to 0fa8979 Compare January 5, 2026 19:10

BowTiedRadone added 2 commits January 6, 2026 23:56

Make failure persistence module synchronous

bf4105b

Add property-based tests for the failure persistence module

ea642ea

BowTiedRadone force-pushed the feat/failure-persistence branch from 65ce54d to ea642ea Compare January 6, 2026 21:57

BowTiedRadone added 7 commits January 7, 2026 00:13

Include counterexample numRuns in the regression object

21872e2

Add dial field to FailureRecord interface

40d0e42

Use switch conditional in checkProperties and improve regression logging

80c2655

PoLA and maintainability updates in property.ts

6f28f9c

Prepare invariant.ts for failure persistence similarly side-by-side with `property.ts`

07de377

Enable failure persistence for invariant and add command-line options parsing tests

97b2ff3

Add tests for numRuns and dial persistence fields

2b035c5

BowTiedRadone added 2 commits January 8, 2026 20:49

Update --mode <string> to --regr

ad77d7f

Add regression testing docs

fb1659f

BowTiedRadone changed the title from "[DRAFT] Add failure persistence support for regression testing" to "Add failure persistence support for regression testing" Jan 8, 2026

BowTiedRadone marked this pull request as ready for review January 8, 2026 18:57

BowTiedRadone requested a review from a team as a code owner January 8, 2026 18:57

wileyj reviewed Jan 8, 2026

app.tests.ts (outdated, resolved)

Extract log divider to a constant in shared.ts

6218b3c

BowTiedRadone requested review from moodmosaic and wileyj January 12, 2026 12:15

Remove unnecessary .gitignore line

21d069c

moodmosaic reviewed Jan 12, 2026

BowTiedRadone requested a review from moodmosaic January 13, 2026 15:48
