
Implement Feistel-based permutation for LHS and convert samplers to stateless RNG#32833

Open
zachmprince wants to merge 19 commits into idaholab:next from zachmprince:new_latin_sampling
Conversation

@zachmprince
Contributor

@zachmprince zachmprince commented Apr 24, 2026

Closes #32775, refs #32194

Reason

Latin Hypercube Sampling (LHS) previously relied on a stateful shuffle() call
inside sampleSetUp() to permute bin assignments. This coupling between the
generator state-machine and the sample-matrix loop made LHS incompatible with
the stateless RNG direction, prevented parallel or out-of-order sample access,
and required save/restore of generator state around every call to
sampleSetUp/sampleTearDown. The same stateful pattern was shared by the
MCMC and active-learning samplers, making the entire Sampler hierarchy harder
to reason about and test.

Design

MooseRandomPerturbation (new framework utility)

A header-only class implementing a keyed pseudo-random permutation of the
integers [0, n) using a balanced Feistel network:

  • The 64-bit seed is split into two 32-bit subkeys; each round mixes the
    half-block with both subkeys and a golden-ratio-derived constant, then applies
    the Murmur3/degski avalanche hash for bit diffusion.
  • Because n need not be a power of two, the network operates on the smallest
    padded domain 2^(2*half_bits) >= n and uses cycle-walking to reject
    out-of-range outputs.
  • The permutation is bijective (every input maps to a unique output) and
    invertible (invert(permute(x)) == x).
  • Verified by five new unit tests in unit/src/MooseRandomPerturbationTest.C
    (bijection, invertibility, bit-width range, seed uniqueness, reproducibility).
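The scheme above can be sketched in Python. This is an illustrative stand-in, not the framework's actual implementation: the round count, subkey mixing, and half-width computation are assumptions, though the avalanche function uses the standard Murmur3 32-bit finalizer constants.

```python
def _avalanche(x: int) -> int:
    """Murmur3-style 32-bit finalizer for bit diffusion."""
    x &= 0xFFFFFFFF
    x ^= x >> 16
    x = (x * 0x85EBCA6B) & 0xFFFFFFFF
    x ^= x >> 13
    x = (x * 0xC2B2AE35) & 0xFFFFFFFF
    x ^= x >> 16
    return x


class FeistelPermutation:
    """Keyed bijection on [0, n) via a balanced Feistel network with cycle-walking."""

    GOLDEN = 0x9E3779B9  # golden-ratio-derived constant
    ROUNDS = 4           # illustrative round count

    def __init__(self, n: int, seed: int):
        self.n = n
        # Split the 64-bit seed into two 32-bit subkeys.
        self.k0 = seed & 0xFFFFFFFF
        self.k1 = (seed >> 32) & 0xFFFFFFFF
        # Smallest balanced padded domain 2^(2*half_bits) >= n.
        self.half_bits = max(1, (max(n - 1, 1).bit_length() + 1) // 2)
        self.mask = (1 << self.half_bits) - 1

    def _round(self, half: int, r: int) -> int:
        # Mix the half-block with both subkeys and the golden-ratio constant.
        key = (self.k1 + r * self.GOLDEN) & 0xFFFFFFFF
        return _avalanche(half ^ self.k0 ^ key) & self.mask

    def _encrypt_once(self, x: int) -> int:
        left, right = x >> self.half_bits, x & self.mask
        for r in range(self.ROUNDS):
            left, right = right, left ^ self._round(right, r)
        return (left << self.half_bits) | right

    def _decrypt_once(self, x: int) -> int:
        left, right = x >> self.half_bits, x & self.mask
        for r in reversed(range(self.ROUNDS)):
            left, right = right ^ self._round(left, r), left
        return (left << self.half_bits) | right

    def permute(self, x: int) -> int:
        # Cycle-walk: re-encrypt until the output falls back inside [0, n).
        y = self._encrypt_once(x)
        while y >= self.n:
            y = self._encrypt_once(y)
        return y

    def invert(self, y: int) -> int:
        # Walk the cycle backwards; undoes permute() exactly.
        x = self._decrypt_once(y)
        while x >= self.n:
            x = self._decrypt_once(x)
        return x
```

Because each Feistel round is its own inverse given the round key, decryption simply replays the rounds in reverse, and cycle-walking preserves the bijection on [0, n) since the padded-domain cipher is itself a permutation.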

Redesigned LatinHypercubeSampler

The sampler now uses two stateless generators:

  • Generator 0 — draws the uniform point inside the selected bin.
  • Generator 1 — seeds one MooseRandomPerturbation permuter per column.

In computeSampleRow(row, col) the bin for sample row in column col is
permuter[col].permute(row), so the full LHS sample matrix is determined
entirely by the two generator seeds. No state needs to be saved, restored, or
advanced around a setup callback. Permuters are initialised once in
executeTearDown() after the generator has been advanced to the correct offset.
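A minimal Python sketch of this structure follows. All names here are illustrative: a seeded Fisher-Yates shuffle stands in for the per-column MooseRandomPerturbation permuter, and seeded `random.Random` instances stand in for the framework's two stateless generators.

```python
import random


def lhs_matrix(nrow: int, ncol: int, seed: int) -> list:
    """Sketch of stateless LHS: every entry is a pure function of the seed."""
    # "Generator 1" role: one bin permutation per column, fixed up front.
    # (A seeded shuffle stands in for the per-column Feistel permuter.)
    permuters = []
    for col in range(ncol):
        perm = list(range(nrow))
        random.Random((seed << 20) | col).shuffle(perm)
        permuters.append(perm)

    # "Generator 0" role: a uniform draw keyed by (row, col), so any entry
    # can be computed independently, in any order, on any rank.
    def compute_sample(row: int, col: int) -> float:
        bin_index = permuters[col][row]  # bin = permuter[col].permute(row)
        u = random.Random((seed << 40) | (row * ncol + col)).random()
        return (bin_index + u) / nrow    # uniform point inside the selected bin

    return [[compute_sample(r, c) for c in range(ncol)] for r in range(nrow)]
```

The key property is that `compute_sample(row, col)` has no side effects, so out-of-order and parallel access need no generator save/restore.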

Sampler base-class cleanup

sampleSetUp() and sampleTearDown() (per-row callbacks that ran inside the
sample-matrix loop and required generator save/restore) have been removed. The
simpler executeSetUp() / executeTearDown() pair (called once before/after
the entire execute()) is sufficient for all remaining use cases.

The CommMethod enum, shuffle() template, and saveGeneratorState() /
restoreGeneratorState() methods have been removed along with them.

MCMC and active-learning samplers

PMCMCBase and every derived class that previously relied on sampleSetUp to
seed proposals now use executeSetUp() instead. The proposeSamples() interface
no longer takes a seed_value argument; three new helper methods (random(),
randomIndex(), randomIndexPair()) encapsulate stateless index-based draws so
each sampler can advance its own _rand_index counter without touching generator
state directly.
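The helper pattern can be illustrated in Python (hypothetical sketch: the names mirror the PR's random(), randomIndex(), and randomIndexPair(), but the real implementation is C++ against the framework's stateless generator, not Python's `random`):

```python
import random


class StatelessProposalRNG:
    """Each draw is keyed by (seed, _rand_index); only the counter advances,
    so no generator state is ever saved or restored."""

    def __init__(self, seed: int):
        self._seed = seed
        self._rand_index = 0

    def _draw(self) -> random.Random:
        rng = random.Random((self._seed << 32) | self._rand_index)
        self._rand_index += 1
        return rng

    def random(self) -> float:
        # Uniform draw in [0, 1).
        return self._draw().random()

    def randomIndex(self, n: int) -> int:
        # Integer in [0, n).
        return self._draw().randrange(n)

    def randomIndexPair(self, n: int) -> tuple:
        # Two distinct indices in [0, n): draw the second from [0, n-1)
        # and shift it past the first to guarantee distinctness.
        i = self.randomIndex(n)
        j = self.randomIndex(n - 1)
        return (i, j) if j < i else (i, j + 1)
```

Two instances with the same seed replay the same sequence of draws, which is what makes the MCMC proposal chain reproducible without touching generator state.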

AdaptiveImportanceSampler had an off-by-one in its sample index that is also
corrected here.

Stateless conversion for remaining samplers

MorrisSampler and NestedMonteCarloSampler were also converted from the
stateful shuffle()/sampleSetUp() pattern to the stateless generator API.

Impact

Framework API changes

| Removed | Replacement |
| --- | --- |
| `Sampler::sampleSetUp(SampleMode)` | `Sampler::executeSetUp()` |
| `Sampler::sampleTearDown(SampleMode)` | `Sampler::executeTearDown()` |
| `Sampler::shuffle(begin, end, generator_index)` | `MooseRandomPerturbation` |
| `Sampler::saveGeneratorState()` / `restoreGeneratorState()` | not needed with stateless RNG |
| `Sampler::CommMethod` enum (`LOCAL`, `SEMI_LOCAL`, `NONE`) | removed |
| `PMCMCBase::proposeSamples(seed_value, ...)` | `proposeSamples(...)` (no seed argument) |

New framework utility: framework/include/utils/MooseRandomPerturbation.h
(header-only, no new .C file, no registration macro required).

Gold files for all LHS tests, several MCMC tests, and surrogate-training tests
that pass through an LHS sampler have been regolded because the new Feistel
permutation and stateless generator advancement produce a different (but equally
valid and reproducible) sample sequence.

To-Do

Things to do once reviewers are satisfied with changes.

Update figures, tables, etc. in documentation regarding LHS sampling changes:

  • modules/stochastic_tools/examples/parameter_study.md
  • modules/stochastic_tools/examples/nonlin_parameter_study.md
  • modules/stochastic_tools/examples/sobol.md
  • modules/stochastic_tools/examples/poly_regression_surrogate.md
  • modules/stochastic_tools/examples/pod_rb_surrogate.md
  • modules/stochastic_tools/examples/combined_example_2d_trans_diff.md
  • modules/stochastic_tools/examples/cross_validation.md
  • modules/combined/examples/stm_thermomechanics.md

Update applications:

  • Grizzly

This works and is consistent in parallel, but fails most tests due to the algorithmic change.

Integration with MooseRandomStateless will probably require more regolding, so I'm saving this part for later.

Refs idaholab#32194
@zachmprince
Contributor Author

LHS quality experiment: Feistel permutation vs. Fisher-Yates shuffle

modules/stochastic_tools/examples/lhs/lhs_experiment.{py,ipynb} contains a
numerical comparison of four samplers run against a parametric test function
family.

Test function. Each trial draws an nrow x ndim sample matrix and evaluates

$$ f(\mathbf{x}; \lambda) = \sum_c \frac{e^{x_c}}{2^{c+1}} + \lambda \, \frac{\sum_{i<j} L_2(x_i)\, L_2(x_j)}{\sqrt{d(d-1)/2}} $$

where $L_2(x) = \sqrt{12}(x - 0.5)$ is the centered Legendre polynomial. The
additive term ($\lambda = 0$) is well-suited to LHS; larger $\lambda$ values add
pairwise interaction that LHS cannot fully stratify, stress-testing each
sampler's behavior when the function is less cooperative. The exact mean of $f$
over $[0,1]^d$ is known analytically, so RMSE is computed directly.

Parameters. 4 samplers x 4 dimensions (4, 8, 16, 32) x 10 sample counts
(8 -- 4096, log-spaced) x 6 $\lambda$ values (0, 0.1, 0.3, 1, 3, 10) x 100
independent trials = 96 000 runs. The Feistel LHS calls the actual
stochastic_tools-opt binary; the others use scipy.stats.qmc.

Result. Across all 240 $(\lambda, d, N)$ combinations the ratio of Feistel
RMSE to Fisher-Yates RMSE has a mean of 0.99 and a standard deviation of
0.10, with a range of 0.81 -- 1.28. The extreme ratios appear only at the
smallest sample counts (nrow = 8) and highest dimensions (ndim = 32), where
both methods have large absolute RMSE and the ratio is dominated by Monte Carlo
noise across the 100 trials. At $N \ge 32$ the two LHS variants are
indistinguishable. The table below shows the $\lambda = 0$, $d = 8$ slice as a
representative example:

| nrow | Monte Carlo | LHS (Fisher-Yates) | LHS (Feistel) | Sobol |
| --- | --- | --- | --- | --- |
| 8 | 0.110472 | 0.012540 | 0.013309 | 0.011746 |
| 16 | 0.064504 | 0.004484 | 0.004344 | 0.004860 |
| 32 | 0.049537 | 0.001694 | 0.001605 | 0.001547 |
| 64 | 0.029627 | 0.000548 | 0.000574 | 0.000454 |
| 128 | 0.025555 | 0.000186 | 0.000216 | 0.000186 |
| 256 | 0.015685 | 0.000075 | 0.000070 | 0.000053 |
| 512 | 0.012246 | 0.000027 | 0.000027 | 0.000022 |
| 1024 | 0.009555 | 0.000007 | 0.000009 | 0.000011 |
| 2048 | 0.006514 | 0.000003 | 0.000003 | 0.000005 |
| 4096 | 0.003950 | 0.000001 | 0.000001 | 0.000000 |

Both LHS variants produce the correct $O(N^{-1})$ convergence rate and nearly
identical absolute errors, confirming that the Feistel-network permutation is a
drop-in replacement for the Fisher-Yates shuffle used by scipy (and by the
previous stochastic_tools implementation) with no measurable quality penalty.

@zachmprince
Contributor Author

Here is a plot of the experiment results. The LHS lines (blue and red) are difficult to see since they are right on top of each other.
[plot of experiment results]

@moosebuild
Contributor

Job Precheck, step Python: black format on af7f601 wanted to post the following:

Python black formatting

Your code requires style changes.

A patch was generated and copied here.

You can directly apply the patch by running the following at the top level of your repository:

curl -s https://mooseframework.inl.gov/docs/PRs/32833/black/black.patch | git apply -v

Alternatively, you can run the following at the top level of your repository:

black --config pyproject.toml --workers 1 .

@moosebuild
Contributor

Job Precheck, step Clang format on f298501 wanted to post the following:

Your code requires style changes.

A patch was auto-generated and copied here.

You can directly apply the patch by running the following at the top level of your repository:

curl -s https://mooseframework.inl.gov/docs/PRs/32833/clang_format/style.patch | git apply -v

Alternatively, with your repository up to date and in the top level of your repository:

git clang-format 162e78c71709df5be85663d216c89383cf162192

@zachmprince zachmprince force-pushed the new_latin_sampling branch 3 times, most recently from 5ba8c63 to 9bff5a0 Compare April 24, 2026 21:08
@moosebuild
Contributor

moosebuild commented Apr 24, 2026

Job Documentation, step Docs: sync website on 79ac28e wanted to post the following:

View the site here

This comment will be updated on new commits.

@moosebuild
Contributor

moosebuild commented Apr 27, 2026

Job Coverage, step Generate coverage on 79ac28e wanted to post the following:

Framework coverage

| | 4e53b3 (Total) | #32833 79ac28 (Total) | +/- | New |
| --- | --- | --- | --- | --- |
| Rate | 85.87% | 85.87% | -0.00% | 100.00% |
| Hits | 132516 | 132544 | +28 | 71 |
| Misses | 21798 | 21809 | +11 | 0 |

Diff coverage report

Full coverage report

Modules coverage

Stochastic tools

| | 4e53b3 (Total) | #32833 79ac28 (Total) | +/- | New |
| --- | --- | --- | --- | --- |
| Rate | 90.68% | 90.69% | +0.02% | 100.00% |
| Hits | 8636 | 8615 | -21 | 136 |
| Misses | 888 | 884 | -4 | 0 |

Diff coverage report

Full coverage report

Full coverage reports

Reports

This comment will be updated on new commits.

@zachmprince zachmprince marked this pull request as ready for review April 28, 2026 19:45
Member

@lindsayad lindsayad left a comment


Framework portion looks good. I'll let @grmnptr handle the stochastic tools review

Comment thread framework/include/samplers/Sampler.h Outdated
Comment thread framework/include/utils/MooseRandomPerturbation.h
@moosebuild
Contributor

Job Test, step Results summary on 79ac28e wanted to post the following:

Framework test summary

Compared against 4e53b3b in job civet.inl.gov/job/3786076.

No change

Modules test summary

Compared against 4e53b3b in job civet.inl.gov/job/3786076.

No added tests

Run time changes

| Test | Base (s) | Head (s) | +/- | Base (MB) | Head (MB) |
| --- | --- | --- | --- | --- | --- |
| stochastic_tools/test:reporters/BFActiveLearning.sampling_bf/MultipleProc_MultipleRow_Ufunction | 9.56 | 4.64 | -51.47% | 1295.51 | 310.88 |



Development

Successfully merging this pull request may close these issues.

Sampler::execute() is not called for EXEC_INITIAL, leaving executeSetUp() unrun on the first timestep

3 participants