[WIP] Diagnostics: Fix Restart with Start/Stop Moving Window #6399

ax3l · 2025-11-16T22:35:04Z

After restarts, all diagnostics lo/hi bounds were not properly restored. This was specifically the case if the moving window had a non-zero start step and/or a stop step before the checkpoint step. After a restart, this causes new checkpoints and diagnostics to become corrupted, as the wrong spatial data gets filtered.

This fixes it. Existing checkpoints (from simulations that started from 0) are still readable with this fix.

Fix #6392

To Do

debug
semi-vibe debug (Cursor)
semi-vibe fix
review & clean up

Cleanup Notice

This bug shows how risky it is to duplicate moving-window logic at multiple places. The moving/shift logic should go into functions and they should be re-used. This PR makes this anti-pattern even worse by doubling down.

A follow-up or additional commit should deduplicate the logic to make it safer: #6400

After restarts, all diagnostics lo/hi were not properly restored. This causes new checkpoints and diagnostics to become corrupted, as the wrong spatial data gets filtered. This fixes it.

ax3l · 2025-11-16T23:23:49Z

@titoiride can you potentially test this PR with your data, too? :) Please restart from a checkpoint that was written from a simulation that ran from the beginning (not a checkpoint created by a restarted simulation).

RemiLehe

Thanks a for this PR!

In addition to adapting the code to the start/end of the moving window, it seems that this PR is making more changes. Was this intentional?

For instance, the previous code used warpx.getmoving_window_x() to infer the current position of the moving window, while the new code relies instead on current_step to infer the current position.

In principle, warpx.getmoving_window_x() should be able to work with starting/stopping moving window. To avoid duplicating code (as you pointed out), should we remove the function warpx.getmoving_window_x() (since it is not used anymore) or should we try to fix it and introduce it again?

ax3l · 2025-11-17T15:52:06Z

Source/Diagnostics/Diagnostics.cpp

-        const amrex::Real displacement =
-            warpx.getmoving_window_x() - warpx.Geom(0).ProbLo(moving_dir);
-        const int shift_num_base = static_cast<int>
-            (displacement / warpx.Geom(0).CellSize(moving_dir));


The issue here is subtle.

In FullDiagnostics::MovingWindowAndGalileanDomainShift this shift is only done for steps where if (WarpX::moving_window_active(step+1)) an accumulates the m_lo/m_hi on truncated integer (cell) locations.

The implementation here omitted these details and thus introduces a drift to the real dimensions on restart.

Just for context, some of this had been changed recently in #5985. More information on what bug this was fixing in the PR description.

attn @bnara : can you double-check this PR does not introduce any issues with your existing moving-window workflows? This should improve the logic when moving windows start later than step 0 or stop at a certain step.

ax3l · 2025-11-20T18:30:37Z

@RemiLehe

In principle, warpx.getmoving_window_x() should be able to work with starting/stopping moving window. To avoid duplicating code (as you pointed out), should we remove the function warpx.getmoving_window_x() (since it is not used anymore) or should we try to fix it and introduce it again?

I was a bit puzzled as well why I could not find a solution with warpx.getmoving_window_x. Looking at the moving window implementation in WarpX as of today, I think it has a general flaw that the diags prob domain can shift over time compared to the simulation geometry (stored among others in getmoving_window_x) because of the step-wise (accumulative) integer rounding to align with cells:
https://github.com/BLAST-WarpX/warpx/blob/25.11/Source/Diagnostics/FullDiagnostics.cpp#L1007-L1008

I think this can be fixed by fully removing the double book-keeping in FullDiagnostics::MovingWindowAndGalileanDomainShift but that will render existing restart points unusable.

In other words, WarpX::MoveWindow should be the only source of truth, but the FullDiags does its own tracking and actually is not doing it well: compared to
https://github.com/BLAST-WarpX/warpx/blob/25.11/Source/Utils/WarpXMovingWindow.cpp#L356-L376
it cumulative looses rounds on the order of a cells size, every time it updates.

While the over all moving window does the same:
https://github.com/BLAST-WarpX/warpx/blob/25.11/Source/Utils/WarpXMovingWindow.cpp#L392
I think this causes an issue in the situation where the moving window is not active 100% of the sim steps.

Diagnostics: Fix Restart with Start/Stop Moving Window

800c072

After restarts, all diagnostics lo/hi were not properly restored. This causes new checkpoints and diagnostics to become corrupted, as the wrong spatial data gets filtered. This fixes it.

ax3l requested review from RemiLehe, lucafedeli88 and titoiride November 16, 2025 22:35

ax3l added bug Something isn't working bug: affects latest release Bug also exists in latest release version component: diagnostics all types of outputs component: checkpoint/restart Checkpointing & restarts labels Nov 16, 2025

This was referenced Nov 16, 2025

Diagnostics: Geometry Wrong After Restart #6392

Open

[WIP] Deduplicate Moving Window Logic #6400

Open

RemiLehe reviewed Nov 17, 2025

View reviewed changes

ax3l commented Nov 17, 2025

View reviewed changes

RemiLehe changed the title ~~Diagnostics: Fix Restart with Start/Stop Moving Window~~ [WIP] Diagnostics: Fix Restart with Start/Stop Moving Window Nov 19, 2025

ax3l mentioned this pull request Dec 2, 2025

Fix bug for field probe + moving window #6091

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] Diagnostics: Fix Restart with Start/Stop Moving Window #6399

[WIP] Diagnostics: Fix Restart with Start/Stop Moving Window #6399

Uh oh!

ax3l commented Nov 16, 2025 •

edited

Loading

Uh oh!

ax3l commented Nov 16, 2025

Uh oh!

RemiLehe left a comment •

edited

Loading

Uh oh!

ax3l Nov 17, 2025 •

edited

Loading

Uh oh!

EZoni Nov 17, 2025 •

edited

Loading

Uh oh!

ax3l Nov 17, 2025 •

edited

Loading

Uh oh!

ax3l commented Nov 20, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[WIP] Diagnostics: Fix Restart with Start/Stop Moving Window #6399

Are you sure you want to change the base?

[WIP] Diagnostics: Fix Restart with Start/Stop Moving Window #6399

Uh oh!

Conversation

ax3l commented Nov 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

To Do

Cleanup Notice

Uh oh!

ax3l commented Nov 16, 2025

Uh oh!

RemiLehe left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ax3l Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

EZoni Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ax3l Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ax3l commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ax3l commented Nov 16, 2025 •

edited

Loading

RemiLehe left a comment •

edited

Loading

ax3l Nov 17, 2025 •

edited

Loading

EZoni Nov 17, 2025 •

edited

Loading

ax3l Nov 17, 2025 •

edited

Loading

ax3l commented Nov 20, 2025 •

edited

Loading