JP-4260 - Enable stfitsdiff in nightly regression tests#10300
Conversation
|
Regression tests all passed with no errors or warnings: https://github.com/spacetelescope/RegressionTests/actions/runs/22682759463 |
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #10300 +/- ##
=======================================
Coverage 85.76% 85.77%
=======================================
Files 372 372
Lines 40032 40032
=======================================
+ Hits 34334 34337 +3
+ Misses 5698 5695 -3 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
#10299 will merge shortly and make some new regtest differences. I'll okify those diffs then kick off some more regtests here (before merging main in). I expect the report from stfitsdiff should be the same as this run, but with truth and new products reversed: https://github.com/spacetelescope/RegressionTests/actions/runs/22773838631 Stfitsdiff tests here: These tests ran into some new changes on main, so they include both the trace_model diffs expected, and also the same diffs as this run on main: https://github.com/spacetelescope/RegressionTests/actions/runs/22783421753 |
|
Reviewing the trace_model diffs, comparing: https://github.com/spacetelescope/RegressionTests/actions/runs/22773838631 and I think the file diffs reported are correct. I see the same number of pixels different in every case. The max absolute diff values and percent different also all match up. The reported max relative diffs are different, but I think that's because I swapped A and B files for this comparison. The x1d reports are so much more useful than the fitsdiff versions! And the report suppression for NaN-only changes look good. Reviewing the NIRCam image diffs, comparing: Same. All the diffs look right. The max relative diffs match up in this report, since As and Bs are the same for this report. |
|
Running again for some wfss_contam diffs from #10315, before merging in main. The diffs have been okified, so the new run should be equal and opposite to https://github.com/spacetelescope/RegressionTests/actions/runs/22875594770 Stfitsdiff run here: The WFSS contam diffs look right; the others are holdovers from other changes. The table print a bit funny in the first part of the report -- I think it's trying to interpret it as markup language or something? But they are clear in the second part of the report, so I don't think it's stfitsdiff's fault. |
|
Comparing some table diffs for a NIRISS SOSS specprofile reference change: Image differences are the same in both reports. There is a reporting difference for multi-dimensional table entries that we should look into. For I find the stfitsdiff report clearer about what's actually going on here -- it's nearly all of the flux values that have changed, which is easer to overlook in the fitsdiff report. But I think it might be helpful, if we can, to report the number of table cells affected, as well as the total number of numerical differences. I don't think this is a blocker for this PR, though. |
|
Updated regtests: https://github.com/spacetelescope/RegressionTests/actions/runs/24141135064 All passing. |
melanieclarke
left a comment
There was a problem hiding this comment.
Reports look good in all cases I've tested, so I think we're ready to go ahead and merge this.
and stfitsdiff script
Resolves JP-4260
This PR enables stfitsdiff in the nightly regression tests and the stfitsdiff script
Tasks
Build 12.0(use the latest build if not sure)no-changelog-entry-needed)changes/:echo "changed something" > changes/<PR#>.<changetype>.rst(see changelog readme for instructions)changes/<PR#>.breaking.rstnews fragmentdocs/pageokify_regteststo update the truth files