Negative runoff quick-fix #7809

hydrotian · 2025-10-20T10:23:51Z

A quick fix to eliminate the negative runoff sent from ROF to OCN. It is activated by setting redirect_negative_qgwl = .true. in user_nl_mosart. Two scenarios are considered:
Scenario A (net_global_qgwl ≥ 0):

Proportionally scales down positive qgwl cells
Zeros out negative qgwl cells
No outlet redistribution

Scenario B (net_global_qgwl < 0):

Zeros out all qgwl
Redistributes deficit to all outlets proportionally
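
The two scenarios above can be sketched in a few lines of Python. This is only an illustration, not the Fortran code in RtmMod.F90. The function name `redirect_negative_qgwl_sketch`, the flat-list inputs, and the choice to weight the Scenario B redistribution by each outlet's discharge are assumptions made here. Scenario A is assumed to scale positive cells by (net sum / positive sum), so the global total is preserved.

```python
def redirect_negative_qgwl_sketch(qgwl, outlet_discharge):
    """Illustrative sketch of the two scenarios (hypothetical helper).

    qgwl             : list of per-cell qgwl values (may be negative)
    outlet_discharge : list of per-outlet discharges (assumed weights)
    Returns (new_qgwl, new_outlet_discharge).
    """
    net = sum(qgwl)
    pos_sum = sum(q for q in qgwl if q > 0.0)

    if net >= 0.0:
        # Scenario A: zero the negative cells and scale the positive
        # cells down so the global total stays equal to net.
        scale = net / pos_sum if pos_sum > 0.0 else 0.0
        new_qgwl = [q * scale if q > 0.0 else 0.0 for q in qgwl]
        return new_qgwl, list(outlet_discharge)  # no outlet redistribution

    # Scenario B: zero all qgwl and spread the (negative) net over the
    # outlets, weighted by discharge (an assumption of this sketch).
    total_out = sum(outlet_discharge)
    if total_out <= 0.0:
        raise ValueError("no positive outlet discharge to absorb the deficit")
    new_qgwl = [0.0] * len(qgwl)
    new_out = [d + net * d / total_out for d in outlet_discharge]
    return new_qgwl, new_out
```

In both branches the sum of qgwl plus outlet discharge is unchanged, which is the property a scheme like this needs if it is to conserve water globally.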

proteanplanet · 2025-10-20T18:08:24Z

@hydrotian Please can you provide a location of the coupled simulation with these changes for us to explore? Also can you provide diagnostics for this simulation? Finally, can you confirm that these changes pass SMS, PET, PEM and ERS tests in a B-case?

hydrotian · 2025-10-20T18:40:57Z

@proteanplanet I don't have a coupled simulation done with this PR yet, but I plan to submit one following my previous Bluetip simulation. This PR passed the e3sm_land_developer test suite, which includes 50+ tests, on Compy, with some namelist changes and throughput changes. See the attached test results.
test_results.txt

rljacob · 2025-10-20T21:37:43Z

Those test results don't have any PET or PEM tests. Try PET.ne4pg2_ne4pg2.I1850CNPRDCTCBCTOP and PEM.ne4pg2_ne4pg2.I1850CNPRDCTCBCTOP

hydrotian · 2025-10-20T23:26:09Z

@rljacob The PET.ne4pg2_ne4pg2.I1850CNPRDCTCBCTOP simulation failed on Compy with the following error message:

 Opened existing file 
 /compyfs/inputdata/share/domains/domain.lnd.ne4pg2_oQU240.190321.nc          23
 lat/lon grid flag (isgrid2d) is  F
 ncd_inqvid: variable LANDMASK is not on dataset
 decompInit_lnd(): Number of clumps exceeds number of land grid cells
         320         211
 ENDRUN:
 ERROR in decompInitMod.F90 at line 183

It is strange as I did not modify the land model in this PR. Any ideas? Should I try it on Chrysalis instead?

rljacob · 2025-10-20T23:28:50Z

Yes, try Chrysalis. There may not be a good pelayout for that case on Compy.

ambrad · 2025-10-20T23:31:05Z

components/mosart/src/riverroute/RtmMod.F90

+    integer, allocatable :: outlet_gindices_local(:) ! Local array of global indices of outlets on this task
+    real(r8), allocatable :: outlet_discharges_local(:) ! Local array of discharges for these outlets
+    integer :: local_outlet_count
+    integer, allocatable :: all_outlet_gindices(:)    ! Gathered on master


A number of these variables look unused.

hydrotian · 2025-11-03

They are removed. Thanks.

ambrad · 2025-10-20T23:32:16Z

components/mosart/src/riverroute/RtmMod.F90

+
+            ! Reproducible sum for negative qgwl
+            neg_local(1,1) = local_negative_qgwl_sum
+            call shr_reprosum_calc(neg_local, neg_global, 1, 1, 1, &


You could combine the two calls to shr_reprosum_calc into one call because it looks like the two fields are independent of each other. That would be more efficient than two calls.

hydrotian · 2025-11-03

Thanks. The two calls are now combined.
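
For context on why a reproducible sum matters here: a PEM test reruns the case with a different task count and expects bit-for-bit identical answers, but an ordinary floating-point sum depends on how the values are split across tasks. The Python sketch below illustrates that, along with the idea of summing several independent fields in one pass. It does not reproduce the algorithm inside shr_reprosum_calc. Exact rational arithmetic is swapped in as a stand-in, and all function names are hypothetical.

```python
from fractions import Fraction


def _loop_sum(values):
    """Plain left-to-right floating-point accumulation."""
    total = 0.0
    for v in values:
        total += v
    return total


def naive_partitioned_sum(values, ntasks):
    """Sum per-task partial sums in floating point (partition-dependent)."""
    partials = [_loop_sum(values[t::ntasks]) for t in range(ntasks)]
    return _loop_sum(partials)


def exact_partitioned_sums(rows, ntasks):
    """Sum several independent fields in one pass, independent of ntasks.

    rows: list of tuples, one value per field (e.g. positive and negative
    qgwl contributions). Fraction(float) is exact, so partial sums can be
    combined in any order and rounded to float only once at the end.
    """
    nflds = len(rows[0])
    totals = [Fraction(0)] * nflds
    for t in range(ntasks):
        partial = [Fraction(0)] * nflds
        for row in rows[t::ntasks]:
            partial = [p + Fraction(v) for p, v in zip(partial, row)]
        totals = [a + p for a, p in zip(totals, partial)]
    return tuple(float(x) for x in totals)
```

With values = [1e16, 1.0, -1e16, 1.0], the naive sum differs between one task and two, while the exact version returns the same result for any task count.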

jonbob · 2025-10-22T18:24:09Z

I ran a PEM.ne30pg2_r05_IcoswISC30E3r5.WCYCL1850.chrysalis_intel test and it failed the comparison between the two runs. The PEM_Ln9.ne30pg2_r05_IcoswISC30E3r5.WCYCL1850.chrysalis_intel test that's in e3sm_integration passes, but since it runs only 9 steps, MOSART runs only once in that test.

hydrotian · 2025-10-22T18:36:22Z

> I ran a PEM.ne30pg2_r05_IcoswISC30E3r5.WCYCL1850.chrysalis_intel test and it failed the comparison between the two runs. The PEM_Ln9.ne30pg2_r05_IcoswISC30E3r5.WCYCL1850.chrysalis_intel test that's in e3sm_integration passes, but since it's only running 9 steps mosart only runs once in that test

My PET.ne4pg2_ne4pg2.I1850CNPRDCTCBCTOP passed, but the PEM.ne4pg2_ne4pg2.I1850CNPRDCTCBCTOP failed on comparison as well, because the second run couldn't complete. I increased the walltime to 2 hours (maximum for a debug queue on Chrysalis?) but the simulation appeared to stall at some point. Then I tested the baseline (64046ec) and it failed at the same point.

jonbob · 2025-10-22T18:43:27Z

Thanks @hydrotian -- I checked and both runs for my PEM test completed fine, just had different results. I'm running a similar PET test right now

hydrotian · 2025-10-22T18:49:55Z

@jonbob Thanks. Could you share the cprnc.out report? I want to see which fields are different between the two runs.

jonbob · 2025-10-22T18:53:38Z

Sure, but after five days it ends up with 351 out of 507 fields different. It's at:

/lcrc/group/acme/ac.jwolfe/scratch/chrys/PEM.ne30pg2_r05_IcoswISC30E3r5.WCYCL1850.chrysalis_intel.20251022_120245_ruutak/PEM.ne30pg2_r05_IcoswISC30E3r5.WCYCL1850.chrysalis_intel.20251022_120245_ruutak.cpl.hi.0001-01-06-00000.nc.base.cprnc.out

jonbob · 2025-10-22T19:36:41Z

OK, the similar PET test (PET.ne30pg2_r05_IcoswISC30E3r5.WCYCL1850.chrysalis_intel) passed

hydrotian · 2025-10-22T19:53:48Z

Thanks, @jonbob. Any insights into the PEM test failure? Would you mind running the same PEM test on the baseline master that I branched from (64046ec)?

jonbob · 2025-10-22T19:57:23Z

No insights from the PEM test -- we would have to do one where we tried to catch the first field that gets different answers. @proteanplanet noticed that you have a routine for sort_outlets_by_discharge_desc but we couldn't see it getting called?

hydrotian · 2025-10-22T20:12:33Z

Yes. That was from an earlier commit on this branch. I can clean it up.

rljacob · 2025-10-22T21:09:34Z

To get a better idea of when it diffs, change the river coupling frequency to match the other models. That might allow you to go back to a 9 nstep test. Also change the coupler history output to be every timestep.

jonbob · 2025-10-22T21:14:47Z

@hydrotian -- I set redirect_negative_qgwl = .false. in your branch and the PEM test passes

hydrotian · 2025-11-03T21:27:47Z

The PEM test passes now. Both @jonbob and I confirmed that in our separate tests.

Tian Zhou and others added 7 commits May 17, 2025 00:01

Add scheme to redirect total Qgwl to top 100 river outlets

755430b

Implementation to only redistribute negative Qgwl

900af67

implement offsetting scheme

b96bd31

Reprosum instead of MPI_Allreduce

8cb9508

Fixing error for Scenario B

3306222

Fixing error for Scenario A

9078cba

optimize diagnostic outputs

2fc7068

hydrotian requested review from ambrad, bishtgautam, proteanplanet and wlin7 October 20, 2025 10:25

hydrotian assigned bishtgautam Oct 20, 2025

hydrotian added the BFB (PR leaves answers BFB) and MOSART (Concerning the MOSART river model) labels Oct 20, 2025

rljacob added the v3.1beta label Oct 20, 2025

rljacob requested a review from jonbob October 20, 2025 15:45

ambrad reviewed Oct 20, 2025

hydrotian added 9 commits October 27, 2025 15:46

try to fix PEM test fail

df6114f

Revert "try to fix PEM test fail"

8948e32

This reverts commit df6114f.

fix reprosum error

c2ca073

remove some diag calculations

1bf9a82

detect reprosum problem

39fa31c

revert to seperate reprosum calls

f505556

another attempt

a91ff9e

PEM test passed

a28ca9c

remove unused stuff

ec68102

consolidate diagnostic outputs

89d781c

7 participants
