Skip to content

Conversation

@darincomeau
Copy link
Owner

This adds testmods for the E3SMv2-Arctic configuration currently being run, including changes to namelist defaults.

This can be used by the following, on Cori-KNL as an example:

cd cime/scripts
./create_test ERS.ne30pg2_oARRM60to10.WCYCL1850.cori-knl_intel.allactive-v2arctic --project m1199 --walltime 03:00:00

Currently has this ERS test in the queue.

Do not merge until user_nl_* file settings have been settled.

@darincomeau darincomeau self-assigned this Mar 29, 2022
@milenaveneziani
Copy link
Collaborator

@darincomeau: my only question is: should we wait until we have a more definite list of namelist options we want to use for sure? I'm also OK if we want to merge this in and then modify things later, whichever works for you.

Copy link
Collaborator

@apcraig apcraig left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This will only work on cori because of the hardwired paths to inputdata files. Can we leverage $DIN_LOC_ROOT in these scripts?

@darincomeau
Copy link
Owner Author

@milenaveneziani yes I agree we should wait until namelist options are more settled., hence my 'do not merge' comment above.

@darincomeau
Copy link
Owner Author

@apcraig good point - I'll make those changes to the elm input files.

@darincomeau
Copy link
Owner Author

Just reporting that my initial test using this went through and PASSed:

ERS.ne30pg2_oARRM60to10.WCYCL1850.cori-knl_intel.allactive-v2arctic.20220329_120453_ulz9d4.cpl.hi.0001-01-12-00000.nc.base matched ERS.ne30pg2_oARRM60to10.WCYCL1850.cori-knl_intel.a
llactive-v2arctic.20220329_120453_ulz9d4.cpl.hi.0001-01-12-00000.nc.rest
PASS

@milenaveneziani
Copy link
Collaborator

we are leaving this for last, right?

@apcraig
Copy link
Collaborator

apcraig commented Aug 12, 2022

It makes sense to me to leave this for last. We should also check whether it reflects the latest setup with the new code.

darincomeau pushed a commit that referenced this pull request Sep 20, 2023
cee/15.0.0 with GPU MPI buffers can crash in a system lib like this:

#4  0x00007fffe159e35b in (anonymous namespace)::do_free_with_callback(void*, void (*)(void*)) [clone .constprop.0] () from /opt/cray/pe/cce/15.0.0/cce/x86_64/lib/libtcmalloc_minimal.so.1
#5  0x00007fffe15a8f16 in tc_free () from /opt/cray/pe/cce/15.0.0/cce/x86_64/lib/libtcmalloc_minimal.so.1
#6  0x00007fffe99c2bcd in _dlerror_run () from /lib64/libdl.so.2
#7  0x00007fffe99c2481 in dlopen@@GLIBC_2.2.5 () from /lib64/libdl.so.2
#8  0x00007fffea7bce42 in _ad_cray_lock_init () from /opt/cray/pe/lib64/libmpi_cray.so.12
#9  0x00007fffed7eb37a in call_init.part () from /lib64/ld-linux-x86-64.so.2
#10 0x00007fffed7eb496 in _dl_init () from /lib64/ld-linux-x86-64.so.2
#11 0x00007fffed7dc58a in _dl_start_user () from /lib64/ld-linux-x86-64.so.2
#12 0x0000000000000001 in ?? ()
#13 0x00007fffffff42e7 in ?? ()
#14 0x0000000000000000 in ?? ()

Work around this by using cee/14.0.3.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants