@xylar
Collaborator

@xylar xylar commented Oct 16, 2025

This merge also updates config_machines.xml to bring in updates to the Perlmutter CPU and GPU modules (E3SM-Project/E3SM#7740). This version of config_machines.xml corresponds to https://github.com/E3SM-Project/E3SM/blob/37bf054cb4d8b6bbc3dbc6d4a377ce183f6375cd/cime_config/machines/config_machines.xml

altheaden and others added 13 commits October 2, 2025 12:46
...from get_modules_env_vars_and_mpi_compilers() and
get_spack_script()
Almost all shell templates appear to be interchangeable with their
auto-generated counterparts from machine_configs.xml.

Perlmutter CPU and GPU still need a little trick to set $NERSC_HOST
if it isn't defined.
Before this fix, the autogenerated shell scripts were not being
rendered as Jinja templates.
This brings in updates to Perlmutter CPU and GPU modules and
corresponds to E3SM commit hash:
37bf054cb4d8b6bbc3dbc6d4a377ce183f6375cd
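
The `$NERSC_HOST` trick mentioned in the commit messages above is not shown in this conversation. A minimal sketch of what such a fallback might look like in a shell template follows; the value `perlmutter` is an assumption for illustration and is not taken from the PR.

```shell
# Hypothetical fallback: set NERSC_HOST only when it is unset or empty.
# The value "perlmutter" is an assumption, not taken from the PR.
if [ -z "${NERSC_HOST:-}" ]; then
    export NERSC_HOST="perlmutter"
fi
```

Using `${NERSC_HOST:-}` keeps the check safe under `set -u`, and an existing value is left untouched.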
@xylar xylar self-assigned this Oct 16, 2025
@xylar xylar added the spack (Changes relate to creating conda and Spack environments, and creating a load script) and config-machines (Changes to the config_machines.xml file) labels Oct 16, 2025
@xylar xylar requested a review from andrewdnolan October 16, 2025 12:47
@andrewdnolan andrewdnolan marked this pull request as ready for review October 20, 2025 15:00
@andrewdnolan
Collaborator

I just looked through the last two commits, but those seem good to me.

The only thing worth mentioning is that the latest changes to pm-cpu_nvidia have caused a lot of E3SM tests to fail: https://my.cdash.org/builds/3148732. I don't necessarily think that should stop us, but I just wanted to mention it so we are aware if anything downstream breaks. (Although I don't think the nvidia compiler is super widely used.)

@xylar
Collaborator Author

xylar commented Oct 20, 2025

@andrewdnolan, other commits are all from #304, so perfect!

@xylar
Collaborator Author

xylar commented Oct 24, 2025

Testing

I was able to deploy E3SM-Unified on Perlmutter with these updates. I feel confident enough that things are working to merge.

@andrewdnolan ran into some trouble with libfabric, but this appears to have been resolved by deleting ~/.spack.

@xylar
Collaborator Author

xylar commented Oct 24, 2025

> The only thing worth mentioning is that the latest changes to pm-cpu_nvidia have caused a lot of E3SM tests to fail: https://my.cdash.org/builds/3148732. I don't necessarily think that should stop us, but I just wanted to mention it so we are aware if anything downstream breaks. (Although I don't think the nvidia compiler is super widely used.)

I don't believe any of E3SM-Unified, Polaris or Compass supports pm-cpu with nvidia. But even if one of them does, trouble there doesn't seem like a good reason to hold up this update. We can always follow up with another update later once things get sorted out on the E3SM side.

@xylar xylar merged commit 592aaa9 into E3SM-Project:main Oct 24, 2025
5 checks passed
@xylar
Collaborator Author

xylar commented Oct 24, 2025

Ack, shoot! I didn't mean to merge #303 as well!! I'll revert!

xylar added a commit to xylar/mache that referenced this pull request Oct 24, 2025
This reverts commit 592aaa9, reversing
changes made to 9357439.
xylar added a commit that referenced this pull request Oct 24, 2025
Revert "Merge pull request #306 from xylar/update-pm"