Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make setting CHPL_* environment variables to invalid values an error #26501

Merged
merged 2 commits into from
Mar 26, 2025

Conversation

jabraham17
Copy link
Member

@jabraham17 jabraham17 commented Jan 9, 2025

Makes most CHPL_* environment variables produce an error if set to an invalid value. This can help prevent user error when accidentally doing the wrong thing.

For example, a user might misspell something or just misunderstand how to set a variable. Previously, we would just silently take that value and maybe break in expected ways later. Now, we refuse to accept it and tell the user exactly what they did wrong

Previously,

export CHPL_LLVM=ssystem
make # error, cannot find Makefile.include-ssystem

Now,

export CHPL_LLVM=ssystem
make # Error: CHPL_LLVM=ssystem is not valid, CHPL_LLVM must be none, bundled, or system

[Reviewed by @mppf]

@jabraham17 jabraham17 self-assigned this Feb 6, 2025
@jabraham17 jabraham17 marked this pull request as ready for review March 24, 2025 16:05
@jabraham17 jabraham17 requested a review from mppf March 24, 2025 20:18
@bradcray
Copy link
Member

Eventually (whether in this PR or elsewhere), I think it'd be interesting to make some of these settings' values based not on a list of values embedded in the script but other artifacts within the source tree. For example, CHPL_COMM's legal values might based on the presence of a subdirectory within runtime/src/comm/* or CHPL_*_COMPILER's legal values might be based on the presence of a make/compiler/Makefile.* file. The goal of such an approach would be to permit someone to create their own clone of a comm layer or compiler Makefile, give it a new name, and test it immediately without editing the error-checking code.

The main downsides that I can think of are (a) would it work outside of the full source tree? (e.g., maybe we only release runtime/include/comm/* and not the sources, so maybe should look there?) and (b) would it increase the time required to do the checks?

@jabraham17
Copy link
Member Author

jabraham17 commented Mar 24, 2025

I think that is a very attractive idea. I am very much in support of only having to specify things once, so inferring the valid values of CHPL_COMM from the runtime source tree is attractive. However, as you point out, that does not work once you get into other release formats that don't have the source tree (linux packages immediately come to mind)

Note that this PR only adds checks for things like CHPL_COMM and CHPL_LLVM that have discrete things values. It does not add checking for things like CHPL_*_PLATFORM or CHPL_*_COMPILER, because its much harder to pin down what a "correct" value for those is and they are easier to change (relative to adding a whole new COMM layer, for example)

@bradcray
Copy link
Member

However, as you point out, that does not work once you get into other release formats that don't have the source tree (linux packages immediately come to mind)

We must still ship the runtime headers with our binary release formats, though?

@jabraham17
Copy link
Member Author

Yup, you are correct. I just checked, we do ship the runtime includes

@mppf
Copy link
Member

mppf commented Mar 26, 2025

The goal of such an approach would be to permit someone to create their own clone of a comm layer or compiler Makefile, give it a new name, and test it immediately without editing the error-checking code.

IMO we should stick with simple checking for invalid values here. IMO an invalid value error from printchplenv etc won't be hard to fix because it amounts to searching to find the code emitting the error & then adjusting the list. And how often do we add a new comm layer etc? I think we should opt for simplicity & more understandable error checking rather than making something more complex.

@mppf
Copy link
Member

mppf commented Mar 26, 2025

@jabraham17 - I can review this PR in its current state but please rebase it to clean up the history. The CI checks are getting tripped up by the big merge commit & a merge shouldn't be necessary for this PR changing ~ 100 lines.

@@ -36,6 +36,7 @@ def get():
else:
gmp_val = 'none'

check_valid_var('CHPL_GMP', gmp_val, ['none', 'bundled', 'system'])
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Huh, this checks it even if it was the default. You could do that everywhere but if not IMO it's better to put this up top with the conditional

if gmp_val:
  check_valid_var('CHPL_GMP', gmp_val, ['none', 'bundled', 'system'])
else;
  ... set default ...

Let's pick one strategy and try to use it throughout.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I prefer checking only if we don't try to infer something. I will adjust accordingly

Copy link
Member Author

@jabraham17 jabraham17 Mar 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I take that comment back. When writing this I used always checking more consistently and for good reason. Many of these scripts already have complex logic, this would make it worse. I am going make them all always check the valid values (except for something like checking CHPL_GASNET_SEGMENT only with gasnet)

@jabraham17 jabraham17 merged commit 924c130 into chapel-lang:main Mar 26, 2025
9 checks passed
@jabraham17 jabraham17 deleted the chplenv-errors branch March 26, 2025 18:37
jabraham17 added a commit that referenced this pull request Mar 26, 2025
Fixes a failing test (`test/chplenv/chplconfig/warnings/warnings.chpl`)
resulting from #26501.

This issue occurs in a very strange way.
1. The test system always explicitly sets `CHPL_GASNET_SEGMENT`, in the
default shared mem config this will always be `none`
    `os.environ["CHPL_GASNET_SEGMENT"] = chpl_comm_segment.get()`
2. This test uses a value of `CHPL_COMM=gasnet` when testing for
multiple values of `CHPL_COMM` in a chplconfig file.
3. This test uses a custom script to ensure a clean environment when
diff-ing the test output, `test/chplenv/chplconfig/unset_overrides.sh`.
- This script uses the `--overrides` flag of `printchplenv` to determine
what variables to unset.
4. By default, `--tidy` is thrown by `printchplenv` to restrict the
chplenv output to only relevant variables, e.g. with CHPL_COMM=none,
gasnet specific variables aren't shown

This results in the good file for the test being generated with
`CHPL_COMM=gasnet` and `CHPL_GASNET_SEGMENT=none`, which is not valid.
This is allowed to happen because the `unset_overrides.sh` script is
unintentionally leaving garbage chplenv variables behind.

The solution to this is to make `unset_overrides.sh` explicitly use
`--no-tidy`, so that it actually does what it is supposed to. Note this
must be done before `--overrides` on the CLI, otherwise it has no effect
(flags are applied in order)

This PR also fixes a typo I noticed while determining all this.

[Reviewed by @lydia-duncan]
jabraham17 added a commit that referenced this pull request Mar 27, 2025
Adds `smp` to the list of valid substrates, I forgot about this when
developing #26501

[Not reviewed - trivial]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants