Skip to content

v5.0.x: Add SVE detection alongside NEON in aarch64 op component #13244

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: v5.0.x
Choose a base branch
from

Conversation

vogma
Copy link
Contributor

@vogma vogma commented May 9, 2025

Add SVE detection alongside NEON in aarch64 op component

Signed-off-by: Marco Vogel [email protected]
(cherry picked from commit 16cb214)

This is the v5.0.x PR corresponding to main PR #13204

@github-actions github-actions bot added this to the v5.0.8 milestone May 9, 2025
@vogma vogma changed the title config: add SVE detection alongside NEON in aarch64 op component v5.0.x: Add SVE detection alongside NEON in aarch64 op component May 9, 2025
Copy link

github-actions bot commented May 9, 2025

Hello! The Git Commit Checker CI bot found a few problems with this PR:

64ec344: config: add SVE detection alongside NEON in aarch6...

  • check_cherry_pick: does not include a cherry pick message (did you need to bot:notacherrypick?)

Please fix these problems and, if necessary, force-push new commits back up to the PR branch. Thanks!

1 similar comment
Copy link

github-actions bot commented May 9, 2025

Hello! The Git Commit Checker CI bot found a few problems with this PR:

64ec344: config: add SVE detection alongside NEON in aarch6...

  • check_cherry_pick: does not include a cherry pick message (did you need to bot:notacherrypick?)

Please fix these problems and, if necessary, force-push new commits back up to the PR branch. Thanks!

- Introduce AC_CACHE_CHECK probes for ARM Scalable Vector Extension (SVE)
  using both a default compile test and a second test with __attribute__((__target__("+sve"))).
- Define variables op_cv_sve_support and op_cv_sve_add_flags
- Update AM_CONDITIONAL and AC_DEFINE to expose SVE support macros
  (OMPI_MCA_OP_HAVE_SVE, OMPI_MCA_OP_SVE_EXTRA_FLAGS).
- Extend final AS_IF to enable the component when either NEON or SVE is available.
- Add a preprocessor guard around SVE-specific function attributes
- Encapsulate the +sve attribute behind OMPI_MCA_OP_SVE_EXTRA_FLAGS, ensuring
  that only builds which detected and enabled compiler SVE support will compile with
  SVE-targeted code paths.
- Simplifies later code by using SVE_ATTR in function declarations instead of
  repeating the attribute clause.
- apply SVE_ATTR macro in C source for conditional +sve targeting
- sve feature detection only on linux
- code review feedback

Signed-off-by: Marco Vogel <[email protected]>
(cherry picked from commit 16cb214)
@vogma vogma force-pushed the v5.0.x/aarch64_build_update branch from 64ec344 to 42c4eeb Compare May 9, 2025 19:04
@vogma
Copy link
Contributor Author

vogma commented May 9, 2025

The error seems to be related to the build machine ( cat: write error: No space left on device) if I'm not mistaken.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants