[compiler-rt][AArch64] Provide basic implementations of SME memcpy/memmove in case of strictly aligned memory access #138250

vhscampos · 2025-05-02T10:57:01Z

The existing implementations, written in assembly, make use of unaligned accesses for performance reasons. They are not compatible with strict aligned configurations, i.e. with -mno-unaligned-access.

If the functions are used in this scenario, an exception is raised due to unaligned memory accesses.

This patch reintroduces vanilla implementations for these functions to be used in strictly aligned configurations. The actual code is largely based on the code from #77496

compiler-rt/lib/builtins/aarch64/sme-libc-routines.c

david-arm

LGTM!

compiler-rt/lib/builtins/aarch64/sme-libc-routines.c

…mmove in case of strictly aligned memory access The existing implementations, written in assembly, make use of unaligned accesses for performance reasons. They are not compatible with strict aligned configurations, i.e. with `-mno-unaligned-access`. If the functions are used in this scenario, an exception is raised due to unaligned memory accesses. This patch reintroduces vanilla implementations for these functions to be used in strictly aligned configurations. The actual code is largely based on the code from llvm#77496

- Split functions into separate files. - Select which implementation to use based on target features. The selection is now done in CMake.

vhscampos · 2025-06-05T09:49:29Z

compiler-rt/lib/builtins/CMakeLists.txt

@@ -600,9 +600,17 @@ if (COMPILER_RT_HAS_AARCH64_SME)
    set_source_files_properties(aarch64/arm_apple_sme_abi.s PROPERTIES COMPILE_FLAGS -march=armv8a+sme)
    message(STATUS "AArch64 Apple SME ABI routines enabled")
  elseif (NOT COMPILER_RT_DISABLE_AARCH64_FMV AND COMPILER_RT_HAS_FNO_BUILTIN_FLAG AND COMPILER_RT_AARCH64_FMV_USES_GLOBAL_CONSTRUCTOR)
-    list(APPEND aarch64_SOURCES aarch64/sme-abi.S aarch64/sme-libc-mem-routines.S aarch64/sme-abi-assert.c aarch64/sme-libc-routines.c)


The change inadvertently removed files from the build. I will fix it shortly.

…mmove in case of strictly aligned memory access (llvm#138250) The existing implementations, written in assembly, make use of unaligned accesses for performance reasons. They are not compatible with strict aligned configurations, i.e. with `-mno-unaligned-access`. If the functions are used in this scenario, an exception is raised due to unaligned memory accesses. This patch reintroduces vanilla implementations for these functions to be used in strictly aligned configurations. The actual code is largely based on the code from llvm#77496

vhscampos requested a review from kmclaughlin-arm May 2, 2025 10:57

llvmbot added compiler-rt compiler-rt:builtins labels May 2, 2025

vhscampos requested review from david-arm and smithp35 May 2, 2025 10:57

david-arm requested a review from sdesmalen-arm May 9, 2025 08:38

david-arm reviewed May 9, 2025

View reviewed changes

compiler-rt/lib/builtins/aarch64/sme-libc-routines.c Outdated Show resolved Hide resolved

compiler-rt/lib/builtins/aarch64/sme-libc-routines.c Outdated Show resolved Hide resolved

david-arm approved these changes May 12, 2025

View reviewed changes

compiler-rt/lib/builtins/aarch64/sme-libc-routines.c Outdated Show resolved Hide resolved

sdesmalen-arm reviewed May 12, 2025

View reviewed changes

compiler-rt/lib/builtins/aarch64/sme-libc-routines.c Outdated Show resolved Hide resolved

vhscampos added 2 commits June 3, 2025 09:34

Changes:

3b20924

- Split functions into separate files. - Select which implementation to use based on target features. The selection is now done in CMake.

vhscampos force-pushed the sme-intrinsics-strict-alignment branch from ca036e8 to 3b20924 Compare June 3, 2025 08:46

vhscampos merged commit 75c3ff8 into llvm:main Jun 3, 2025
10 checks passed

vhscampos deleted the sme-intrinsics-strict-alignment branch June 3, 2025 09:59

vhscampos commented Jun 5, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[compiler-rt][AArch64] Provide basic implementations of SME memcpy/memmove in case of strictly aligned memory access #138250

[compiler-rt][AArch64] Provide basic implementations of SME memcpy/memmove in case of strictly aligned memory access #138250

Uh oh!

vhscampos commented May 2, 2025

Uh oh!

Uh oh!

Uh oh!

david-arm left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vhscampos Jun 5, 2025

Uh oh!

Uh oh!

[compiler-rt][AArch64] Provide basic implementations of SME memcpy/memmove in case of strictly aligned memory access #138250

[compiler-rt][AArch64] Provide basic implementations of SME memcpy/memmove in case of strictly aligned memory access #138250

Uh oh!

Conversation

vhscampos commented May 2, 2025

Uh oh!

Uh oh!

Uh oh!

david-arm left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vhscampos Jun 5, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!