Skip to content

Conversation

@js00070
Copy link

@js00070 js00070 commented Jan 30, 2026

Add always_inline to dlp_execute_kernel to ensure it is inlined when building with LTO. With LLVM 19, this attribute only takes effect under LTO; in non-LTO builds, inlining is not guaranteed. This improves performance for tiny shapes.

Add always_inline to dlp_execute_kernel to ensure it is inlined when
building with llvm19&LTO. This attribute only takes effect under
LTO; in non-LTO builds, inlining is not guaranteed. This improves
performance for tiny shapes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant