Skip to content

Enable inlining for dlp_execute_kernel with LTO#8

Closed
js00070 wants to merge 1 commit into
amd:devfrom
js00070:zhiyi/pr
Closed

Enable inlining for dlp_execute_kernel with LTO#8
js00070 wants to merge 1 commit into
amd:devfrom
js00070:zhiyi/pr

Conversation

@js00070

@js00070 js00070 commented Jan 30, 2026

Copy link
Copy Markdown
Contributor

Add always_inline to dlp_execute_kernel to ensure it is inlined when building with LTO. With LLVM 19, this attribute only takes effect under LTO; in non-LTO builds, inlining is not guaranteed. This improves performance for tiny shapes.

Add always_inline to dlp_execute_kernel to ensure it is inlined when
building with llvm19&LTO. This attribute only takes effect under
LTO; in non-LTO builds, inlining is not guaranteed. This improves
performance for tiny shapes.
@BhaskarNallani

Copy link
Copy Markdown
Collaborator

Merged into dev branch.

506bce8

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants