Large Kernels: Use AMREX_NO_INLINE #4716

Open
@ax3l

As of early 2024, ROCm compilers force-inline everything.

While this is generally nice, it can be problematic for very large kernels, in both compile time and runtime, in cases where we actually want to enforce a function call and jump.

We should investigate whether we have places like this where we want to add an AMREX_NO_INLINE to prevent the inlining. GatherAndPush comes to mind for some of the larger runtime combinations.

To be evaluated. Thanks to @zingale for bringing this up.
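
For illustration, here is a minimal host-only sketch of the idea described above: a helper marked as non-inlinable so that the caller performs a real function call instead of absorbing the helper's body. The macro SKETCH_NO_INLINE and the functions push_one and push_all are hypothetical stand-ins, not AMReX or WarpX code. The macro is only assumed to behave like AMREX_NO_INLINE so that the example compiles without AMReX. In actual GPU code the helper would also need a device qualifier, which is left out here.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for AMREX_NO_INLINE, so this sketch builds without AMReX.
// Assumption: the real macro maps to a comparable compiler-specific attribute.
#if defined(_MSC_VER)
#  define SKETCH_NO_INLINE __declspec(noinline)
#elif defined(__GNUC__) || defined(__clang__)
#  define SKETCH_NO_INLINE __attribute__((noinline))
#else
#  define SKETCH_NO_INLINE
#endif

// Hypothetical helper standing in for a large routine that we do not
// want duplicated into every call site.
SKETCH_NO_INLINE
double push_one (double x, double v, double dt)
{
    return x + v * dt;
}

// Hypothetical caller standing in for a kernel body. Because push_one is
// marked non-inlinable, the compiler is asked to emit a call here.
double push_all (std::vector<double>& x, std::vector<double> const& v, double dt)
{
    assert(x.size() == v.size());
    double sum = 0.0;
    for (std::size_t i = 0; i < x.size(); ++i) {
        x[i] = push_one(x[i], v[i], dt);
        sum += x[i];
    }
    return sum;
}
```

Whether this actually helps compile time or runtime for a given kernel is exactly the part that still needs to be measured. The attribute only changes how the code is generated, not what it computes.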
