Branch target alignment on Apple Silicon

PR #59828 started enforcing 32-byte alignment for methods with loops on ARM64 based on the Neoverse N1 optimization guide:

https://github.com/dotnet/runtime/blob/16fe4d41ae95607f9214874ab7f22b3df5d8e561/src/coreclr/jit/emit.cpp#L6775-L6785

This goes contrary to the Apple Silicon CPU Optimization Guide, section 4.4.3 Branch Target Alignment. Apple specifically states that software alignment of branch targets is unnecessary and sometimes detrimental due to the alignment capabilities of the processor. The guidance is to not align branch targets and favor smaller code size.

Aside for performance implications this alignment makes the size of NativeAOT code on iOS/macOS bigger due to unnecessary alignment.

	#if defined(TARGET_XARCH) \|\| defined(TARGET_ARM64)
	// For x64/x86/arm64, align methods that are "optimizations enabled" to 32 byte boundaries if
	// they are larger than 16 bytes and contain a loop.
	//
	if (emitComp->opts.OptimizationEnabled() &&
	(!emitComp->opts.jitFlags->IsSet(JitFlags::JIT_FLAG_PREJIT) \|\| comp->IsTargetAbi(CORINFO_NATIVEAOT_ABI)) &&
	(emitTotalHotCodeSize > 16) && emitComp->fgHasLoops)
	{
	codeAlignment = 32;
	}
	#endif

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Branch target alignment on Apple Silicon #107284

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Branch target alignment on Apple Silicon #107284

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions