Skip to content

[Downstream Change] [Pipelines] Additional unrolling in LTO #409

@VladiKrapp-Arm

Description

@VladiKrapp-Arm

What is the change
Add extra full unrolling pass to improve performance on cores with branch predictors. It helps produce simplified loops, which can then be SROA'd allowing further simplification.
This change is already present in the codebase in the form of a performance patch file.

Why this change cannot be done upstream
The introduced change is in common code with no easy way to demonstrate general usefullness. We know this to be benefitial for some Arm specific cases from internal benchmarking.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions