[Task] SPU LLVM: Vectorized Instructions #16684

Open
@elad335

Description

It is common, especially in floating-point code, to unroll loops by a factor of two or four so that independent instructions can execute concurrently on the same CPU core.
Here is an example of a program that does this:

Image

Notice how many instructions are repeated with only the registers changed.
The task is to execute several of these repeated instructions at once, which should improve performance considerably for such programs.
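
As a rough illustration of the idea (not RPCS3 code), the sketch below models an unrolled-by-four sequence of packed float adds and a fused version that handles all four registers in one wider operation. The names `Reg`, `WideReg`, `fa_wide`, `add_unrolled` and `add_fused` are hypothetical, and `fa` is only a plain C++ stand-in for the SPU instruction of the same name; a real implementation would work on the recompiler's LLVM IR rather than on arrays.

```cpp
#include <array>
#include <cstddef>

// Hypothetical model of one 128-bit SPU register holding four floats.
using Reg = std::array<float, 4>;
// Hypothetical wide register covering four SPU registers at once (16 lanes).
using WideReg = std::array<float, 16>;

// Stand-in for a single packed float add on one register.
Reg fa(const Reg& a, const Reg& b) {
    Reg r{};
    for (std::size_t i = 0; i < 4; ++i) r[i] = a[i] + b[i];
    return r;
}

// Unrolled-by-four form: the same instruction repeated on different registers.
std::array<Reg, 4> add_unrolled(const std::array<Reg, 4>& a,
                                const std::array<Reg, 4>& b) {
    std::array<Reg, 4> out{};
    out[0] = fa(a[0], b[0]);
    out[1] = fa(a[1], b[1]);
    out[2] = fa(a[2], b[2]);
    out[3] = fa(a[3], b[3]);
    return out;
}

// Concatenate four registers into one wide value.
WideReg pack(const std::array<Reg, 4>& regs) {
    WideReg w{};
    for (std::size_t r = 0; r < 4; ++r)
        for (std::size_t i = 0; i < 4; ++i) w[r * 4 + i] = regs[r][i];
    return w;
}

// Split a wide value back into four registers.
std::array<Reg, 4> unpack(const WideReg& w) {
    std::array<Reg, 4> regs{};
    for (std::size_t r = 0; r < 4; ++r)
        for (std::size_t i = 0; i < 4; ++i) regs[r][i] = w[r * 4 + i];
    return regs;
}

// One wide add replacing the four repeated narrow adds.
WideReg fa_wide(const WideReg& a, const WideReg& b) {
    WideReg r{};
    for (std::size_t i = 0; i < 16; ++i) r[i] = a[i] + b[i];
    return r;
}

// Fused form: pack, perform a single wide operation, unpack.
std::array<Reg, 4> add_fused(const std::array<Reg, 4>& a,
                             const std::array<Reg, 4>& b) {
    return unpack(fa_wide(pack(a), pack(b)));
}
```

Both functions return the same values; the fused form simply shows where four identical instructions could collapse into one wider one, assuming the host has a wide enough vector unit and the four instructions do not depend on each other.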

Metadata

Labels

CPU, LLVM (Related to LLVM instruction decoders), Optimization (Optimizes existing code)
