xe: enable resuable_dispatcher_t to handle all buffer layouts #4474

rjoursler · 2025-12-19T20:26:43Z

JIC anyone wants to review this over the holidays. There is still a little bit of work remaining to ensure there are no functional or performance regression, but the core changes are now implemented.

This PR modifies the generation and encoding used by reusable_dispatcher_t to significantly increase flexibility. This increased flexibility is then used to make the reference eltwise kernel 100% completely reusable. The key changes in this PR:

Switch to directly encoding expressions for calculations - this enables better compression of the expressions, so that more buffers can be registered, to encode expressions unrelated to buffer offsets (i.e. gws_overflow and gws_in_padding), and to optimize expensive computations (i.e. the addition idiv).
Switch buffer offsets computation to be in terms of the outer dimensions i.e. sum(outer_dim_idx * outer_dim_stride) + offset(get_inner_dim()). This enables properly inclining constants associated with blocked layouts, which avoids expensive divisions.

Beyond that, there are a few other (but more minor) optimizations at play

Switch named buffer encoding to a bitset. This reduces the overall structure size as we do not need to store buffer names and normalizes the structure layout which could prevent non-determinism due to ordering.
Use 32-bit values in the runtime params when possible. This reduces the necessary data transfer to the kernel, and also avoids emulated 64-bit arithmetic.

Introduces a named buffer constructor in scenarios where assigning dimension ids is largely unnecessary.

This structure is excessively large and scales very poorly with adding more buffers.

rjoursler · 2026-01-02T23:24:57Z

make test
disable test_device_cpu

rjoursler requested a review from a team as a code owner December 19, 2025 20:26

github-actions bot added the platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel label Dec 19, 2025

rjoursler changed the title ~~xe: enable completely reusable kernels~~ xe: enable resuable_dispatcher_t to handle all buffer layouts Dec 19, 2025

rjoursler force-pushed the rjoursle/reusable branch from ccb37a7 to 7b26d1c Compare December 19, 2025 23:16

rjoursler force-pushed the rjoursle/gemmstone_align branch 2 times, most recently from 1f1d521 to 9898aab Compare January 2, 2026 18:56

rjoursler added 8 commits January 2, 2026 13:53

xe: support iteration over empty block_structure

f2ceaba

xe: compute: introduce simplified named_buffer_t constructor

36926ee

Introduces a named buffer constructor in scenarios where assigning dimension ids is largely unnecessary.

xe: compute: switch buffer registration to a bitset

edab01d

xe: optimze dispatch_compile_params_t size

e13b345

This structure is excessively large and scales very poorly with adding more buffers.

xe: switch to using 32 bit structures when possible

ea32f42

xe: remove restriction on padded layouts

35c5f4f

xe: fix constants with punning

93ef69b

xe: eltwise: make reference implementation reusable

f690f68

rjoursler force-pushed the rjoursle/reusable branch from 7b26d1c to f690f68 Compare January 2, 2026 22:28

rjoursler changed the base branch from rjoursle/gemmstone_align to main January 2, 2026 23:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

xe: enable resuable_dispatcher_t to handle all buffer layouts #4474

xe: enable resuable_dispatcher_t to handle all buffer layouts #4474

Uh oh!

rjoursler commented Dec 19, 2025 •

edited

Loading

Uh oh!

rjoursler commented Jan 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

xe: enable resuable_dispatcher_t to handle all buffer layouts #4474

Are you sure you want to change the base?

xe: enable resuable_dispatcher_t to handle all buffer layouts #4474

Uh oh!

Conversation

rjoursler commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rjoursler commented Jan 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

rjoursler commented Dec 19, 2025 •

edited

Loading