[WebGPU EP] Add EINSUM implementation #24358
Conversation
@satyajandhyala @xiaofeihan1 @qjia7 @guschmue @fs-eire, please help to review, thanks.
  input_tensors.push_back(context.Input<Tensor>(i));
}

EinsumEquation equation(input_tensors, equation_);
The equation_ is an attribute, which means it is constant and will not change across calls to Einsum::ComputeInternal in this context. Is there a way to move some of the preparation steps into the kernel constructor so that we don't have to repeat this work on every call?
Thanks for the suggestion. I dug into the logic, and it looks like it's not easy to move the equation parsing into the kernel constructor. The reason is that the parsing heavily depends on the input shapes, in particular on the input ranks, to handle equations containing "...", like ...ij,...jk->...ik. When parsing the equation from left to right, we need to know each input's rank to fill up the ellipsis dimensions. Another case that depends heavily on the inputs is an equation like "ij,jk", where the output shape must be filled in implicitly. The only part we could move into the kernel constructor is the term string parsing, which is low cost; we would still have to loop over the parsed term strings and process each term in ComputeInternal. So I think these changes bring little improvement. What do you think?
Description
This PR adds a native implementation of the einsum operator, based on and expanded from the existing einsum.ts. All the test cases in einsum_test.cc pass.
The equation attribute of the einsum op is a string consisting of a left-hand side (LHS) and, optionally, a right-hand side (RHS), separated by '->'.
LHS consists of a sequence of terms separated by commas; each term corresponds to an input variable.
Each symbol in a term corresponds to a dimension in that input variable. A symbol can be a letter ('a' to 'z' or 'A' to 'Z'), '...' to represent arbitrary dimensions, or empty to represent a scalar.
An empty RHS is handled differently in implicit vs. explicit mode.
For all the test cases, please refer to einsum_test.cc.