[WebNN] Always execute decomposed *SimplifiedLayerNormalization in FP32 #24437

Honry · 2025-04-16T00:55:23Z

Decomposed [Skip]SimplifiedLayerNormalization will lose precision in FP16, we'd like to add cast (to: fp32) ops around it in WebNN EP to ensure its precision rather than manually add cast nodes in each model file.

Honry · 2025-04-16T00:56:28Z

@fdwr, @guschmue, PTAL, thanks!

fdwr

👀

onnxruntime/core/providers/webnn/builders/impl/normalization_op_builder.cc

Decomposed [Skip]SimplifiedLayerNormalization will lose precision in FP16, we'd like to add cast (to: fp32) ops around it in WebNN EP to ensure its precision rather than manually add cast nodes in each model file.

Honry · 2025-04-21T02:43:07Z

@fdwr, thanks for your comments, fixed in new commit, PTAL again.

fdwr

👍

fdwr · 2025-04-21T05:22:19Z

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline,Windows GPU WebGPU CI Pipeline,Windows OpenVINO CI Pipeline

fdwr · 2025-04-21T05:22:22Z

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

fdwr · 2025-04-21T05:22:25Z

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI

fdwr · 2025-04-21T05:22:27Z

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

azure-pipelines · 2025-04-21T05:22:29Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2025-04-21T05:22:37Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2025-04-21T05:22:39Z

Azure Pipelines successfully started running 3 pipeline(s).

azure-pipelines · 2025-04-21T05:22:41Z

Azure Pipelines successfully started running 3 pipeline(s).

Honry force-pushed the cast-to-fp32-sim-layernorm branch from 3cd14fd to d14f09d Compare April 16, 2025 00:55

guschmue added the ep:WebNN WebNN execution provider label Apr 16, 2025

guschmue previously approved these changes Apr 16, 2025

View reviewed changes

fdwr reviewed Apr 19, 2025

View reviewed changes

onnxruntime/core/providers/webnn/builders/impl/normalization_op_builder.cc Outdated Show resolved Hide resolved

onnxruntime/core/providers/webnn/builders/impl/normalization_op_builder.cc Outdated Show resolved Hide resolved

Honry added 2 commits April 21, 2025 10:38

[WebNN] Always execute decomposed *SimplifiedLayerNormalization in FP32

6020852

Decomposed [Skip]SimplifiedLayerNormalization will lose precision in FP16, we'd like to add cast (to: fp32) ops around it in WebNN EP to ensure its precision rather than manually add cast nodes in each model file.

Address comments

9d390da

Honry dismissed guschmue’s stale review via 9d390da April 21, 2025 02:42

Honry force-pushed the cast-to-fp32-sim-layernorm branch from d14f09d to 9d390da Compare April 21, 2025 02:42

fdwr approved these changes Apr 21, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WebNN] Always execute decomposed *SimplifiedLayerNormalization in FP32 #24437

[WebNN] Always execute decomposed *SimplifiedLayerNormalization in FP32 #24437

Honry commented Apr 16, 2025

Honry commented Apr 16, 2025

fdwr left a comment

Honry commented Apr 21, 2025

fdwr left a comment

fdwr commented Apr 21, 2025

fdwr commented Apr 21, 2025

fdwr commented Apr 21, 2025

fdwr commented Apr 21, 2025

azure-pipelines bot commented Apr 21, 2025

azure-pipelines bot commented Apr 21, 2025

azure-pipelines bot commented Apr 21, 2025

azure-pipelines bot commented Apr 21, 2025

[WebNN] Always execute decomposed *SimplifiedLayerNormalization in FP32 #24437

Are you sure you want to change the base?

[WebNN] Always execute decomposed *SimplifiedLayerNormalization in FP32 #24437

Conversation

Honry commented Apr 16, 2025

Honry commented Apr 16, 2025

fdwr left a comment

Choose a reason for hiding this comment

Honry commented Apr 21, 2025

fdwr left a comment

Choose a reason for hiding this comment

fdwr commented Apr 21, 2025

fdwr commented Apr 21, 2025

fdwr commented Apr 21, 2025

fdwr commented Apr 21, 2025

azure-pipelines bot commented Apr 21, 2025

azure-pipelines bot commented Apr 21, 2025

azure-pipelines bot commented Apr 21, 2025

azure-pipelines bot commented Apr 21, 2025