Add ONNX opset 23 RMSNormalization operator support by AditiThirdEye · Pull Request #1046 · onnx/onnx-tensorrt

AditiThirdEye · 2025-12-29T04:35:45Z

Implements RMSNormalization operator for TensorRT ONNX parser, enabling deployment of modern transformer architectures (LLaMA, Mistral, etc.) that use RMSNorm instead of LayerNorm.

Implementation details:

Computes Y = (X / sqrt(mean(X^2) + epsilon)) * scale
Supports FP32, FP16, and BF16 data types
Handles axis attribute for normalization dimensions
Supports epsilon and stash_type attributes per ONNX spec

Changes:

onnxOpImporters.cpp: Add RMSNormalization importer using TensorRT primitive operations (ElementWise, Reduce, Unary)
onnxOpCheckers.cpp: Add empty checker for RMSNormalization
docs/operators.md: Add RMSNormalization to supported operators matrix
onnx_backend_test.py: Include RMSNormalization tests

Fixes onnx/onnx-tensorrt#4639 (via NVIDIA/TensorRT#4639)

Implements RMSNormalization operator for TensorRT ONNX parser, enabling deployment of modern transformer architectures (LLaMA, Mistral, etc.) that use RMSNorm instead of LayerNorm. Implementation details: - Computes Y = (X / sqrt(mean(X^2) + epsilon)) * scale - Supports FP32, FP16, and BF16 data types - Handles axis attribute for normalization dimensions - Supports epsilon and stash_type attributes per ONNX spec Changes: - onnxOpImporters.cpp: Add RMSNormalization importer using TensorRT primitive operations (ElementWise, Reduce, Unary) - onnxOpCheckers.cpp: Add empty checker for RMSNormalization - docs/operators.md: Add RMSNormalization to supported operators matrix - onnx_backend_test.py: Include RMSNormalization tests Fixes onnx/onnx-tensorrt#4639 (via NVIDIA/TensorRT#4639) Signed-off-by: Aditi_Pandey <54734131+AditiThirdEye@users.noreply.github.com>

AditiThirdEye · 2026-01-03T07:24:46Z

@kevinch-nv @yuanyao-nv Could you please review this PR when you have a chance?

This adds ONNX opset 23 RMSNormalization support, enabling deployment of modern LLM architectures (LLaMA, Mistral, etc.) that use RMSNorm.

Related issue: NVIDIA/TensorRT#4639

Thanks!

yuanyao-nv · 2026-01-05T05:50:25Z

Thanks for your contribution. RMSNorm support will be available in the 10.15 release, please stay tuned.

AditiThirdEye · 2026-01-05T05:54:08Z

Thanks @yuanyao-nv! Just to clarify - will this PR be considered for the 10.15 release, or is there already an internal implementation in progress?
Happy to address any feedback if you'd like to use this PR, or close it if there's already work underway internally.

yuanyao-nv · 2026-01-05T06:41:43Z

It's the latter. There is already an internal implementation ready to be released in 10.15.

AditiThirdEye force-pushed the feature/add-rmsnormalization-opset23 branch from d13be85 to 5ec6d8c Compare December 29, 2025 04:47

AditiThirdEye mentioned this pull request Dec 29, 2025

request: Add ONNX opset 23 RMSNormalization support to TensorRT ONNX parser NVIDIA/TensorRT#4639

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ONNX opset 23 RMSNormalization operator support#1046

Add ONNX opset 23 RMSNormalization operator support#1046
AditiThirdEye wants to merge 1 commit intoonnx:10.14-GAfrom
AditiThirdEye:feature/add-rmsnormalization-opset23

AditiThirdEye commented Dec 29, 2025

Uh oh!

AditiThirdEye commented Jan 3, 2026

Uh oh!

yuanyao-nv commented Jan 5, 2026

Uh oh!

AditiThirdEye commented Jan 5, 2026

Uh oh!

yuanyao-nv commented Jan 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

AditiThirdEye commented Dec 29, 2025

Uh oh!

AditiThirdEye commented Jan 3, 2026

Uh oh!

yuanyao-nv commented Jan 5, 2026

Uh oh!

AditiThirdEye commented Jan 5, 2026

Uh oh!

yuanyao-nv commented Jan 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants