Update triton-shared submodule and remove deprecated TritonToLinalg pass

erwei-xilinx · claude · erwei-xilinx · commit 6d1621054d6a · 2026-02-27T14:28:47.000-08:00
Update triton_shared submodule from 1af6a5f to 08684f9 (latest
facebookincubator/triton-shared main), which removes the deprecated
monolithic TritonToLinalg pass. Regenerate triton_shared.patch to
remove redundant TritonToLinalg deletion hunks now handled upstream.

The amd_triton_npu backend exclusively uses --triton-to-linalg-experimental
(TritonArithToLinalg + TritonToLinalgExperimental), so this removal has
no functional impact. Also update README triton-shared link to point to
the facebookincubator fork.

Tested: full build + matmul on NPU hardware (AIE2) — identical results
to unmodified main.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
diff --git a/README.md b/README.md
@@ -6,7 +6,7 @@ Triton-XDNA provides an end-to-end compilation flow that lowers standard Triton
 
 ### How it works
 
-Triton kernels are first lowered to compact Linalg compute graphs via [triton-shared](https://github.com/microsoft/triton-shared), then tiled and mapped onto parallel NPU cores using the MLIR Transform dialect, and finally compiled through [MLIR-AIR](https://github.com/Xilinx/mlir-air) and [MLIR-AIE](https://github.com/Xilinx/mlir-aie) to produce device binaries.
+Triton kernels are first lowered to compact Linalg compute graphs via [triton-shared](https://github.com/facebookincubator/triton-shared), then tiled and mapped onto parallel NPU cores using the MLIR Transform dialect, and finally compiled through [MLIR-AIR](https://github.com/Xilinx/mlir-air) and [MLIR-AIE](https://github.com/Xilinx/mlir-aie) to produce device binaries.
 
 ```
 Triton kernel (@triton.jit)
diff --git a/third_party/triton_shared b/third_party/triton_shared
@@ -1 +1 @@
-Subproject commit 1af6a5ff28add4813f36f1d46d11b559a563e9a2
+Subproject commit 08684f92ad30696362dce1760a83be889639a3e4
diff --git a/third_party/triton_shared.patch b/third_party/triton_shared.patch