Skip to content

Commit 6d16210

Browse files
erwei-xilinxclaude
andcommitted
Update triton-shared submodule and remove deprecated TritonToLinalg pass
Update triton_shared submodule from 1af6a5f to 209a064 (latest facebookincubator/triton-shared main), which removes the deprecated monolithic TritonToLinalg pass. Regenerate triton_shared.patch to remove redundant TritonToLinalg deletion hunks now handled upstream. The amd_triton_npu backend exclusively uses --triton-to-linalg-experimental (TritonArithToLinalg + TritonToLinalgExperimental), so this removal has no functional impact. Also update README triton-shared link to point to the facebookincubator fork. Tested: full build + matmul on NPU hardware (AIE2) — identical results to unmodified main. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent fd6d571 commit 6d16210

3 files changed

Lines changed: 3 additions & 5525 deletions

File tree

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ Triton-XDNA provides an end-to-end compilation flow that lowers standard Triton
66

77
### How it works
88

9-
Triton kernels are first lowered to compact Linalg compute graphs via [triton-shared](https://github.com/microsoft/triton-shared), then tiled and mapped onto parallel NPU cores using the MLIR Transform dialect, and finally compiled through [MLIR-AIR](https://github.com/Xilinx/mlir-air) and [MLIR-AIE](https://github.com/Xilinx/mlir-aie) to produce device binaries.
9+
Triton kernels are first lowered to compact Linalg compute graphs via [triton-shared](https://github.com/facebookincubator/triton-shared), then tiled and mapped onto parallel NPU cores using the MLIR Transform dialect, and finally compiled through [MLIR-AIR](https://github.com/Xilinx/mlir-air) and [MLIR-AIE](https://github.com/Xilinx/mlir-aie) to produce device binaries.
1010

1111
```
1212
Triton kernel (@triton.jit)

third_party/triton_shared

Submodule triton_shared updated 111 files

0 commit comments

Comments
 (0)