Ergonomic Triton kernels with autograd support and near speed-of-light performance.
uv pip install tritonximport tritonx as tx
import torch
# Matrix multiplication
out = tx.mm(a, b)
# Element-wise addition
out = tx.add(a, b)
# Einsum operations
out = tx.einsum("ijk,ikl->ijl", a, b)tx.mm- Matrix multiplicationtx.add- Element-wise additiontx.einsum- Einstein summation with autodiff
git clone https://github.com/windsornguyen/tritonx
cd tritonx
uv syncMIT
@software{nguyen2026tritonx,
author = {Nguyen, Windsor},
title = {TritonX: Ergonomic Triton Kernels},
year = {2026},
url = {https://github.com/windsornguyen/tritonx}
}