Pull requests: meta-pytorch/tritonbench
Pull requests list
Pipeline the TMA desc creation for TLX groupedGEMM
cla signed
fb-exported
meta-exported
#685
opened Dec 4, 2025 by
htyu
Explicitly zero grad in flash_attention Triton Bench
cla signed
fb-exported
meta-exported
#683
opened Dec 3, 2025 by
sryap
Generate GPU L2 size inputs in flash_attention Triton Bench
cla signed
fb-exported
meta-exported
#681
opened Dec 3, 2025 by
sryap
[install] fix rocm docker install
ciflow/rocm
cla signed
module: rocm
#679
opened Dec 3, 2025 by
xuzhao9
Update pyfmt component on FBS:master
cla signed
fb-exported
meta-exported
#661
opened Nov 22, 2025 by
bowiechen
[WIP][aiter][flash_attention] update aiter and add attn
cla signed
#612
opened Nov 1, 2025 by
xuzhao9
[DO NOT LAND] Test run for MLP bias accuracy issue
cla signed
#609
opened Oct 31, 2025 by
xuzhao9
Add backward compatibility for TensorDescriptor
cla signed
#457
opened Sep 19, 2025 by
bdbowyer
Add a Blackwell-specific scaled persistent + TMA template for GEMMs
cla signed
fb-exported
meta-exported
#432
opened Sep 17, 2025 by
jananisriram
adding arguments to add_benchmark to match registry
cla signed
fb-exported
#381
opened Sep 2, 2025 by
adamomainz
Add trtlllm to triton bench
cla signed
fb-exported
meta-exported
#379
opened Aug 29, 2025 by
Aya-ZIbra
Add cutlass decode kernel to TritonBench
cla signed
fb-exported
meta-exported
#376
opened Aug 28, 2025 by
Aya-ZIbra
Validate exhaustive autotuning for FP8 Inductor templates
cla signed
fb-exported
#355
opened Aug 25, 2025 by
jananisriram
[DO NOT LAND] Try always enabling cuda graph
cla signed
#348
opened Aug 21, 2025 by
xuzhao9