Commit 33be906

njriasan authored and facebook-github-bot committed
Add best config for FP8 medium shapes
Summary: Updates the tuning space with two additional configs chosen to fit more medium shapes.

Differential Revision: D75103395
1 parent f7dbd7d commit 33be906

1 file changed: +28 -0 lines changed

fbgemm_gpu/experimental/gemm/triton_gemm/fp8_gemm.py

Lines changed: 28 additions & 0 deletions
@@ -3342,6 +3342,34 @@ def get_full_non_persistent_tuning_space():
             num_warps=8,
             num_stages=2,
         ),
+        triton.Config(
+            {
+                "BLOCK_M": 256,
+                "BLOCK_N": 256,
+                "BLOCK_K": 128,
+                "GROUP_M": 2,
+                "SPLIT_K": 1,
+                "waves_per_eu": 0,
+                "matrix_instr_nonkdim": 32,
+                "kpack": 2,
+            },
+            num_warps=8,
+            num_stages=2,
+        ),
+        triton.Config(
+            {
+                "BLOCK_M": 256,
+                "BLOCK_N": 128,
+                "BLOCK_K": 128,
+                "GROUP_M": 4,
+                "SPLIT_K": 1,
+                "waves_per_eu": 0,
+                "matrix_instr_nonkdim": 16,
+                "kpack": 1,
+            },
+            num_warps=8,
+            num_stages=2,
+        ),
     ]
 
 
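The two configs added in this diff differ in `BLOCK_N` (256 vs. 128), among other fields. As a rough illustration of what that means for a medium shape, the sketch below counts output tiles and K-loop iterations per config in plain Python. The tile sizes are copied from the diff; the grid formula is the conventional one for a tiled GEMM with split-K, and both it and the example shape are assumptions, not taken from `fp8_gemm.py`.

```python
# Sketch only: plain Python, no Triton required.
# Tile sizes are copied from the two configs added in this commit.
# The grid formula and the example shape are assumptions for illustration.

NEW_CONFIGS = [
    {"BLOCK_M": 256, "BLOCK_N": 256, "BLOCK_K": 128, "SPLIT_K": 1},
    {"BLOCK_M": 256, "BLOCK_N": 128, "BLOCK_K": 128, "SPLIT_K": 1},
]


def cdiv(a: int, b: int) -> int:
    """Ceiling division for positive integers."""
    return (a + b - 1) // b


def tile_stats(m: int, n: int, k: int, cfg: dict) -> dict:
    """Count tiles, launched programs, and K-loop iterations for one config."""
    tiles_m = cdiv(m, cfg["BLOCK_M"])
    tiles_n = cdiv(n, cfg["BLOCK_N"])
    # One program per output tile, multiplied by the number of K splits.
    programs = tiles_m * tiles_n * cfg["SPLIT_K"]
    # Each program walks its share of K in BLOCK_K-sized steps.
    k_iters = cdiv(cdiv(k, cfg["SPLIT_K"]), cfg["BLOCK_K"])
    return {
        "tiles_m": tiles_m,
        "tiles_n": tiles_n,
        "programs": programs,
        "k_iters": k_iters,
    }


if __name__ == "__main__":
    m, n, k = 2048, 2048, 4096  # hypothetical "medium" shape
    for cfg in NEW_CONFIGS:
        print(cfg["BLOCK_M"], cfg["BLOCK_N"], tile_stats(m, n, k, cfg))
```

Under these assumptions, the 256x128 config launches twice as many programs as the 256x256 config for the same shape, which can matter when a medium shape would otherwise produce too few tiles to occupy the device. Which config is actually faster for a given shape is something only benchmarking can establish.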