Commit 5b046e3

adamomainz authored and facebook-github-bot committed
adding gbps to fp8 gemm rowwise
Summary: TSIA
Reviewed By: bertmaher
Differential Revision: D70415169
fbshipit-source-id: ff7fc85584efd1366dfad450efcd5d6de2054419
1 parent fb5e940 commit 5b046e3

1 file changed: +12 -0 lines changed
tritonbench/operators/fp8_gemm_rowwise/operator.py

@@ -204,6 +204,18 @@ def flops(
         flops = m * k * 2 * n
         return flops
 
+    @register_metric()
+    def gbps(self, fn, example_inputs: Any, metrics: BenchmarkOperatorMetrics) -> float:
+        def nbytes(t):
+            return t.numel() * t.element_size()
+
+        a, b, _, _ = example_inputs
+        c = fn()
+        c = c[0] if isinstance(c, tuple) else c
+
+        gb = (nbytes(a) + nbytes(b) + nbytes(c)) / 1e9
+        return gb / metrics.latency * 1e3
+
     @register_x_val(label="(M, N, K)")
     def get_x_val(self, example_inputs) -> Tuple[int, int, int]:
         xq, wq, _, _ = example_inputs
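
For reference, the arithmetic behind the new `gbps` metric can be reproduced without a GPU. The sketch below is illustrative only and is not part of the commit: `gemm_gbps` is a hypothetical helper, and it assumes 1-byte FP8 inputs of shapes (M, K) and (N, K), a 2-byte output of shape (M, N), and a latency reported in milliseconds (which is what the `* 1e3` factor in the diff suggests). As in the diff, the two scale tensors are left out of the byte count.

```python
def gemm_gbps(m: int, n: int, k: int, latency_ms: float,
              in_bytes: int = 1, out_bytes: int = 2) -> float:
    """Hypothetical stand-in for the gbps metric, using shapes instead of tensors.

    Assumptions (not confirmed by the commit): inputs are 1 byte per
    element (FP8), the output is 2 bytes per element, and latency is
    given in milliseconds.
    """
    # Bytes read for the two input matrices: a is (M, K), b is (N, K).
    input_bytes = (m * k + n * k) * in_bytes
    # Bytes written for the output matrix: c is (M, N).
    output_bytes = m * n * out_bytes
    gb = (input_bytes + output_bytes) / 1e9
    # Convert "GB per millisecond" to GB/s, mirroring the diff.
    return gb / latency_ms * 1e3


if __name__ == "__main__":
    # 1024^3 problem at 1 ms under the assumed element sizes.
    print(f"{gemm_gbps(1024, 1024, 1024, latency_ms=1.0):.3f} GB/s")
```

Because the count covers only `a`, `b`, and `c`, the result is a lower bound on the traffic the kernel actually generates; the row-wise scale vectors are small relative to the matrices, so the difference is usually minor.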
