Commit 5cd9e26

feat: Add metrics screenshots
1 parent fde7275 commit 5cd9e26

1 file changed

+7
-0
lines changed

README.md

Lines changed: 7 additions & 0 deletions
@@ -75,6 +75,13 @@ $ cargo install gpu-fryer
 GPU fryer creates two 8192x8192 matrix and performs a matrix multiplication using CUBLAS.
 Test allocates 95% of the GPU memory to write results in a ring buffer fashion.
 
+If GPU is BF16 capable, it will use BF16 precision instead of FP32 to stress the Tensor Cores.
+
+With a 8xNVIDIA H100 80GB HBM3 system, we get the following results:
+
+![utilization.png](assets/utilization.png)
+![tensorcores.png](assets/tensorcores.png)
+
 ## Acknowledgements
 
 The awesome [GPU Burn](https://github.com/wilicc/gpu-burn), very similar tool but looking at computational errors.
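
The README text in this diff describes a result ring buffer sized at 95% of GPU memory. As a rough illustration of that arithmetic only, the sketch below estimates how many 8192x8192 result matrices would fit in such a buffer. It is not taken from the gpu-fryer source: the function names `ring_slots` and `slot_for_iteration` are hypothetical, the 80 GiB total is an assumed round figure for an H100 80GB card, and the element sizes (4 bytes for FP32, 2 bytes for BF16) assume results are stored at the same precision as the inputs.

```rust
/// Hypothetical helper: number of n x n result matrices that fit in
/// `percent`% of `total_bytes`. Not part of gpu-fryer.
fn ring_slots(total_bytes: u64, percent: u64, n: u64, elem_bytes: u64) -> u64 {
    // Multiply before dividing so the percentage is not truncated early.
    let budget = total_bytes * percent / 100;
    let matrix_bytes = n * n * elem_bytes;
    budget / matrix_bytes
}

/// Hypothetical helper: ring-buffer slot written on a given iteration.
/// Once every slot has been used, writes wrap around to slot 0.
fn slot_for_iteration(iteration: u64, slots: u64) -> u64 {
    iteration % slots
}

fn main() {
    // Assumption: 80 GiB of device memory; the real usable amount differs.
    let total: u64 = 80 * 1024 * 1024 * 1024;
    let n: u64 = 8192;

    let fp32_slots = ring_slots(total, 95, n, 4); // assumed 4-byte elements
    let bf16_slots = ring_slots(total, 95, n, 2); // assumed 2-byte elements

    println!("FP32 result slots: {fp32_slots}");
    println!("BF16 result slots: {bf16_slots}");
    println!(
        "iteration {} writes to slot {}",
        fp32_slots,
        slot_for_iteration(fp32_slots, fp32_slots)
    );
}
```

Under these assumptions a BF16 run has twice as many result slots as an FP32 run, because each result matrix takes half the space. Memory used by the two input matrices is ignored here.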
