You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The FLOPs numbers in throughput.py for H100 and H200 GPUs have some inconsistencies that have caused me some confusion when switching between GPU types:
all H200 and the H100 NVL GPU seem to be using sparse FLOPs numbers
I think it would be good to standardize on one and also document the behaviour as there's a factor 2 difference between the two.
What version are you seeing the problem on?
master
Reproduced in studio
No response
How to reproduce the bug
Error messages and logs
# Error messages and logs here please
Environment
Current environment
#- PyTorch Lightning Version (e.g., 2.6.0):
#- PyTorch Version (e.g., 2.5):
#- Python version (e.g., 3.12):
#- OS (e.g., Linux):
#- CUDA/cuDNN version:
#- GPU models and configuration:
#- How you installed Lightning(`conda`, `pip`, source):
Bug description
The FLOPs numbers in throughput.py for H100 and H200 GPUs have some inconsistencies that have caused me some confusion when switching between GPU types:
I think it would be good to standardize on one and also document the behaviour as there's a factor 2 difference between the two.
What version are you seeing the problem on?
master
Reproduced in studio
No response
How to reproduce the bug
Error messages and logs
Environment
Current environment
More info
No response
cc @ethanwharris