Describe the bug
The Python frontend graph API is missing block_scale_quantize, even though it does expose block_scale_dequantize:
>>> graph.block_scale_quantize
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
AttributeError: 'cudnn._compiled_module.pygraph' object has no attribute 'block_scale_quantize'. Did you mean: 'block_scale_dequantize'?
Expected behavior
The Python graph API should match the C++ API and expose block_scale_quantize alongside block_scale_dequantize.
System Environment (please complete the following information):
- cudnn_frontend version: 1.14.1
- cudnn_backend version: 9.8.0
- GPU arch: B200
- cuda runtime version: 12.8
- cuda driver version: 575.57.08
- host compiler: unsure
- OS: Ubuntu 22.04
Additional context
This looks like a simple fix: add the relevant wrapper function to python/pygraph/pygraph.cpp. It's possible I'm missing something, though.