No support for block_scale_quantize in the python graph API #173

@cbcase

Describe the bug
The Python frontend graph API does not expose block_scale_quantize, even though it does expose block_scale_dequantize:

```
>>> graph.block_scale_quantize
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
AttributeError: 'cudnn._compiled_module.pygraph' object has no attribute 'block_scale_quantize'. Did you mean: 'block_scale_dequantize'?
```

Expected behavior
The Python graph API should expose block_scale_quantize, matching the C++ API.

System Environment (please complete the following information):

  • cudnn_frontend version: 1.14.1
  • cudnn_backend version: 9.8.0
  • GPU arch: B200
  • cuda runtime version: 12.8
  • cuda driver version: 575.57.08
  • host compiler: unsure
  • OS: Ubuntu 22.04

Additional context
This looks like a simple fix to add the relevant wrapper function to python/pygraph/pygraph.cpp, but it's possible I'm missing something.
