Skip to content

Migrate backend quantizers/partitioners to use new torchao quant primitives #9513

Open
@metascroy

Description

@metascroy

🚀 The feature, motivation and pitch

PT2E will be copied over to torchao, and migrate to using the new torchao quant primitives. The existing PT2E code will remain in pytorch/ao with the old quant primitives to avoid breaking existing code.

Backends should migrate to using PT2E in torchao and recognizing the new torchao quant primitives during lowering.

By standardizing on torchao quant primitives, model lowering will "just work" in common scenarios, even if a user quantizes the model using torchao's quantize_.

Alternatives

No response

Additional context

No response

RFC (Optional)

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    triagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions