Skip to content

How did you quantize the model? #5

@Ph0rk0z

Description

@Ph0rk0z

I have been trying to use other kernels with this implementation but none of them load the state dict without mismatch. I don't know if marlin AWQ is special, but in any case, it would be nice to know how to quantize models. Even with this implementation we don't have schnell.

Please post a quanting script or give some hints.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions