
Adding a script to test accuracy on IPU provider after quantization with Quark #148

Open
hanlin0628 wants to merge 18 commits into huggingface:main from hanlin0628:quark

Conversation

@hanlin0628

No description provided.

@mht-sharma
Contributor

Hi @hanlin0628,

I have been working on integrating the Quark quantizer into optimum-amd through this pull request: #149. I noticed a few things and would like to clarify the following:

  • Is the vaiq_onnx quantizer deprecated?

  • The Quark quantizer used in this PR appears to be different from the Quark torch quantizer and seems to be ONNX-based (as per the Quark documentation). Do we have two different sources for the Quark quantizer?

If the interfaces are similar, I would like to discuss how we can better integrate them into a single OptimumQuantizer.

@hanlin0628
Author

hanlin0628 commented Jul 27, 2024 via email

@hanlin0628
Author

Hi Mohit @mht-sharma

Could you please review the recent update and help me merge this PR?

Thanks,
Han

