Feat: export dq only by Giuseppe5 · Pull Request #110 · huggingface/optimum-amd

Giuseppe5 · 2024-03-14T14:57:03Z

Supersedes #94

The idea is to export only Integer weights + DQ. For this, we need to use PyTorch 2.2+ because of a bug in how constant values are handled at export time.

dineshchitlangia

Overall, LGTM.
Minor question pls:
export_manager.change_weight_export(export_weight_q_node=True)
In your change, how are we accounting for this weight export?

Giuseppe5 · 2024-03-15T07:37:37Z

Last time I discussed with @fxmarty, he mentioned that it would be always preferable to export only Integer Weights -> DQ rather than Float Weights -> Q -> DQ, so this last option has been completely removed in this PR.

This is also related to the changes in #82 , where we assume that the weights have been exported without the Q node.

dineshchitlangia · 2024-03-18T05:01:53Z

Last time I discussed with @fxmarty, he mentioned that it would be always preferable to export only Integer Weights -> DQ rather than Float Weights -> Q -> DQ, so this last option has been completely removed in this PR.

This is also related to the changes in #82 , where we assume that the weights have been exported without the Q node.

Thanks for the clarity @Giuseppe5

I do not have write privileges to merge your changes so you will have to wait a bit more until someone can merge it.

mht-sharma

Thanks for the changes. I have left a few comments

mht-sharma · 2024-03-18T05:20:04Z

examples/quantization/brevitas/quantize_llm.py

 from optimum.amd.brevitas.accelerate_utils import calc_cpu_device_map, calc_gpu_device_map, offload_model, remove_hooks
 from optimum.amd.brevitas.data_utils import compute_perplexity, get_dataset_for_model
-from optimum.exporters.onnx import onnx_export_from_model
+from optimum.amd.brevitas.export import export_quantized_model


Suggested change

from optimum.amd.brevitas.export import export_quantized_model

from optimum.amd.brevitas.export import export_to_onnx

Can we keep the ONNX word in the loop to make it explicit. Other name suggestions, quantized_model_to_onnx or save_quantized_model_as_onnx

I opted to keep as similar as possible to the original name so it became:
onnx_export_from_quantized_model

optimum/amd/brevitas/export.py

mht-sharma · 2024-03-18T05:52:04Z

Could you also document the same in docs/brevitas

mht-sharma

LGTM!

optimum/amd/brevitas/export.py

Co-authored-by: Mohit Sharma <mohit21sharma.ms@gmail.com>

* Feat: export dq only * fix * fix * Code review * Docs: update documentation * Formatting * Apply suggestions from code review Co-authored-by: Mohit Sharma <mohit21sharma.ms@gmail.com> --------- Co-authored-by: Mohit Sharma <mohit21sharma.ms@gmail.com>

Feat: export dq only

38a9db7

Giuseppe5 mentioned this pull request Mar 14, 2024

Add ONNX rewriter #82

Merged

Giuseppe5 added 2 commits March 14, 2024 16:44

fix

624baa3

fix

28965df

Giuseppe5 requested a review from mht-sharma March 14, 2024 16:49

dineshchitlangia approved these changes Mar 15, 2024

View reviewed changes

mht-sharma reviewed Mar 18, 2024

View reviewed changes

Giuseppe5 added 2 commits March 18, 2024 14:01

Code review

3afeddf

Docs: update documentation

122f95d

Giuseppe5 requested a review from mht-sharma March 18, 2024 14:14

Formatting

434533a

mht-sharma approved these changes Mar 19, 2024

View reviewed changes

optimum/amd/brevitas/export.py Outdated Show resolved Hide resolved

optimum/amd/brevitas/export.py Show resolved Hide resolved

Apply suggestions from code review

848eaa1

Co-authored-by: Mohit Sharma <mohit21sharma.ms@gmail.com>

mht-sharma merged commit 8771d5c into huggingface:main Mar 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat: export dq only#110

Feat: export dq only#110
mht-sharma merged 7 commits intohuggingface:mainfrom
Giuseppe5:dq_flag

Giuseppe5 commented Mar 14, 2024 •

edited

Loading

Uh oh!

dineshchitlangia left a comment

Uh oh!

Giuseppe5 commented Mar 15, 2024

Uh oh!

dineshchitlangia commented Mar 18, 2024

Uh oh!

mht-sharma left a comment •

edited

Loading

Uh oh!

mht-sharma Mar 18, 2024

Uh oh!

Giuseppe5 Mar 18, 2024

Uh oh!

Uh oh!

mht-sharma commented Mar 18, 2024 •

edited

Loading

Uh oh!

mht-sharma left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	from optimum.amd.brevitas.export import export_quantized_model
	from optimum.amd.brevitas.export import export_to_onnx

Conversation

Giuseppe5 commented Mar 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dineshchitlangia left a comment

Choose a reason for hiding this comment

Uh oh!

Giuseppe5 commented Mar 15, 2024

Uh oh!

dineshchitlangia commented Mar 18, 2024

Uh oh!

mht-sharma left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mht-sharma Mar 18, 2024

Choose a reason for hiding this comment

Uh oh!

Giuseppe5 Mar 18, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mht-sharma commented Mar 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mht-sharma left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Giuseppe5 commented Mar 14, 2024 •

edited

Loading

mht-sharma left a comment •

edited

Loading

mht-sharma commented Mar 18, 2024 •

edited

Loading