
inplace hadamard#1641

Draft
wenhuach21 wants to merge 35 commits into main from hadamard_change

Conversation

Contributor

@wenhuach21 wenhuach21 commented Mar 31, 2026

Description

Please briefly describe your main changes and the motivation.

  • Delay the rotation to save RAM/VRAM.
  • Inference support.

Copilot AI review requested due to automatic review settings March 31, 2026 03:11
@wenhuach21 wenhuach21 marked this pull request as draft March 31, 2026 03:11
Contributor

Copilot AI left a comment


Pull request overview

This PR appears to change the experimental Hadamard-transform workflow (generation + application/patching) and introduces an “inplace” LLaMA2 rotation utility, while also attempting to enable a default Hadamard config from the CLI.

Changes:

  • Modified random Hadamard matrix construction and related docs in the transform utilities.
  • Refactored Hadamard application/patching logic (apply + WrapperLinear monkey-patches).
  • Added an experimental LLaMA2 “inplace” rotation module and changed CLI/BaseCompressor Hadamard configuration behavior.

Reviewed changes

Copilot reviewed 8 out of 9 changed files in this pull request and generated 9 comments.

| File | Description |
| --- | --- |
| auto_round/experimental/transform/utils/hadamard.py | Alters random Hadamard generation and leaves additional commented code. |
| auto_round/experimental/transform/patch_modules.py | Changes WrapperLinear monkey-patching to apply transforms before quantization. |
| auto_round/experimental/transform/apply.py | Updates Hadamard transform application (notably config serialization calls). |
| auto_round/experimental/hadamard_inplace/llama2.py | Adds LLaMA2-specific rotation utilities (currently includes import-time script logic). |
| auto_round/compressors/base.py | Changes how the Hadamard config is handled during compressor init. |
| auto_round/main.py | Forces a default Hadamard config when invoking tune(). |
Comments suppressed due to low confidence (2)

auto_round/experimental/transform/apply.py:100

  • In the input-transform branch, `precision=module.dtype` will fail for `torch.nn.Linear` (no `.dtype` attribute). Use `module.weight.dtype` (and similarly handle `MXQuantLinearBase` as needed) to avoid an AttributeError.

```python
if location == "input":
    # activation needs transpose
    input_hadamard_transform = build_hadamard_transform(
        **config.model_dump(),
        location="input",
        inverse=True,
        device="cpu",
        precision=module.dtype,
    )
```

auto_round/experimental/transform/patch_modules.py:64

  • `_qdq_act_patched` assigns `self.origin_qdq_act = self._qdq_act` after the monkey-patch, so `self._qdq_act` already points to `_qdq_act_patched`. Calling `self.origin_qdq_act(...)` then recurses indefinitely. Capture the original method in a closure (e.g., `orig_qdq_act = WrapperLinear._qdq_act`) before patching and call that instead.

```python
        self.origin_qdq_act = self._qdq_act
        x = inp_transform(x)
        return self.origin_qdq_act(x, act_max_scale, act_max)

    WrapperLinear._qdq_weight = _qdq_weight_patched
    WrapperLinear._qdq_act = _qdq_act_patched
    WrapperLinear._hadamard_patched = True


def patch_wrapperwalayer_forward_to_apply_transform(inp_transform):
```

Comment on lines 52 to 70

```diff
@@ -70,8 +69,8 @@ def random_hadamard_matrix(
     :param gen: Optional generator random values
     :return: randomly generated hadamard matrix
```

Copilot AI Mar 31, 2026

The `random_hadamard_matrix` docstring still documents a `dtype` parameter, but the function signature no longer accepts it. Either reintroduce the `dtype` argument or update the docstring to avoid misleading callers.
```diff
-        if "lm_head" in name:
+        if "lm_head" in name:  # TODO unrobust
             continue
         _apply_to_module(model, module, config, need_calibration, location)
```

Copilot AI Mar 31, 2026

`apply_hadamard_transform` still calls `_apply_to_module(model, module, config, need_calibration, location)`, but `_apply_to_module` now only accepts `(module, config, location)`. This will raise a TypeError at runtime; update the call and/or restore the removed parameters.

Suggested change:

```diff
-        _apply_to_module(model, module, config, need_calibration, location)
+        _apply_to_module(module, config, location)
```
```python
    model_dtype=args.model_dtype,
    momentum=args.momentum,
    trust_remote_code=not args.disable_trust_remote_code,
    hadamard_config="default",
```

Copilot AI Mar 31, 2026

This forces `hadamard_config="default"` for every CLI tune run, changing the default behavior and (with the current BaseCompressor changes) triggering the hadamard path without actually applying it or initializing `self.hadamard_config`, which can lead to downstream failures during save/export. Consider making this opt-in via a CLI flag, or keep the previous default of `None`.

Suggested change:

```diff
-    hadamard_config="default",
+    hadamard_config=None,
```
@wenhuach21 wenhuach21 changed the title [not4landing]hadamard change inplace hadamard Apr 8, 2026
