
Conversation

@yiliu30 (Contributor) commented Nov 5, 2025

No description provided.

yiliu30 and others added 30 commits October 22, 2025 22:32
@yiliu30 yiliu30 requested a review from wenhuach21 November 5, 2025 05:08
    block, self.device_map, input_ids, self.low_gpu_mem_usage, self.batch_size, device
)
if normalize_inputs:
    input_ids, input_others = normalize_input(inputs)
Why not move these changes to the llmc side?


# Function decorator to log the execution time of a function.
def time_logger(func: Callable) -> Callable:
    """Decorator to log the execution time of a function.
Move this to the utils file.
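For reference, a minimal sketch of such a decorator once moved into a utils module (an illustrative implementation, not the actual AutoRound code; the logging call is an assumption):

import time
from functools import wraps
from typing import Callable


def time_logger(func: Callable) -> Callable:
    """Decorator to log the execution time of a function (sketch)."""

    @wraps(func)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        result = func(*args, **kwargs)
        # Hypothetical logging; the real code may use the project logger instead of print.
        print(f"{func.__name__} took {time.perf_counter() - start:.3f}s")
        return result

    return wrapper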

self.inner_supported_types = tuple(x for x in INNER_SUPPORTED_LAYER_TYPES if x != "FP8Linear")
self.batch_dim = None
# TODO: check with heng/weiwei
self.batch_dim = 0
If this is required, hide it in kwargs and add comments.
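A sketch of that suggestion (the class and parameter name are hypothetical, and the default of 0 is an assumption based on the snippet above):

class _Example:
    def __init__(self, **kwargs):
        # batch_dim selects which input dimension is treated as the batch axis
        # when slicing calibration data; 0 (the first dimension) is assumed here.
        self.batch_dim = kwargs.pop("batch_dim", 0)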

q_inputs = q_inputs.pop(input_id_str[0], None)
return inputs, q_inputs

def configure_layer_config(self, enable_gguf_official_mixed: None | bool = False):
Better to set enable_gguf_official_mixed to True by default.
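The suggested change would amount to flipping the default in the signature (sketch only; the body is elided):

def configure_layer_config(self, enable_gguf_official_mixed: bool | None = True):
    # Default flipped to True per the review suggestion.
    ...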

q_input: Union[torch.Tensor, dict, None] = None,
normalize_inputs: bool = False,
device: Union[str, torch.device] = "cpu",
auto_offload: bool = True
@n1ck-guo is working on splitting this function. We should avoid fusing into it too many operations that AutoRound itself does not actually use.
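One possible direction, as an illustration only (the helper name is hypothetical and this is not the planned refactor), is to pull the optional preprocessing out of the core routine so it only runs when a caller actually needs it:

def _maybe_normalize(inputs, normalize: bool):
    # Assumes normalize_input returns (input_ids, input_others), as in the snippet above.
    if normalize:
        return normalize_input(inputs)
    return inputs, None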
