Aanuf/lut per layer merged #3684

andreyanufr · 2025-10-08T17:19:34Z

Changes

Implemented computation of codebook based on k-means algorithm.

Reason for changes

Related tickets

CVS-169609

Tests

…ODEBOOK mode.

2) Added extra advanced parameters for adapriva codebook.

src/nncf/quantization/algorithms/weight_compression/algorithm.py

src/nncf/quantization/algorithms/weight_compression/codebook_estimation.py

daniil-lyakhov · 2026-01-13T10:46:17Z

src/nncf/quantization/algorithms/weight_compression/codebook_estimation.py

+        reduction_axes: tuple[int, ...],
+        config: WeightCompressionConfig,
+        wp: WeightCompressionParameters,
+    ) -> Tensor:


Wrong return type

src/nncf/quantization/algorithms/weight_compression/codebook_estimation.py

daniil-lyakhov · 2026-01-13T10:49:56Z

src/nncf/quantization/algorithms/weight_compression/codebook_estimation.py

+        if self._num_elements == config.get_numpy_codebook().size:
+            variants[0] = fns.tensor(
+                config.get_numpy_codebook().data, backend=weight.backend, dtype=TensorDataType.float16
+            )
+        variants[1] = fns.tensor(
+            list(range(-self._num_elements // 2, self._num_elements - self._num_elements // 2)),
+            backend=weight.backend,
+            dtype=TensorDataType.float16,
+        )


Could you please add comments to this part (in calculate_codebook fn as well) to expain the logic behind this?

src/nncf/quantization/algorithms/weight_compression/codebook_estimation.py

daniil-lyakhov · 2026-01-13T10:53:25Z

tests/openvino/native/quantization/test_weights_compression.py

+@pytest.mark.parametrize("value_type", [None, TensorDataType.float16, TensorDataType.f8e4m3, TensorDataType.int8])
+def test_adaptive_codebooks(value_type):
+    model = AWQMatmulModel().ov_model


Tests with reference codebooks and in per_group / not_per_group woudl be nice

alexsu52 and others added 30 commits September 2, 2024 13:22

Support scale estimation inside GPTQ

488cacc

fix for INT4_ASYM

ee64877

Merge remote-tracking branch 'upstream/develop' into develop

f22e411

Merge remote-tracking branch 'upstream/develop' into develop

51b4d7b

Merge remote-tracking branch 'upstream/develop' into develop

f66cd1e

Merge remote-tracking branch 'upstream/develop' into develop

7ce5a53

Merge remote-tracking branch 'upstream/develop' into develop

f74d156

Merge remote-tracking branch 'upstream/develop' into develop

5288c79

Merge remote-tracking branch 'upstream/develop' into develop

1becf15

Merge remote-tracking branch 'upstream/develop' into develop

047d7d9

Merge remote-tracking branch 'upstream/develop' into develop

c0c7e57

Merge remote-tracking branch 'upstream/develop' into develop

b74dea1

Merge remote-tracking branch 'upstream/develop' into develop

26a9a77

Merge remote-tracking branch 'upstream/develop' into develop

25fcc2c

Merge remote-tracking branch 'upstream/develop' into develop

26d4887

Merge remote-tracking branch 'upstream/develop' into develop

7748233

Merge remote-tracking branch 'upstream/develop' into develop

df251b3

Merge remote-tracking branch 'upstream/develop' into develop

4c134c4

Merge remote-tracking branch 'upstream/develop' into develop

6147097

Merge remote-tracking branch 'upstream/develop' into develop

2b94d28

Merge remote-tracking branch 'upstream/develop' into develop

5e312a5

Merge remote-tracking branch 'upstream/develop' into develop

2c5e983

Merge remote-tracking branch 'upstream/develop' into develop

1d8db1e

Merge remote-tracking branch 'upstream/develop' into develop

7244f18

Merge remote-tracking branch 'upstream/develop' into develop

443048c

Merge remote-tracking branch 'upstream/develop' into develop

80d2d8a

Merge remote-tracking branch 'upstream/develop' into develop

06bb19b

Merge remote-tracking branch 'upstream/develop' into develop

5d97d87

Merge remote-tracking branch 'upstream/develop' into develop

ae7cece

Initial codebook estimation algorithm.

3bcd47b

andreyanufr added 11 commits October 13, 2025 16:37

Fixed bug with codebook type..

c6f72ee

Fixed bug with cb4 codebook conversion to fp8.

8497f4e

Disabled codebook estimation for onnx and torch.

535d2da

Temporal fix for empty cluster.

6b2e7f7

Per MatMul type codebook.

4e54047

Changed interval step to number of intervals.

998f996

Weighted codebook selection.

5708ceb

Changed codebook data type.

aded7f2

Fixed bug.

2aac843

Codebook datatype for experiments.

fb5b7d8

Fixed merge conflicts.

708595a

andreyanufr marked this pull request as ready for review January 7, 2026 08:41

andreyanufr requested a review from a team as a code owner January 7, 2026 08:41

andreyanufr added 2 commits January 7, 2026 14:38

Removed codebook_estimation paramater and replaced it with ADAPTIVE_C…

37765b3

…ODEBOOK mode.

Added adaptive codebook parameters.

97ad50d

github-actions bot added the documentation Improvements or additions to documentation label Jan 9, 2026

andreyanufr added 2 commits January 9, 2026 13:00

1) Added example with adaptive codebook.

8064c3e

2) Added extra advanced parameters for adapriva codebook.

Added example to test.

0be0f7d

andreyanufr requested review from AlexanderDokuchaev and daniil-lyakhov January 9, 2026 15:49

github-actions bot added the NNCF OpenVINO Pull requests that updates NNCF OpenVINO label Jan 12, 2026

andreyanufr added 2 commits January 12, 2026 15:02

Added test for adaptiva codebook.

1b25dc1

Added codebook parameters check.

836d7f8

daniil-lyakhov reviewed Jan 13, 2026

View reviewed changes

andreyanufr added 6 commits January 13, 2026 16:44

Applied comments.

880d5fb

Added support of group_size for per-tensor codebook.

3d00b66

Fixed merge conflict.

3eee52d

Check advanced codebook paramaters only in case of right mode.

c048a6a

Fixed bug in merging.

4e14f96

Fixed bug with empty histogram bins.

ce11a60

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Aanuf/lut per layer merged #3684

Aanuf/lut per layer merged #3684

Uh oh!

andreyanufr commented Oct 8, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

daniil-lyakhov Jan 13, 2026

Uh oh!

Uh oh!

daniil-lyakhov Jan 13, 2026

Uh oh!

Uh oh!

daniil-lyakhov Jan 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Aanuf/lut per layer merged #3684

Are you sure you want to change the base?

Aanuf/lut per layer merged #3684

Uh oh!

Conversation

andreyanufr commented Oct 8, 2025

Changes

Reason for changes

Related tickets

Tests

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

daniil-lyakhov Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

daniil-lyakhov Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

daniil-lyakhov Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants