Skip to content

U2 dq mat mul rebase#36329

Draft
akodanka wants to merge 2 commits into
openvinotoolkit:masterfrom
akodanka:u2_DQMatMul_rebase
Draft

U2 dq mat mul rebase#36329
akodanka wants to merge 2 commits into
openvinotoolkit:masterfrom
akodanka:u2_DQMatMul_rebase

Conversation

@akodanka

@akodanka akodanka commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

Details:

  • item1
  • ...

Tickets:

  • ticket-id

AI Assistance:

  • AI assistance used: no / yes
  • If yes, summarize how AI was used and what human validation was performed (build/tests/manual checks).

@github-actions github-actions Bot added category: NPU OpenVINO NPU plugin category: NPUW NPUW plugin labels Jun 9, 2026
akodanka added 2 commits June 9, 2026 23:23
Introduce a new graph-rewrite pass (DQMatMulCWuAsymm) that restructures the
asymmetric dequantize-then-MatMul subgraph:

  cast(W) -> Subtract(Z) -> Multiply(S) -> MatMul(Act)

into a form where the quantized weights remain at the MatMul input:

  MatMul(Act, cast(W)) -> Subtract(ReduceSum(Act)*Z) -> Multiply(S)

This allows the NPU sub-compiler to fuse the Convert+MatMul into a native
quant-MatMul kernel, while the zero-point and scale corrections are applied
post-MatMul on small tensors.

Supported weight types: u2, u4, u8, i4, i8 with f32 zero-point and scale
of shape [N, 1] (channel-wise).

Register the new matcher in Partitioner::optimize() alongside DQMatMulCWi.

Signed-off-by: Anoob Anto Kodankandath <anoob.anto.kodankandath@intel.com>
The CWAI path in getPartitioning() was missing the optimize() call that
the FOLD path already makes. Without it, the DQ graph-rewrite passes
are never run.

Add p.optimize so that NPUW_DQ-gated rewrites apply on the CWAI path.

Signed-off-by: Anoob Anto Kodankandath <anoob.anto.kodankandath@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: NPU OpenVINO NPU plugin category: NPUW NPUW plugin

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant