[FORK][FEATURE][x64] 3D int4 reorders for FC layout#292
Merged
maxnick merged 2 commits intov3.8_for_ie_masterfrom Nov 5, 2025
Merged
[FORK][FEATURE][x64] 3D int4 reorders for FC layout#292maxnick merged 2 commits intov3.8_for_ie_masterfrom
maxnick merged 2 commits intov3.8_for_ie_masterfrom
Conversation
github-merge-queue bot
pushed a commit
to openvinotoolkit/openvino
that referenced
this pull request
Nov 5, 2025
### Details: In this PR we introduce yet another operation "GatherMatmu", which essentially does gemv operations over the current tokens and the active experts. As the first step, we perform gemv operation using the dnnl::inner_product. But obviously this solution is suboptimal, as it doesn't give a fine grain control over parallelization, and in the case of many tokens being processed by a specific expert (prefill), having gemm operation may be more optimal as the tokens may be batched and we can do SIMD level parallelization by tokens as well. Also this PR contains all the essential transformations that allow to enable a few common MoE patterns. MoE pattern matcher is based on #32183 Related oneDNN fork PR: openvinotoolkit/oneDNN#292 ### Tickets: - CVS-171910 --------- Co-authored-by: Vladislav Golubev <vladislav.golubev@intel.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Description
This PR adds absent 3d FC related reorders for 4bit data types.
OpenVINO PR: openvinotoolkit/openvino#32450