Skip to content

[NPUW]Add MOE caching properties to NPU plugin#34648

Open
intelgaoxiong wants to merge 4 commits intoopenvinotoolkit:masterfrom
intelgaoxiong:xiong/fixed_missing_prop
Open

[NPUW]Add MOE caching properties to NPU plugin#34648
intelgaoxiong wants to merge 4 commits intoopenvinotoolkit:masterfrom
intelgaoxiong:xiong/fixed_missing_prop

Conversation

@intelgaoxiong
Copy link
Contributor

Details:

Add the following NPUW MOE-related properties to _cachingProperties:

  • NPUW_MOE_TOKEN_CHUNK_SIZE
  • NPUW_MOE_POOL_SIZE
  • NPUW_LLM_PREFILL_MOE_HINT
  • NPUW_LLM_GENERATE_MOE_HINT

These properties affect model compilation and need to be part of the cache key.

Tickets:

  • ticket-id

AI Assistance:

  • AI assistance used: no / yes
  • If yes, summarize how AI was used and what human validation was performed (build/tests/manual checks).

Add the following NPUW MOE-related properties to _cachingProperties:
- NPUW_MOE_TOKEN_CHUNK_SIZE
- NPUW_MOE_POOL_SIZE
- NPUW_LLM_PREFILL_MOE_HINT
- NPUW_LLM_GENERATE_MOE_HINT

These properties affect model compilation and need to be part of the cache key.
@github-actions github-actions bot added the category: NPU OpenVINO NPU plugin label Mar 12, 2026
@intelgaoxiong intelgaoxiong marked this pull request as ready for review March 12, 2026 04:52
@intelgaoxiong intelgaoxiong requested review from a team as code owners March 12, 2026 04:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: NPU OpenVINO NPU plugin

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant