Npu docs by rasapala · Pull Request #3962 · openvinotoolkit/model_server

rasapala · 2026-02-09T16:48:53Z

🛠 Summary

Updating export and NPU usage.

🧪 Checklist

Unit tests added.
The documentation updated.
Change follows security best practices.
``

…int4

dtrawins · 2026-02-10T10:07:55Z

demos/embeddings/README.md

this is inconsistent with the command used later. should be /dev/accel

dtrawins · 2026-02-10T10:09:14Z

demos/embeddings/README.md

Only NPU tests are limited to qwen3-embeddings

Yes and it is clearly stated in documentation. Scroll up to see full list of validated models on CPU/GPU. The list you comment clearly specifies models validated on NPU, which is correct

dtrawins · 2026-02-10T10:10:07Z

demos/embeddings/README.md

client code should be the same for all target devices

demos/embeddings/README.md

Co-authored-by: Trawinski, Dariusz <dariusz.trawinski@intel.com>

dtrawins · 2026-02-18T12:51:45Z

demos/embeddings/README.md

why do we have a separate table for models tested and tested on npu? It would be clearer to add a column with a checkbox for npu enabled models.

dtrawins · 2026-02-18T12:52:39Z

demos/embeddings/README.md

This should be in a the section related to model export above. No need to make it a separate chapter.

dtrawins · 2026-02-18T12:54:02Z

demos/embeddings/README.md

@rasapala @michalkulakowski do we have all tests completed to confirm int4 gives good results?

Npu docs

f90fd93

rasapala requested review from dtrawins and michalkulakowski February 9, 2026 16:48

rasapala and others added 2 commits February 9, 2026 17:55

Spell

6fde330

Remove reference that all models should work, they dont

eebc30f

dkalinowski approved these changes Feb 10, 2026

View reviewed changes

Remove default quantization params since those are only required for …

82faddc

…int4

dkalinowski force-pushed the npu_docs branch from b242b08 to 82faddc Compare February 10, 2026 10:14

dtrawins reviewed Feb 10, 2026

View reviewed changes

rasapala and others added 5 commits February 16, 2026 13:18

Apply suggestions from code review

4bc4006

Co-authored-by: Trawinski, Dariusz <dariusz.trawinski@intel.com>

Merge branch 'main' into npu_docs

73637dd

Code review

aabf976

Fix

db1d4ee

Fix model

f63121c

dtrawins reviewed Feb 18, 2026

View reviewed changes

Code review

a2899c3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Npu docs#3962

Npu docs#3962
rasapala wants to merge 10 commits intomainfrom
npu_docs

rasapala commented Feb 9, 2026

Uh oh!

dtrawins Feb 10, 2026

Uh oh!

dtrawins Feb 10, 2026

Uh oh!

dkalinowski Feb 10, 2026

Uh oh!

dtrawins Feb 10, 2026

Uh oh!

Uh oh!

Uh oh!

dtrawins Feb 18, 2026

Uh oh!

dtrawins Feb 18, 2026

Uh oh!

dtrawins Feb 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

rasapala commented Feb 9, 2026

🛠 Summary

🧪 Checklist

Uh oh!

dtrawins Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

dtrawins Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

dkalinowski Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

dtrawins Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

dtrawins Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

dtrawins Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

dtrawins Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments