Npu docs #3962

Open
rasapala wants to merge 10 commits into main from npu_docs

@rasapala rasapala commented Feb 9, 2026

🛠 Summary

Updating export and NPU usage.

🧪 Checklist

  • Unit tests added.
  • The documentation updated.
  • Change follows security best practices.

This is inconsistent with the command used later; it should be /dev/accel.

Only the NPU tests are limited to qwen3-embeddings.

Yes, and it is clearly stated in the documentation. Scroll up to see the full list of models validated on CPU/GPU. The list you commented on clearly specifies the models validated on NPU, which is correct.

Client code should be the same for all target devices.

Why do we have separate tables for models tested and models tested on NPU? It would be clearer to add a column with a checkbox for NPU-enabled models.

This should be in the section related to model export above. There is no need to make it a separate chapter.

@rasapala @michalkulakowski do we have all the tests completed to confirm that int4 gives good results?

3 participants