Skip to content

Enabling support of rerankers models 2B and 8B of qwen3vl#921

Open
quic-amitraj wants to merge 1 commit intoquic:mainfrom
quic-amitraj:qwen3_rankers
Open

Enabling support of rerankers models 2B and 8B of qwen3vl#921
quic-amitraj wants to merge 1 commit intoquic:mainfrom
quic-amitraj:qwen3_rankers

Conversation

@quic-amitraj
Copy link
Copy Markdown
Contributor

@quic-amitraj quic-amitraj commented Apr 18, 2026

This PR adds support for the following Qwen3-VL reranker models on AI100:

  • Qwen/Qwen3-VL-Reranker-2B
  • Qwen/Qwen3-VL-Reranker-8B

The support is implemented using the existing QEff image-text-to-text flow (dual QPC), with model parity validation focused on PyTorch(original) vs AI100.

Results

Model PyTorch score AI100 score MAD max Status
Qwen/Qwen3-VL-Reranker-2B 0.3213230073 0.3259495199 4.626513e-03 Pass
Qwen/Qwen3-VL-Reranker-8B 0.6058825254 0.6043989062 1.483619e-03 Pass

@quic-amitraj quic-amitraj changed the title Enabling support of rerankers models 2B and 8B of qwen3vl bucket Enabling support of rerankers models 2B and 8B of qwen3vl Apr 18, 2026
Signed-off-by: Amit Raj <amitraj@qti.qualcomm.com>
@quic-amitraj quic-amitraj marked this pull request as ready for review April 19, 2026 10:45
@quic-amitraj quic-amitraj self-assigned this Apr 20, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant