Add dot product computation between two quantized vectors by vekterli · Pull Request #37250 · vespa-engine/vespa

vekterli · 2026-06-18T15:52:52Z

@havardpe please review

This is more efficient than explicitly dequantizing the vectors (and then running a float32 dot product) since it does not involve any inverse rotations. Vectors must have been quantized using the logically same quantizer and using InnerProduct mode for the output to make sense.

Same const-ness/thread-safety caveats apply as other quantizer functions due to the current need for scratch space for unpacking of bits.

This is more efficient than explicitly dequantizing the vectors (and then running a float32 dot product) since it does not involve any inverse rotations. Vectors must have been quantized using the _logically_ same quantizer, using `InnerProduct` mode. Same `const`-ness/thread-safety caveats apply as other quantizer functions due to the current need for scratch space for unpacking of bits.

vekterli requested a review from havardpe June 18, 2026 15:52

havardpe approved these changes Jun 19, 2026

View reviewed changes

vekterli merged commit 78cb90a into master Jun 19, 2026
3 checks passed

vekterli deleted the vekterli/quantized-lhs-rhs-dot-product branch June 19, 2026 09:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add dot product computation between two quantized vectors#37250

Add dot product computation between two quantized vectors#37250
vekterli merged 1 commit into
masterfrom
vekterli/quantized-lhs-rhs-dot-product

vekterli commented Jun 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

vekterli commented Jun 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants