-
|
According to the following documentation: https://qdrant.tech/documentation/guides/quantization/#binary-quantization It is mentioned that “Binary quantization can achieve up to a 40x speedup compared to the original vectors.” I want to confirm whether this 40x speedup refers to an improvement in RPS (requests per second) when using binary quantization, compared to using the original vectors. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 1 reply
-
|
Yes, that translates to 40x the RPS. It is of course dependent on your configuration and accuracy needs. Some more details here: https://qdrant.tech/articles/binary-quantization/ |
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.

#7922 (comment)