Skip to content

some questions about horizontal scaling and resources #40881

Discussion options

You must be logged in to vote

Thank you, that has clarified some things for me.

I have one more question: could you provide any advice on increasing maximum search throughput? Currently, increasing QPS results in higher latency, but throughput remains +- the same. My setup is:

  • Milvus 2.5.6
  • 3 query nodes (8 CPU, 56 GiB RAM each)
  • 3 collections, each with 12 million vectors (vector size = 1024), indexed using IVF_FLAT with nlist = 3162
  • Each collection has 1 partition, about 40 segments (1 GB), 1 replica, and Bounded consistency.

I have already found in discussions that one way to increase search performance is to set replicas = 2 and deploy x2 query nodes. However, this seems quite expensive, as each additional 80-100…

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@xiaofan-luan
Comment options

Answer selected by artesby
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants