File tree Expand file tree Collapse file tree
docs/source/developer_documentation Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -55,6 +55,13 @@ Two config fields tune this:
5555- ` jobs_per_gpu ` (default ` 1 ` ): jobs processed per GPU at once. Total workers = GPUs * ` jobs_per_gpu ` .
5656- ` max_queue_size ` (default ` null ` = ` num_gpu ` * ` jobs_per_gpu ` * 10): pending-job cap, uploads beyond it get ` 503 ` http error.
5757
58+ GPUs are auto-detected, but you can restrict which (and how many) mmore uses with the ` CUDA_VISIBLE_DEVICES ` environment variable:
59+
60+ ``` bash
61+ # If you want to use only GPUs 0 and 2
62+ CUDA_VISIBLE_DEVICES=0,2 python3 -m mmore index-api --config-file /path/to/config.yaml --host the_host --port the_port
63+ ```
64+
5865``` {note}
5966For `jobs_per_gpu > 1`, prefer a **Milvus Standalone** server
6067(`db.uri: http://localhost:19530`) over Milvus Lite. Milvus Standalone is better
You can’t perform that action at this time.
0 commit comments