Skip to content

Commit c857eba

Browse files
authored
Update how-to-debug-slow-requests-in-milvus.md
1 parent 2e9c180 commit c857eba

File tree

1 file changed

+9
-9
lines changed

1 file changed

+9
-9
lines changed

blog/en/how-to-debug-slow-requests-in-milvus.md

Lines changed: 9 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,7 @@ Key panels include:
4040

4141
* **Service Quality → Search Latency**: Shows overall latency distribution. If this looks normal, but end users still see delays, the problem is likely outside Milvus—in the network or application layer.
4242

43-
* **Query Node → Search Latency by Phase**: Breaks latency into queue, query, and reduce stages. For deeper attribution, panels such as _Scalar_ _Filter Latency_, _Vector Search Latency_, and _Wait Safe Latency_ reveal which stage dominates.
43+
* **Query Node → Search Latency by Phase**: Breaks latency into queue, query, and reduce stages. For deeper attribution, panels such as _Scalar_ _Filter Latency_, _Vector Search Latency_, and _Wait tSafe Latency_ reveal which stage dominates.
4444

4545

4646
### Milvus Logs
@@ -74,7 +74,7 @@ A common cause of slow requests is an excessive workload. When a request has a v
7474

7575
* All queries show unexpectedly high latency.
7676

77-
* Query Node metrics report rising **in-queue latency**.
77+
* Query Node metrics report high **in-queue latency**.
7878

7979
* Logs show a request with a large NQ and a long total duration, but a relatively small durationPerNQ—indicating that one oversized request is dominating resources.
8080

@@ -88,7 +88,7 @@ A common cause of slow requests is an excessive workload. When a request has a v
8888

8989
### Inefficient Filtering
9090

91-
Another common bottleneck comes from inefficient filters. If filter expressions are poorly structured or fields lack scalar indexes, Milvus may fall back to a **full scan** instead of scanning a small, targeted subset. JSON filters and strict consistency settings can further increase overhead.
91+
Another common bottleneck comes from inefficient filters. If filter expressions are poorly conducted or fields lack scalar indexes, Milvus may fall back to a **full scan** instead of scanning a small, targeted subset. JSON filters and strict consistency settings can further increase overhead.
9292

9393
![ ](https://assets.zilliz.com/inefficient_filtering_e524615d63.png)
9494

@@ -112,13 +112,13 @@ tag = {"tag": ["A", "B", "C", "D"]}
112112
filter_expr = "tag IN {tag}"
113113
```
114114

115-
Milvus also introduces a filter expression templating mechanism designed to improve efficiency by reducing the time spent parsing complex expressions. See [this document](https://milvus.io/docs/filtering-templating.md) for more details.
115+
* Milvus also introduces a filter expression templating mechanism designed to improve efficiency by reducing the time spent parsing complex expressions. See [this document](https://milvus.io/docs/filtering-templating.md) for more details.
116116

117-
* **Add proper indexes**: Avoid full scans by creating scalar indexes on fields used in filters.
118-
119-
* **Handle JSON efficiently**: Milvus 2.6 introduced path and flat indexes for JSON fields, enabling efficient handling of JSON data. JSON shredding is also on [the roadmap](https://milvus.io/docs/roadmap.md) to further improve performance. Refer to [the JSON field document](https://milvus.io/docs/use-json-fields.md#JSON-Field) for additional information.  
120-
121-
* **Tune consistency level**: Use `_Bounded`_ or `_Eventually`_ consistent reads when strict guarantees are not required, reducing `tSafe` wait time.
117+
* **Add proper indexes**: Avoid full scans by creating scalar indexes on fields used in filters.
118+
119+
* **Handle JSON efficiently**: Milvus 2.6 introduced path and flat indexes for JSON fields, enabling efficient handling of JSON data. JSON shredding is also on [the roadmap](https://milvus.io/docs/roadmap.md) to further improve performance. Refer to [the JSON field document](https://milvus.io/docs/use-json-fields.md#JSON-Field) for additional information.  
120+
121+
* **Tune consistency level**: Use *Bounded* or *Eventually* consistent reads when strict guarantees are not required, reducing *tSafe* wait time.
122122

123123

124124
### Improper Choice of Vector Index

0 commit comments

Comments
 (0)