You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: blog/en/how-to-debug-slow-requests-in-milvus.md
+9-9Lines changed: 9 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -40,7 +40,7 @@ Key panels include:
40
40
41
41
***Service Quality → Search Latency**: Shows overall latency distribution. If this looks normal, but end users still see delays, the problem is likely outside Milvus—in the network or application layer.
42
42
43
-
***Query Node → Search Latency by Phase**: Breaks latency into queue, query, and reduce stages. For deeper attribution, panels such as _Scalar__Filter Latency_, _Vector Search Latency_, and _Wait Safe Latency_ reveal which stage dominates.
43
+
***Query Node → Search Latency by Phase**: Breaks latency into queue, query, and reduce stages. For deeper attribution, panels such as _Scalar__Filter Latency_, _Vector Search Latency_, and _Wait tSafe Latency_ reveal which stage dominates.
44
44
45
45
46
46
### Milvus Logs
@@ -74,7 +74,7 @@ A common cause of slow requests is an excessive workload. When a request has a v
* Logs show a request with a large NQ and a long total duration, but a relatively small durationPerNQ—indicating that one oversized request is dominating resources.
80
80
@@ -88,7 +88,7 @@ A common cause of slow requests is an excessive workload. When a request has a v
88
88
89
89
### Inefficient Filtering
90
90
91
-
Another common bottleneck comes from inefficient filters. If filter expressions are poorly structured or fields lack scalar indexes, Milvus may fall back to a **full scan** instead of scanning a small, targeted subset. JSON filters and strict consistency settings can further increase overhead.
91
+
Another common bottleneck comes from inefficient filters. If filter expressions are poorly conducted or fields lack scalar indexes, Milvus may fall back to a **full scan** instead of scanning a small, targeted subset. JSON filters and strict consistency settings can further increase overhead.
Milvus also introduces a filter expression templating mechanism designed to improve efficiency by reducing the time spent parsing complex expressions. See [this document](https://milvus.io/docs/filtering-templating.md) for more details.
115
+
*Milvus also introduces a filter expression templating mechanism designed to improve efficiency by reducing the time spent parsing complex expressions. See [this document](https://milvus.io/docs/filtering-templating.md) for more details.
116
116
117
-
***Add proper indexes**: Avoid full scans by creating scalar indexes on fields used in filters.
118
-
119
-
***Handle JSON efficiently**: Milvus 2.6 introduced path and flat indexes for JSON fields, enabling efficient handling of JSON data. JSON shredding is also on [the roadmap](https://milvus.io/docs/roadmap.md) to further improve performance. Refer to [the JSON field document](https://milvus.io/docs/use-json-fields.md#JSON-Field) for additional information.
120
-
121
-
***Tune consistency level**: Use `_Bounded`_ or `_Eventually`_ consistent reads when strict guarantees are not required, reducing `tSafe` wait time.
117
+
***Add proper indexes**: Avoid full scans by creating scalar indexes on fields used in filters.
118
+
119
+
***Handle JSON efficiently**: Milvus 2.6 introduced path and flat indexes for JSON fields, enabling efficient handling of JSON data. JSON shredding is also on [the roadmap](https://milvus.io/docs/roadmap.md) to further improve performance. Refer to [the JSON field document](https://milvus.io/docs/use-json-fields.md#JSON-Field) for additional information.
120
+
121
+
***Tune consistency level**: Use *Bounded* or *Eventually* consistent reads when strict guarantees are not required, reducing *tSafe* wait time.
0 commit comments