||**`AhoCorasickBufferedProcessorNormalized`**<br>✓ Whitespace-invariant detection<br>✓ Flexible text matching<br>✓ Preserves original text positions |
### Utility Functions
@@ -58,14 +26,14 @@ The **Pattern Detection Module** provides utilities for detecting predefined pat
The module processes text in two primary stages:
1. **Pattern Preprocessing**
- - Patterns are loaded from YAML configuration
- - The Aho-Corasick automaton is constructed from patterns
- - Failure links connect states for efficient pattern transitions
+ - Patterns are loaded from YAML configuration
+ - The Aho-Corasick automaton is constructed from patterns
+ - Failure links connect states for efficient pattern transitions
2. **Buffered Text Processing**
- - Text is processed in manageable chunks
- - Partial matches at chunk boundaries are preserved
- - Match information includes pattern name and position
+ - Text is processed in manageable chunks
+ - Partial matches at chunk boundaries are preserved
+ - Match information includes pattern name and position
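
The two stages above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the module's actual implementation: the class `AhoCorasick`, the function `find_in_chunks`, and the sample patterns are hypothetical names chosen for the example, and patterns are passed as a plain dict instead of being loaded from YAML. The point it shows is that carrying the automaton state from one chunk to the next is enough to keep partial matches alive across chunk boundaries.

```python
from collections import deque


class AhoCorasick:
    """Minimal Aho-Corasick automaton (hypothetical, for illustration only)."""

    def __init__(self, patterns):
        # patterns: dict mapping a pattern name to its literal string
        self.goto = [{}]   # goto[state][char] -> next state
        self.fail = [0]    # failure link of each state
        self.out = [[]]    # (name, length) pairs recognized at each state
        for name, pattern in patterns.items():
            self._add(name, pattern)
        self._build_failure_links()

    def _add(self, name, pattern):
        state = 0
        for ch in pattern:
            if ch not in self.goto[state]:
                self.goto.append({})
                self.fail.append(0)
                self.out.append([])
                self.goto[state][ch] = len(self.goto) - 1
            state = self.goto[state][ch]
        self.out[state].append((name, len(pattern)))

    def _build_failure_links(self):
        # Breadth-first traversal: depth-1 states keep the root as failure link.
        queue = deque(self.goto[0].values())
        while queue:
            state = queue.popleft()
            for ch, nxt in self.goto[state].items():
                queue.append(nxt)
                f = self.fail[state]
                while f and ch not in self.goto[f]:
                    f = self.fail[f]
                self.fail[nxt] = self.goto[f].get(ch, 0)
                # Inherit patterns that end at the failure target.
                self.out[nxt] = self.out[nxt] + self.out[self.fail[nxt]]

    def step(self, state, ch):
        # Follow failure links until a transition on ch exists (or we hit root).
        while state and ch not in self.goto[state]:
            state = self.fail[state]
        return self.goto[state].get(ch, 0)


def find_in_chunks(automaton, chunks):
    """Yield (name, start, end) with offsets relative to the whole stream."""
    state = 0    # carried across chunks, so boundary matches are not lost
    offset = 0   # number of characters consumed so far
    for chunk in chunks:
        for ch in chunk:
            state = automaton.step(state, ch)
            offset += 1
            for name, length in automaton.out[state]:
                yield name, offset - length, offset


patterns = {"greeting": "hello", "short": "he", "place": "world"}
ac = AhoCorasick(patterns)
matches = list(find_in_chunks(ac, ["say hel", "lo wor", "ld"]))
```

Here `"hello"` and `"world"` each straddle a chunk boundary and are still reported, with `start` and `end` counted from the beginning of the stream. Only the current state and a running offset are kept between chunks, which is what makes this style of processing suitable for streaming input.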
### Text Normalization Pipeline
@@ -81,16 +49,19 @@ The module processes text in two primary stages:
## Performance Considerations
- * **Time Complexity**: O(n + m + k) where:
+ **Time Complexity** O(n + m + k) where:
+
* n = length of input text
* m = total length of all patterns
* k = number of pattern occurrences
- * **Space Efficiency**:
+ **Space Efficiency**:
+
* Buffered processing minimizes memory usage
* Suitable for streaming applications with unbounded input
- * **Flexibility vs. Performance**:
+ **Flexibility vs. Performance**:
+
* Standard processors offer exact matching with minimal overhead
* Normalized processors provide flexibility with slight computational cost
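
To make the cost of normalized matching concrete, here is a hedged sketch of one way whitespace-invariant detection can still report original text positions. It assumes that normalization means collapsing each run of whitespace to a single space; the module's actual normalization rules are not shown in this excerpt. The helpers `normalize_with_map` and `find_normalized` are hypothetical, and plain `str.find` stands in for the Aho-Corasick search to keep the example short.

```python
def normalize_with_map(text):
    """Collapse whitespace runs to one space and record original indices."""
    chars = []
    index_map = []   # index_map[i] = index in `text` of normalized char i
    in_space = False
    for i, ch in enumerate(text):
        if ch.isspace():
            if not in_space:
                chars.append(" ")
                index_map.append(i)
            in_space = True
        else:
            chars.append(ch)
            index_map.append(i)
            in_space = False
    return "".join(chars), index_map


def find_normalized(text, pattern):
    """Return (start, end) spans in the ORIGINAL text for each match."""
    norm_text, index_map = normalize_with_map(text)
    norm_pattern, _ = normalize_with_map(pattern)
    if not norm_pattern:
        return []
    spans = []
    start = norm_text.find(norm_pattern)
    while start != -1:
        last = start + len(norm_pattern) - 1   # last matched normalized char
        spans.append((index_map[start], index_map[last] + 1))
        start = norm_text.find(norm_pattern, start + 1)
    return spans


text = "x  api \n\t key"
spans = find_normalized(text, "api key")
matched = [text[a:b] for a, b in spans]
```

The extra work compared with exact matching is one pass to build the normalized text plus an index map of the same length, which is consistent with the "slight computational cost" noted above.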