Skip to content

Commit 1766d76

Browse files
committed
Fix URLPatternFilter bug that removes slashes from wildcard patterns
1 parent e1d9e24 commit 1766d76

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

crawl4ai/deep_crawling/filters.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -186,7 +186,7 @@ def _add_pattern(self, pattern: str, pattern_type: int):
186186
elif pattern_type == self.PATTERN_TYPES["SUFFIX"]:
187187
self._simple_suffixes.add(pattern[2:])
188188
elif pattern_type == self.PATTERN_TYPES["PREFIX"]:
189-
self._simple_prefixes.add(pattern[:-2])
189+
self._simple_prefixes.add(pattern[:-1])
190190
elif pattern_type == self.PATTERN_TYPES["DOMAIN"]:
191191
self._domain_patterns.append(re.compile(pattern.replace("*.", r"[^/]+\.")))
192192
else:

0 commit comments

Comments
 (0)