Skip to content

Commit c8c4f17

Browse files
authored
Strip URLs from sitemap.
Fixes #30829
1 parent 2282762 commit c8c4f17

File tree

1 file changed

+1
-1
lines changed
  • libs/community/langchain_community/document_loaders

1 file changed

+1
-1
lines changed

libs/community/langchain_community/document_loaders/sitemap.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -185,7 +185,7 @@ def parse_sitemap(self, soup: Any, *, depth: int = 0) -> List[dict]:
185185

186186
els.append(
187187
{
188-
tag: prop.text
188+
tag: prop.text.strip()
189189
for tag in ["loc", "lastmod", "changefreq", "priority"]
190190
if (prop := url.find(tag))
191191
}

0 commit comments

Comments
 (0)