This is an n8n community node. It lets you use crawl and scrape websites in your n8n workflows.
I used crawlee for this project. Crawlee is a powerful web scraping and crawling library that helps you extract data from websites efficiently and reliably.
n8n is a fair-code licensed workflow automation platform.
Installation
Operations
Credentials
Compatibility
Resources
Version history
Follow the installation guide in the n8n community nodes documentation.
The node supports the following operations:
- Extract Links: Crawls a website and extracts all links found on the page
- Extract Text: Extracts all text content from a webpage
- Extract HTML: Retrieves the raw HTML content of a webpage
No credentials are required to use this node.
This node is compatible with n8n version 1.0.0 and above.
- Initial release
- Added support for basic web scraping operations:
- Extract Links
- Extract Text
- Extract HTML