A test page designed to validate web crawlers' compliance with restrictions and anti-scraping measures. Features rate limiting, access controls, and crawling deterrents.

CameronWhiteside/crawl-stop

CrawlStop.com

A test page designed to validate web crawlers' ability to respect crawling controls and anti-scraping measures. Built with modern web standards including semantic HTML5, JSON-LD structured data, and various crawling deterrents.

Features

  • Semantic HTML5 markup
  • JSON-LD structured data
  • Cloudflare-managed robots.txt with restrictive directives
  • sitemap.xml
  • Anti-crawling controls and deterrents
  • Rate limiting and access controls
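
As an illustration of what respecting the robots.txt feature means for a crawler, the sketch below checks a URL against a robots.txt body before fetching it, using Python's standard `urllib.robotparser`. The `SAMPLE_ROBOTS_TXT` contents, the `is_allowed` helper, and the `ExampleBot` user agent are assumptions made for this example; the file actually served by the site may contain different directives.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical restrictive robots.txt used for illustration only.
# The real file served by the site may differ.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # A compliant crawler would skip the fetch when this prints False.
    print(is_allowed(SAMPLE_ROBOTS_TXT, "ExampleBot", "https://crawlstop.com/"))
```

In a real crawler, the robots.txt body would be downloaded from the target host first (for example with `RobotFileParser.set_url` and `read`), and the check would run before every request.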

Purpose

Visit crawlstop.com to test whether your web crawler complies with crawling restrictions. The page is designed to discourage and block automated access, which makes it a reliable baseline for testing how a crawler behaves when it encounters anti-scraping measures and access controls.
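
One way a crawler might react to rate limiting and access controls is sketched below. It assumes the server signals throttling with HTTP 429 or 503 and an optional `Retry-After` header given in seconds; the status codes crawlstop.com actually returns are not documented here, so treat these values as assumptions. The `retry_delay` helper and its `base` and `cap` parameters are hypothetical names chosen for this example.

```python
from typing import Mapping, Optional


def retry_delay(status: int, headers: Mapping[str, str], attempt: int,
                base: float = 1.0, cap: float = 60.0) -> Optional[float]:
    """Return seconds to wait before retrying, or None if no retry should happen.

    Assumes header names are given exactly as "Retry-After" (case-sensitive
    lookup) and handles only the integer-seconds form of that header.
    """
    if status not in (429, 503):
        # Success needs no retry; other errors such as 403 are treated
        # as a hard stop rather than something to hammer on.
        return None
    retry_after = headers.get("Retry-After", "").strip()
    if retry_after.isdigit():
        # Honor the server's explicit request in full.
        return float(retry_after)
    # Otherwise fall back to capped exponential backoff: base * 2**attempt.
    return min(base * 2 ** attempt, cap)


if __name__ == "__main__":
    print(retry_delay(429, {"Retry-After": "30"}, attempt=0))
    print(retry_delay(429, {}, attempt=3))
    print(retry_delay(403, {}, attempt=0))
```

Returning `None` for a 403 reflects the design choice that a blocked crawler should stop instead of retrying; a crawler that keeps requesting after an explicit denial would fail the kind of compliance test this page is meant for.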

Development

The site is built with React and Cloudflare Workers. To preview it locally:

npm install
npm run build
npm run preview

License

MIT License. See the LICENSE file for details.

Contributing

Feel free to submit issues, feature requests, or pull requests. This project is open source and welcomes contributions from the community.
