Relax AI scrape policy#510
Conversation
Switch to the permissive scrape policy to allow more well behaved AI crawlers. Signed-off-by: SuperQ <superq@gmail.com>
|
This should implement similar to #451. |
|
Do we need to make some updates to robots.txt with this? I'm admittedly getting a little head-swirly with the number of layers of bot detection here. right now robots.txt only blocks ClaudeBot and Amazonbot, but not GPTBot, Google-Extended, CCBot... etc |
|
Currently I only get this: |
oh i see -- there's an extant playbook for robots_txt that hasn't been run, maybe? from April 2024 (Sorry, still finding a lot of this confusing and didn't actually look at live robots.txt first) |
|
Yea, probably just never deployed. |
nthmost
left a comment
There was a problem hiding this comment.
should we clean up the robots_txt tho? (separately)
|
Yes, we should probably do a round of cleanup of the robots.txt and actually deploy it this time. :) |
Switch to the permissive scrape policy to allow more well behaved AI crawlers.