In what world do AI scrapers actually pay for data? #24
quat1024
started this conversation in
Compliance by AI developers
Replies: 2 comments 2 replies
Basically this is an impossible problem to solve and no amount of floundering with signals will make them care at all
0 replies
Links related to preventing AI crawling:
2 replies
AI scrapers are known to:
Quoth A season on Iocaine.
This is the ecosystem in which projects like anubis and go-away exist. Users of these tools are not necessarily against AI training using content on their website. But in the current ecosystem it is not practical to allow AI scrapers anywhere within ten feet of their website because this is how they behave.
They behave like this because it is cheap. Violating robots.txt is cheap, violating unwritten rules about user-agents is cheap, renting residential IPs is an acceptable cost of doing business. Any and all social barriers to delicious content are ignored; they only don't bother trying to pass anubis challenges because it is economically expensive to do so.
Given this is the ecosystem we are in, what is the economic reason for an AI scraping outfit to give a single shit about:
Quoth the report:
There is precedent of the opposite; AI scrapers deciding robots.txt doesn't apply to them. Compare WIRED, 2024:
They do not care. None of this will make them care.