[Feature Request]: Remove GDPR Banners #1005
nikolakretschmer
started this conversation in
Feature requests
Replies: 1 comment
-
Yes this please... those damn cookie banners are really preventing 99% of websites to be crawled and it is really unusable like this. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
What needs to be done?
Create a CrawlerRunConfig option/feature to reliably remove GDPR / Cookie-Consent popups and banners from crawled sites.
What problem does this solve?
With these banners/popups present the resulting markdown will often contain content of the cookie popup and not the actual site content.
Target users/beneficiaries
EU users
Current alternatives/workarounds
I've tried every trick under the sun to remove these obnoxious popups. Injecting JS-Code, loading a chrome/firefox extension, remove_overlay_elements ... nothing has worked so far. Until this fix is implemented, I might have to switch to playwright and use some other parser for the markdown.
Proposed approach
I don't know. You guys are the wizards.
Beta Was this translation helpful? Give feedback.
All reactions