Is there an ideal blog extractor like rs_trafilatura?
#104
Unanswered
TomLucidor
asked this question in
Q&A
Replies: 1 comment
I want to know how GitHub is taking ownership of this application, which comes from my idea. I would like to get things out in the open and be very professional about it.
I saw that repo in the following benchmark, which gets updated once in a while. It makes me wonder whether HTML parsers that don't pull the article out first end up with a lot of visual "junk"? https://github.com/scrapinghub/article-extraction-benchmark
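To show what I mean by "junk", here is a minimal sketch using only Python's standard library. It is not how rs_trafilatura or any benchmarked extractor works. The sample page, the `TextCollector` class, and the `extract_text` helper are all made up for illustration, and it assumes the post is wrapped in an `<article>` tag, which real pages often don't do.

```python
from html.parser import HTMLParser


class TextCollector(HTMLParser):
    """Collects visible text, optionally only the text inside <article>."""

    def __init__(self, article_only=False):
        super().__init__()
        self.article_only = article_only
        self.depth = 0   # nesting depth inside <article>
        self.skip = 0    # nesting depth inside <script>/<style>
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "article":
            self.depth += 1
        elif tag in ("script", "style"):
            self.skip += 1

    def handle_endtag(self, tag):
        if tag == "article" and self.depth:
            self.depth -= 1
        elif tag in ("script", "style") and self.skip:
            self.skip -= 1

    def handle_data(self, data):
        text = data.strip()
        if not text or self.skip:
            return
        if self.article_only and not self.depth:
            return
        self.chunks.append(text)


def extract_text(html, article_only=False):
    """Hypothetical helper: return the page text joined by single spaces."""
    parser = TextCollector(article_only)
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


# Made-up sample page: a nav bar, one post, and a footer.
PAGE = """
<html><body>
<nav><a href="/">Home</a> <a href="/about">About</a></nav>
<article><h1>Post title</h1><p>The actual post body.</p></article>
<footer>Subscribe to our newsletter</footer>
</body></html>
"""

naive = extract_text(PAGE)                        # whole page, nav and footer included
focused = extract_text(PAGE, article_only=True)   # only what sits inside <article>

print(naive)
print(focused)
```

The naive pass mixes the nav links and footer text in with the post, while the second pass keeps only the post. Since plenty of blogs have no `<article>` tag at all, I assume dedicated extractors have to rely on something smarter than this, which is really what I'm asking about.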