Suggestion: Document comparison to Tika #212
nickchomey started this conversation in Ideas
Replies: 1 comment
Ok, I'll adopt your proposal and replace extraction with Apache Tika.
Kreuzberg seems extremely impressive. Thanks for sharing your hard work!
However, Tika is the gold standard in this area and vastly more mature. So, it seems to me that it would be helpful to compare Kreuzberg to Tika in the README and/or docs: which file types each works with, potential concerns with edge cases, performance, features, etc.
Likewise, Extractous embeds/wraps Tika in Rust to make it more accessible, but it is seemingly abandoned (and has much less functionality). Despite that, a mention of how Kreuzberg differs would be great.
(I expect Kreuzberg to compare very favourably).