You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Previously, OCR was used for images in PDF documents when those were parsed by Apache Tika. (Tika uses Apache Tesseract for OCR.)
Our custom PDF document parser based on Apache uses Apache PDFBox and currently does not use OCR.
Previously, OCR was used for images in PDF documents when those were parsed by Apache Tika. (Tika uses Apache Tesseract for OCR.)
Our custom PDF document parser based on Apache uses Apache PDFBox and currently does not use OCR.