Skip to content

Different OCR results for images #416

Open
@vivadavid

Description

Describe the bug
Using the same images, I get different OCR results depending on whether I use the Extract Text from Images in Folder tool or I simply drag and drop the images on the Edit Text Window. This only happens when using Spanish Tesseract, whereas if I use the Microsoft OCR engine for Spanish, I get the same recognized text no matter the approach.

These are my results (see attached ZIP file):

  1. On image1.jpg, no text is recognized (but it's recognized when I drag and drop the image).
  2. On image2.jpg, the text is recognized (but it's not exactly the same text as the text recognized when I drag and drop the image).

Where is the bug

  • OCR Output.

Where did you get Text Grab?
- Exe

Desktop (please complete the following information):

  • OS: Windows 11.
  • Version 23H2.
  • Text Grab 4.3.1.

files.zip

Metadata

Assignees

No one assigned

    Labels

    Edit Text WindowAnything to do with the Edit Text Window or functions within itGeneral ProcessingRelating to the processing of images to some type of text outputbugSomething isn't working

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions