Open
Description
I am using tesseract to extract text from an image. Preserving the structure of the document is very important to me. Currently tesseract does not preserve the structure, infact it changes the order of text. My input is the image below.
After processing i've got a simple text without any additional tabs and spaces:
Someto the left
Someto the left
Some in the middle
Some in the middle
Some with some tab
Some with some tab
Some with some space between them
Some with some space between them
Sometext here
Sometext here
this much
this much
Please, help me, how do I get the desired output as of the same structure in image?
Activity