How to preserve document structure in tesseract?

I am using tesseract to extract text from an image. Preserving the structure of the document is very important to me. Currently tesseract does not preserve the structure, infact it changes the order of text. My input is the image below.
![doc_structur](https://cloud.githubusercontent.com/assets/14837670/10939324/7c8c9a16-8309-11e5-9151-be80d6668069.png)

After processing i've got a simple text without any additional tabs and spaces:
Someto the left
Someto the left

Some in the middle
Some in the middle

Some with some tab
Some with some tab

Some with some space between them
Some with some space between them

Sometext here
Sometext here

this much
this much

Please, help me, how do I get the desired output as of the same structure in image?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to preserve document structure in tesseract? #221

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

How to preserve document structure in tesseract? #221

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions