Skip to content

"stage output" -> mixed: picture zones layer #191

@Golddouble

Description

@Golddouble

This is a question to the function "stage output" -> mixed

I do not understand, what the advantage (or sense) of the function "mixed" has compared with "Colour / Gray scale" mode?

Question 1:
Is the sense, to save place? (I mean to make the tiff-file smaller compared with "Colour / Gray scale" mode)?

When I choose mixed-mode, then I can use the "picture zones to automatically detect pictures and separate them from text.

One problem of tesseract based OCR programmes is, that they can not proper separate text from picture. It looks like ScanTailor can this better. And in the tesseract based OCR programmes we have not the possibility to manually mark/select "text areas" to help tesseract only to apply OCR on areas that are really text.

So I ask me, if I can in any way use the mixed -> picture/text zones detected through ScanTailor in my OCR programme.

Question 2:
You speak about "auto layers" that can be seen in the tab "picture zones". Are this zones somehow saved in the resulting tiff?
And if yes, can my OCR programme this zones use, to decide, if it should apply OCR to find text in a certain zone or not.

Would appreciate some answer.
Thank you.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions