Understanding block segmentation after recognition #1643
-
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 6 replies
-
|
Hi @ep0p 👋 It takes the sorted boxes from the estimated lines and tries to build a cluster depending on the distance to estimate which lines could build a block. For images like your provided one i suggest to disable the block resolving: |
Beta Was this translation helpful? Give feedback.
-
|
Yes, the issue is that i was kind of hoping to be able to split the page into paragraphs. The layout is pretty complex and i need to be able to properly segment it. Thank you for your answer. |
Beta Was this translation helpful? Give feedback.

Hi @ep0p 👋
It takes the sorted boxes from the estimated lines and tries to build a cluster depending on the distance to estimate which lines could build a block.
ref.: https://github.com/mindee/doctr/blob/3f8bd0104411c671d9c9fa09934c06f970c7cb67/doctr/models/builder.py#L156C25-L156C30
For images like your provided one i suggest to disable the block resolving:
resolve_blocks=Falsethen it will only sort the boxes and lines :)