Open
Description
Hi all. Love the project!
In my use case, I'm trying to get coordinates for all 'Picture' elements in a given PDF. I have found that when merge_colliding_predictions is run from get_pdf_segments_for_page in src/vgt/get_most_probable_pdf_segments.py, many images are merged with 'Text' or 'Caption' elements. I'd prefer to keep all the detected elements, then sort them out and detect collisions on my own.
My quick and dirty hack is simply to comment out line 96 of the file above. Ideally, this behavior would be controlled by a parameter handled by app.py. Is this something other people care about (i.e., is it worth my submitting a PR), or is this just me?
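To make the proposal concrete, here is a rough sketch of what an opt-out flag could look like. `Prediction`, `collides`, `merge_colliding`, `select_segments`, and the `merge_collisions` parameter are all made-up names for illustration, not taken from the repository, and the merge logic is a simplified stand-in rather than what merge_colliding_predictions actually does.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Prediction:
    # Hypothetical stand-in for one detected layout element.
    label: str
    left: float
    top: float
    right: float
    bottom: float
    score: float


def collides(a: Prediction, b: Prediction) -> bool:
    # Two axis-aligned boxes collide when they overlap on both axes.
    return (a.left < b.right and b.left < a.right
            and a.top < b.bottom and b.top < a.bottom)


def merge_colliding(predictions: list[Prediction]) -> list[Prediction]:
    # Simplified stand-in for the merge step: visit predictions from highest
    # to lowest score and absorb each one into the first kept box it collides
    # with, growing that box to the union of the two.
    merged: list[Prediction] = []
    for pred in sorted(predictions, key=lambda p: p.score, reverse=True):
        for i, kept in enumerate(merged):
            if collides(kept, pred):
                merged[i] = Prediction(
                    kept.label,
                    min(kept.left, pred.left),
                    min(kept.top, pred.top),
                    max(kept.right, pred.right),
                    max(kept.bottom, pred.bottom),
                    kept.score,
                )
                break
        else:
            merged.append(pred)
    return merged


def select_segments(predictions: list[Prediction],
                    merge_collisions: bool = True) -> list[Prediction]:
    # Hypothetical entry point: the default keeps today's merging behavior,
    # while merge_collisions=False returns every detection untouched.
    if merge_collisions:
        return merge_colliding(predictions)
    return list(predictions)


picture = Prediction("Picture", 0, 0, 100, 100, 0.80)
caption = Prediction("Caption", 50, 90, 150, 120, 0.90)
text = Prediction("Text", 300, 300, 400, 320, 0.95)

merged = select_segments([picture, caption, text])
raw = select_segments([picture, caption, text], merge_collisions=False)
```

With the default, the overlapping 'Picture' is absorbed into the 'Caption' box; with `merge_collisions=False` all three detections come back and the caller can resolve collisions however they like. Defaulting to `True` would keep existing behavior unchanged for everyone else.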