Skip to content

Getting pre-merged predictions #113

@hutchhicken

Description

@hutchhicken

Hi all. Love the project!

In my use case, I'm trying to get coordinates for all 'Pictures' in a given PDF. I have found that when merge_colliding_predictions is run from get_pdf_segments_for_page in src/vgt/get_most_probable_pdf_segments.py, many images are merge with 'Text' or 'Caption' elements. In my case, I'd prefer to have all the detected elements present, then sort them out / detect collisions on my own.

My quick and dirty hack is simply to comment out line 96 of the file above. Ideally, we'd define this by using a parameter handled by app.py. Is this something other people care about (i.e., is worth my submitting a PR), or is this just me?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions