Skip to content

[models] Handle watermarks #292

@charlesmindee

Description

@charlesmindee

Handle watemarks: we don't want the main information of the document to be parasitized by watermarks, so we need an OCR able to distinguish and filter watermarks.

image (5)

We could work on word occurrences and find areas covered by highly-repeated words, we could also work on relative contrasts.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions