Skip to content

Improve Setup and Docs #21

@patchmeifyoucan

Description

@patchmeifyoucan

Hello,

I am developing a similar application to yours and would like to include yours in a publication benchmark.
However, I was not able to get the repository running. Errors range from issues with OCR to segmentation
faults, which is pretty bad. The barrier to "just run" the benchmark on a few thousand images is quite high.

Would it be possible to streamline the whole process? Tools like DECIMER
provide a pip package which an easy-to-use Python interface (which works on CPU only on my machines, which
I find ok since it is quite an effort to have broad CUDA support as repos get older).
A minimalistic version would include one import, either the model itself of an inference function. Additionally,
there should be a simpler way to configure the model other than commenting code.
So far, there is no documentation about how the system works, what it does in terms of its code and flags are hidden
in scripts and issues. Behavior like this should ideally also be
documented such that someone looking at the repo the first time gets a sense of what different modes the
model supports.

I think something along these lines would help users of the model or those wanting to benchmark against it
since it is among the best models currently and should be used and included in OCSR benchmarks.

Thanks a lot.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions