First off, thanks for all the work you've put into this tool!
May I humbly suggest, though, that to get the binary tool into a production-ready state as quickly as possible, you replace your RTen ONNX runtime backend with an alternative that's already widely used in production in industry.
What I would recommend is checking out ORT, the Rust bindings for ONNX Runtime.
See: https://github.com/pykeio/ort
Just to give you a quick sense of what I mean by 'ORT is production-ready', it is being used at:
- Twitter / X
- Google's Magika library, which is used in Gmail
- the Wasmtime WASI runtime, where it powers their WASI-NN API (which is used by multiple companies and projects, such as Fermyon's 'Spin' WASI FaaS platform, the CNCF's wasmCloud project, etc.)
- SurrealDB
...
While creating an ONNX runtime is a fantastic learning experience, for now OCRS might benefit from the production-ready ONNX Runtime support that you'd get from a solution like ort.
Just a suggestion...