OCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
Receipt OCR using CURL, JavaScript/Node.Js, Java, C# VB.NET, PHP, Python, etc
A flutter plugin that implements Google's standalone ML Kit
Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The official github repository for Leptonica is: danbloomberg/le…
Tesseract Open Source OCR Engine (main repository)
A python module that wraps the pdftoppm utility to convert PDF to PIL Image object
Fast integer versions of trained LSTM models
Best (most accurate) trained LSTM models.
Trained models with fast variant of the "best" LSTM models + legacy models
OCR/handwriting recognition libraries comparison
Tensorflow MNIST and preprocessing
The idea is mainly focused on extraction of textual data from any pdf or image. A separate neural network for handwritten data recognition is created which is trained on open source EMIST dataset. …