arcruz0/tesseractgt

tesseractgt: Generate Ground Truth Data for Tesseract

This package is at an early stage (v0.0.4). See the blog post for a fully developed example, and the package documentation for details.

GIF showing how the tesseractgt GUI works

Step-by-step summary

(Go to the blog post for more detailed explanations of each step.)

  1. Install the Tesseract engine, the tesseract R package, and tesseractgt.

  2. Take screenshots to serve as fine-tuning images, and save them in a folder.

  3. Use tesseractgt::create_gt_txt() to create ground truth text files, pre-filled with the text in the images.

  4. Open the GUI from "Addins > Correct ground truth files" in RStudio, or by calling tesseractgt::correct_gt_txt(). Use it to correct the text files created in step 3.

  5. Fine-tune using tesstrain.
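
To make step 3 more concrete, here is a minimal sketch of what "pre-filling" ground truth files can look like. It is not the implementation of tesseractgt::create_gt_txt(); the helpers gt_path_for() and prefill_gt() are hypothetical names introduced for illustration. It assumes the usual tesstrain naming convention, where an image such as page1.png is paired with a page1.gt.txt file, and it uses tesseract::ocr() from the tesseract R package for the first-pass reading.

```r
# Hypothetical helper (not part of tesseractgt): derive the ground truth
# path for an image, e.g. "imgs/page1.png" -> "imgs/page1.gt.txt".
gt_path_for <- function(image_path) {
  paste0(tools::file_path_sans_ext(image_path), ".gt.txt")
}

# Hypothetical helper (not part of tesseractgt): OCR every image in `folder`
# and write the recognized text next to it as a .gt.txt file.
# `ocr_fun` defaults to tesseract::ocr(), which takes an image path and
# returns the recognized text as a character string.
prefill_gt <- function(folder, extension = "png", ocr_fun = tesseract::ocr) {
  pattern <- paste0("\\.", extension, "$")
  images <- list.files(folder, pattern = pattern, full.names = TRUE)
  if (length(images) == 0) {
    return(invisible(character(0)))
  }
  for (img in images) {
    text <- ocr_fun(img)
    # Strip surrounding whitespace; writeLines() adds the final newline.
    writeLines(trimws(text), gt_path_for(img))
  }
  # Return the paths of the files that were written.
  invisible(gt_path_for(images))
}
```

Calling prefill_gt("imgs") on a folder of screenshots would leave one editable .gt.txt file per image, which is the kind of file the GUI in step 4 is meant to correct. In practice, prefer tesseractgt::create_gt_txt() and check the package documentation for its arguments.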

Installation

install.packages("remotes") # if `remotes` is not installed
remotes::install_github("arcruz0/tesseractgt")

About

An R package to generate ground truth data for Tesseract

License

MIT (LICENSE.md). The repository also contains a LICENSE file whose license type was not recognized.
