In an early stage (v0.0.4). Check out the blog post with a fully developed example and the package documentation.
(Go to the blog post for more detailed explanations of each step.)
-
Install the Tesseract engine,
tesseract, andtesseractgt. -
Take screenshots to serve as fine-tuning images, and save them in a folder.
-
Use
tesseractgt::create_gt_txt()to create and pre-fill files with the text in the images. -
Call the GUI from "Addins > Correct ground truth files" in RStudio or
tesseractgt::correct_gt_txt(). Use it to correct the text files created in step 2. -
Fine-tune using
tesstrain.
install.packages("remotes") # if `remotes` is not installed
remotes::install_github("arcruz0/tesseractgt")