This repository was archived by the owner on Apr 23, 2018. It is now read-only.

JHOCR Usage

4F2E4A2E edited this page Jun 4, 2015 · 2 revisions

How to use jhocr

Create a searchable PDF from an image using either the Tesseract Java API (tess4j) or the Tesseract binaries installed on your system.
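
For the binaries route, Tesseract can write hOCR directly from the command line. The following is a minimal sketch, not jhocr's own tooling: the function name image_to_hocr and the file name scan.png are made up for illustration, and it assumes the hOCR file is what you later pass to jhocr together with the original image, which this page does not spell out.

```shell
#!/bin/sh
# Hypothetical helper: run Tesseract on one image and write hOCR next to it.
# Usage: image_to_hocr scan.png
image_to_hocr() {
    image=$1
    base=${image%.*}    # strip the file extension: scan.png -> scan

    # Fail early with a readable message if Tesseract is not installed.
    if ! command -v tesseract >/dev/null 2>&1; then
        echo "tesseract not found on PATH" >&2
        return 127
    fi

    # "hocr" is a Tesseract config name that switches the output format to hOCR.
    tesseract "$image" "$base" hocr
}
```

Calling image_to_hocr scan.png should leave the hOCR output beside the image. Tesseract 3.02 names it scan.html, while later versions name it scan.hocr.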

How it works

https://jhocr.googlecode.com/svn/wiki/resources/documentation/searchable-pdf-explained_01.png

Text to be added soon ...

Embedded

Create a searchable PDF from an image using the Tesseract Java API, tess4j.

tbd ...


On MacOSX

MacOSX Part 1

Install Homebrew.

  1. Check the Homebrew page for newer instructions.
  2. Otherwise run in your terminal (do not run it with sudo):
ruby -e "$(curl -fsSL https://raw.github.com/mxcl/homebrew/go/install)"
  3. Accept the Xcode install.
  4. Run in your terminal:
brew doctor
  5. Follow the steps it suggests and then run:
echo 'export PATH="/usr/local/bin:$PATH"' >> ~/.bash_profile
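
The PATH line only needs to appear in the profile once, so repeating the last step would leave duplicates. A minimal sketch of an idempotent variant follows. The function name add_path_line is hypothetical, and it assumes bash reads ~/.bash_profile on login as in the steps above.

```shell
#!/bin/sh
# Hypothetical helper: append the Homebrew PATH export to a profile file,
# but only if that exact line is not already present.
add_path_line() {
    profile=$1
    # Single quotes keep $PATH literal so it is expanded at login, not now.
    line='export PATH="/usr/local/bin:$PATH"'

    touch "$profile"    # make sure the file exists before grep reads it
    # -x matches whole lines, -F treats the pattern as a fixed string.
    grep -qxF "$line" "$profile" || printf '%s\n' "$line" >> "$profile"
}
```

Running add_path_line ~/.bash_profile twice should leave a single export line in the file. Open a new terminal afterwards so the change takes effect.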

MacOSX Part 2

  1. Run in your terminal:
brew install tesseract
  2. Grab a beer.
  3. Wait until it has finished, then test your installation with:
$ tesseract -v
tesseract 3.02.02
 leptonica-1.69
  libjpeg 8d : libpng 1.5.17 : libtiff 4.0.3 : zlib 1.2.5
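
If the version check fails, the usual cause is that the tool is not reachable through PATH. The sketch below shows one way to check that for both tools used here. The function name check_tool is made up for illustration, and the paths it prints depend on your system.

```shell
#!/bin/sh
# Hypothetical helper: report whether a command can be found via PATH.
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "$1: found at $(command -v "$1")"
    else
        echo "$1: not found on PATH"
        return 1
    fi
}

# "|| true" keeps the script going so both tools are always reported.
check_tool brew || true
check_tool tesseract || true
```

A "not found" line for tesseract right after a successful brew install usually means /usr/local/bin is missing from PATH in the current shell.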


On Windows

  1. Run the Tesseract installer and click Next, Next, Next and Finish :)

tbd ...

On Linux

  1. Run in your terminal:
sudo apt-get install tesseract-ocr

tbd ...
