ppu-paddle-ocr-models

This repository contains models necessarily use by ppu-paddle-ocr library.

It consists of layout models, text detection models, and text recognition models.

The models are onnx sourced from https://www.paddleocr.ai/main/en/index.html.

.ort files are the same models pre-serialised in ONNX Runtime's native FlatBuffers format. They load 3-5x faster than the equivalent .onnx (session creation drops from ~24 ms to ~7 ms on M1) while producing byte-identical output. Load them with session: { graphOptimizationLevel: "disabled" } so ORT doesn't re-optimise the already-optimised graph.

export const MODEL_BASE_URL =
  "https://media.githubusercontent.com/media/PT-Perkasa-Pilar-Utama/ppu-paddle-ocr-models/main";

export const DICT_BASE_URL =
  "https://raw.githubusercontent.com/PT-Perkasa-Pilar-Utama/ppu-paddle-ocr-models/main";

File tree

.
├── LICENSE
├── README.md
├── correction
│   ├── PP-LCNet_x0_25_textline_ori.onnx
│   ├── PP-LCNet_x1_0_doc_ori.onnx
│   ├── PP-LCNet_x1_0_textline_ori.onnx
│   └── UVDoc.onnx
├── layout
│   ├── PP-DocLayoutV2.onnx
│   ├── PP-DocLayoutV2_labels.txt
│   ├── PP-DocLayoutV3.onnx
│   └── PP-DocLayoutV3_labels.txt
├── detection
│   ├── PP-OCRv4_mobile_det_infer.onnx
│   ├── PP-OCRv4_server_det_infer.onnx
│   ├── PP-OCRv5_mobile_det_infer.onnx
│   ├── PP-OCRv5_mobile_det_infer.ort
│   └── PP-OCRv5_server_det_infer.onnx
├── table
│   ├── PP-LCNet_x1_0_table_cls.onnx
│   ├── RT-DETR-L_wired_table_cell_det.onnx
│   ├── RT-DETR-L_wireless_table_cell_det.onnx
│   └── SLANet_plus.onnx
└── recognition
    ├── PP-OCRv3_mobile_rec_infer.onnx
    ├── PP-OCRv4_mobile_rec_infer.onnx
    ├── PP-OCRv4_server_rec_doc_infer.onnx
    ├── PP-OCRv4_server_rec_infer.onnx
    ├── PP-OCRv5_mobile_rec_infer.onnx
    ├── PP-OCRv5_server_rec_infer.onnx
    ├── ppocrv3_dict.txt
    ├── ppocrv4_dict.txt
    ├── ppocrv4_doc_dict.txt
    ├── ppocrv5_dict.txt
    └── multi
        ├── arabic
        │   └── v5
        │       ├── arabic_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_arabic_dict.txt
        ├── cyrillic
        │   └── v5
        │       ├── cyrillic_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_cyrillic_dict.txt
        ├── devanagari
        │   └── v5
        │       ├── devanagari_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_devanagari_dict.txt
        ├── el
        │   └── v5
        │       ├── el_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_el_dict.txt
        ├── en
        │   ├── v4
        │   │   ├── en_PP-OCRv4_mobile_rec_infer.onnx
        │   │   └── en_dict.txt
        │   └── v5
        │       ├── en_PP-OCRv5_mobile_rec_infer_int8.onnx
        │       ├── en_PP-OCRv5_mobile_rec_infer_int8.ort
        │       ├── en_PP-OCRv5_mobile_rec_infer.onnx
        │       ├── en_PP-OCRv5_mobile_rec_infer.ort
        │       └── ppocrv5_en_dict.txt
        ├── eslav
        │   └── v5
        │       ├── eslav_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_eslav_dict.txt
        ├── japan
        │   └── v3
        │       ├── japan_PP-OCRv3_mobile_rec_infer.onnx
        │       └── japan_dict.txt
        ├── korean
        │   └── v5
        │       ├── korean_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_korean_dict.txt
        ├── latin
        │   └── v5
        │       ├── latin_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_latin_dict.txt
        ├── ta
        │   └── v5
        │       ├── ppocrv5_ta_dict.txt
        │       └── ta_PP-OCRv5_mobile_rec_infer.onnx
        ├── te
        │   └── v5
        │       ├── ppocrv5_te_dict.txt
        │       └── te_PP-OCRv5_mobile_rec_infer.onnx
        └── th
            └── v5
                ├── ppocrv5_th_dict.txt
                └── th_PP-OCRv5_mobile_rec_infer.onnx

OCR Models and Their Supported Languages

Model	Supported Languages
korean_PP-OCRv5_mobile_rec	Korean, English
latin_PP-OCRv5_mobile_rec	French, German, Afrikaans, Italian, Spanish, Bosnian, Portuguese, Czech, Welsh, Danish, Estonian, Irish, Croatian, Uzbek, Hungarian, Serbian (Latin), Indonesian, Occitan, Icelandic, Lithuanian, Maori, Malay, Dutch, Norwegian, Polish, Slovak, Slovenian, Albanian, Swedish, Swahili, Tagalog, Turkish, Latin, Azerbaijani, Kurdish, Latvian, Maltese, Pali, Romanian, Vietnamese, Finnish, Basque, Galician, Luxembourgish, Romansh, Catalan, Quechua
eslav_PP-OCRv5_mobile_rec	Russian, Belarusian, Ukrainian, English
th_PP-OCRv5_mobile_rec	Thai, English
el_PP-OCRv5_mobile_rec	Greek, English
en_PP-OCRv5_mobile_rec	English
cyrillic_PP-OCRv5_mobile_rec	Russian, Belarusian, Ukrainian, Serbian (Cyrillic), Bulgarian, Mongolian, Abkhazian, Adyghe, Kabardian, Avar, Dargin, Ingush, Chechen, Lak, Lezgin, Tabasaran, Kazakh, Kyrgyz, Tajik, Macedonian, Tatar, Chuvash, Bashkir, Malian, Moldovan, Udmurt, Komi, Ossetian, Buryat, Kalmyk, Tuvan, Sakha, Karakalpak, English
arabic_PP-OCRv5_mobile_rec	Arabic, Persian, Uyghur, Urdu, Pashto, Kurdish, Sindhi, Balochi, English
devanagari_PP-OCRv5_mobile_rec	Hindi, Marathi, Nepali, Bihari, Maithili, Angika, Bhojpuri, Magahi, Santali, Newari, Konkani, Sanskrit, Haryanvi, English
ta_PP-OCRv5_mobile_rec	Tamil, English
te_PP-OCRv5_mobile_rec	Telugu, English

Layout models

Model use for document layout analysis includes 20 common categories: document title, paragraph title, text, page number, abstract, table, references, footnotes, header, footer, algorithm, formula, formula number, image, table, seal, figure_table title, chart, and sidebar text and lists of references

PP-DocLayoutV2
PP-DocLayoutV3

We only converted the layout analysis model, which by default already implemented layout detection. The difference is that layout analysis model doing extra for analyze the reading order. If you want an individual model that just do the layout detection, go check out: https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/ocr_modules/layout_detection.html#ii-supported-model-list

Correction models

Document Image Orientation Classification

The document image orientation classification module is aim to distinguish the orientation of document images and correct them through post-processing.

PP-LCNet_x1_0_doc_ori

Text Image Unwarping

Perform geometric transformations on images in order to correct issues such as document distortion, tilt, perspective deformation, etc., enabling more accurate recognition by subsequent text recognition modules.

UVDoc

Text Line Orientation Classification

The text line orientation classification module primarily distinguishes the orientation of text lines and corrects them using post-processing.

PP-LCNet_x0_25_textline_ori
PP-LCNet_x1_0_textline_ori

Table models

Table Structure Recognition

SLANet_plus

Table Classification

PP-LCNet_x1_0_table_cls

Table Cell Detection

RT-DETR-L_wired_table_cell_det
RT-DETR-L_wireless_table_cell_det

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ppu-paddle-ocr-models

File tree

OCR Models and Their Supported Languages

Layout models

Correction models

Document Image Orientation Classification

Text Image Unwarping

Text Line Orientation Classification

Table models

Table Structure Recognition

Table Classification

Table Cell Detection

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Uh oh!

Contributors

Uh oh!

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.github		.github
correction		correction
detection		detection
layout		layout
recognition		recognition
table		table
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

ppu-paddle-ocr-models

File tree

OCR Models and Their Supported Languages

Layout models

Correction models

Document Image Orientation Classification

Text Image Unwarping

Text Line Orientation Classification

Table models

Table Structure Recognition

Table Classification

Table Cell Detection

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Contributors

Uh oh!

Packages