Skip to content

PT-Perkasa-Pilar-Utama/ppu-paddle-ocr-models

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ppu-paddle-ocr-models

This repository contains models necessarily use by ppu-paddle-ocr library.

It consists of layout models, text detection models, and text recognition models.

The models are onnx sourced from https://www.paddleocr.ai/main/en/index.html.

.ort files are the same models pre-serialised in ONNX Runtime's native FlatBuffers format. They load 3-5x faster than the equivalent .onnx (session creation drops from ~24 ms to ~7 ms on M1) while producing byte-identical output. Load them with session: { graphOptimizationLevel: "disabled" } so ORT doesn't re-optimise the already-optimised graph.

export const MODEL_BASE_URL =
  "https://media.githubusercontent.com/media/PT-Perkasa-Pilar-Utama/ppu-paddle-ocr-models/main";

export const DICT_BASE_URL =
  "https://raw.githubusercontent.com/PT-Perkasa-Pilar-Utama/ppu-paddle-ocr-models/main";

File tree

.
├── LICENSE
├── README.md
├── correction
│   ├── PP-LCNet_x0_25_textline_ori.onnx
│   ├── PP-LCNet_x1_0_doc_ori.onnx
│   ├── PP-LCNet_x1_0_textline_ori.onnx
│   └── UVDoc.onnx
├── layout
│   ├── PP-DocLayoutV2.onnx
│   ├── PP-DocLayoutV2_labels.txt
│   ├── PP-DocLayoutV3.onnx
│   └── PP-DocLayoutV3_labels.txt
├── detection
│   ├── PP-OCRv4_mobile_det_infer.onnx
│   ├── PP-OCRv4_server_det_infer.onnx
│   ├── PP-OCRv5_mobile_det_infer.onnx
│   ├── PP-OCRv5_mobile_det_infer.ort
│   └── PP-OCRv5_server_det_infer.onnx
├── table
│   ├── PP-LCNet_x1_0_table_cls.onnx
│   ├── RT-DETR-L_wired_table_cell_det.onnx
│   ├── RT-DETR-L_wireless_table_cell_det.onnx
│   └── SLANet_plus.onnx
└── recognition
    ├── PP-OCRv3_mobile_rec_infer.onnx
    ├── PP-OCRv4_mobile_rec_infer.onnx
    ├── PP-OCRv4_server_rec_doc_infer.onnx
    ├── PP-OCRv4_server_rec_infer.onnx
    ├── PP-OCRv5_mobile_rec_infer.onnx
    ├── PP-OCRv5_server_rec_infer.onnx
    ├── ppocrv3_dict.txt
    ├── ppocrv4_dict.txt
    ├── ppocrv4_doc_dict.txt
    ├── ppocrv5_dict.txt
    └── multi
        ├── arabic
        │   └── v5
        │       ├── arabic_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_arabic_dict.txt
        ├── cyrillic
        │   └── v5
        │       ├── cyrillic_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_cyrillic_dict.txt
        ├── devanagari
        │   └── v5
        │       ├── devanagari_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_devanagari_dict.txt
        ├── el
        │   └── v5
        │       ├── el_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_el_dict.txt
        ├── en
        │   ├── v4
        │   │   ├── en_PP-OCRv4_mobile_rec_infer.onnx
        │   │   └── en_dict.txt
        │   └── v5
        │       ├── en_PP-OCRv5_mobile_rec_infer_int8.onnx
        │       ├── en_PP-OCRv5_mobile_rec_infer_int8.ort
        │       ├── en_PP-OCRv5_mobile_rec_infer.onnx
        │       ├── en_PP-OCRv5_mobile_rec_infer.ort
        │       └── ppocrv5_en_dict.txt
        ├── eslav
        │   └── v5
        │       ├── eslav_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_eslav_dict.txt
        ├── japan
        │   └── v3
        │       ├── japan_PP-OCRv3_mobile_rec_infer.onnx
        │       └── japan_dict.txt
        ├── korean
        │   └── v5
        │       ├── korean_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_korean_dict.txt
        ├── latin
        │   └── v5
        │       ├── latin_PP-OCRv5_mobile_rec_infer.onnx
        │       └── ppocrv5_latin_dict.txt
        ├── ta
        │   └── v5
        │       ├── ppocrv5_ta_dict.txt
        │       └── ta_PP-OCRv5_mobile_rec_infer.onnx
        ├── te
        │   └── v5
        │       ├── ppocrv5_te_dict.txt
        │       └── te_PP-OCRv5_mobile_rec_infer.onnx
        └── th
            └── v5
                ├── ppocrv5_th_dict.txt
                └── th_PP-OCRv5_mobile_rec_infer.onnx

OCR Models and Their Supported Languages

Model Supported Languages
korean_PP-OCRv5_mobile_rec Korean, English
latin_PP-OCRv5_mobile_rec French, German, Afrikaans, Italian, Spanish, Bosnian, Portuguese, Czech, Welsh, Danish, Estonian, Irish, Croatian, Uzbek, Hungarian, Serbian (Latin), Indonesian, Occitan, Icelandic, Lithuanian, Maori, Malay, Dutch, Norwegian, Polish, Slovak, Slovenian, Albanian, Swedish, Swahili, Tagalog, Turkish, Latin, Azerbaijani, Kurdish, Latvian, Maltese, Pali, Romanian, Vietnamese, Finnish, Basque, Galician, Luxembourgish, Romansh, Catalan, Quechua
eslav_PP-OCRv5_mobile_rec Russian, Belarusian, Ukrainian, English
th_PP-OCRv5_mobile_rec Thai, English
el_PP-OCRv5_mobile_rec Greek, English
en_PP-OCRv5_mobile_rec English
cyrillic_PP-OCRv5_mobile_rec Russian, Belarusian, Ukrainian, Serbian (Cyrillic), Bulgarian, Mongolian, Abkhazian, Adyghe, Kabardian, Avar, Dargin, Ingush, Chechen, Lak, Lezgin, Tabasaran, Kazakh, Kyrgyz, Tajik, Macedonian, Tatar, Chuvash, Bashkir, Malian, Moldovan, Udmurt, Komi, Ossetian, Buryat, Kalmyk, Tuvan, Sakha, Karakalpak, English
arabic_PP-OCRv5_mobile_rec Arabic, Persian, Uyghur, Urdu, Pashto, Kurdish, Sindhi, Balochi, English
devanagari_PP-OCRv5_mobile_rec Hindi, Marathi, Nepali, Bihari, Maithili, Angika, Bhojpuri, Magahi, Santali, Newari, Konkani, Sanskrit, Haryanvi, English
ta_PP-OCRv5_mobile_rec Tamil, English
te_PP-OCRv5_mobile_rec Telugu, English

Layout models

Model use for document layout analysis includes 20 common categories: document title, paragraph title, text, page number, abstract, table, references, footnotes, header, footer, algorithm, formula, formula number, image, table, seal, figure_table title, chart, and sidebar text and lists of references

  • PP-DocLayoutV2
  • PP-DocLayoutV3

We only converted the layout analysis model, which by default already implemented layout detection. The difference is that layout analysis model doing extra for analyze the reading order. If you want an individual model that just do the layout detection, go check out: https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/ocr_modules/layout_detection.html#ii-supported-model-list

Correction models

Document Image Orientation Classification

The document image orientation classification module is aim to distinguish the orientation of document images and correct them through post-processing.

  • PP-LCNet_x1_0_doc_ori

Text Image Unwarping

Perform geometric transformations on images in order to correct issues such as document distortion, tilt, perspective deformation, etc., enabling more accurate recognition by subsequent text recognition modules.

  • UVDoc

Text Line Orientation Classification

The text line orientation classification module primarily distinguishes the orientation of text lines and corrects them using post-processing.

  • PP-LCNet_x0_25_textline_ori
  • PP-LCNet_x1_0_textline_ori

Table models

Table Structure Recognition

  • SLANet_plus

Table Classification

  • PP-LCNet_x1_0_table_cls

Table Cell Detection

  • RT-DETR-L_wired_table_cell_det
  • RT-DETR-L_wireless_table_cell_det

About

Onnx model file collection (layout, detection, recognition, dictionary, table, correction) used in ppu-paddle-ocr

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Sponsor this project

 

Packages

 
 
 

Contributors