Skip to content

Commit 18936e0

Browse files
authored
update for 3.0.1 (#15587)
1 parent 2cf8612 commit 18936e0

File tree

6 files changed

+67
-12
lines changed

6 files changed

+67
-12
lines changed

README.md

Lines changed: 10 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@
2323
</div>
2424

2525
## 🚀 简介
26-
PaddleOCR自发布以来凭借学术前沿算法和产业落地实践,受到了产学研各方的喜爱,并被广泛应用于众多知名开源项目,例如:Umi-OCR、OmniParser、MinerU、RAGFlow等,已成为广大开发者心中的开源OCR领域的首选工具。2025年5月20日,飞桨团队发布**PaddleOCR 3.0**,全面适配**飞桨框架3.0正式版**,进一步**提升文字识别精度**,支持**多文字类型识别****手写体识别**,满足大模型应用对**复杂文档高精度解析**的旺盛需求,结合**文心大模型4.5 Turbo**显著提升关键信息抽取精度,并新增**对昆仑芯、昇腾等国产硬件**的支持。
26+
PaddleOCR自发布以来凭借学术前沿算法和产业落地实践,受到了产学研各方的喜爱,并被广泛应用于众多知名开源项目,例如:Umi-OCR、OmniParser、MinerU、RAGFlow等,已成为广大开发者心中的开源OCR领域的首选工具。2025年5月20日,飞桨团队发布**PaddleOCR 3.0**,全面适配**飞桨框架3.0正式版**,进一步**提升文字识别精度**,支持**多文字类型识别****手写体识别**,满足大模型应用对**复杂文档高精度解析**的旺盛需求,结合**文心大模型4.5 Turbo**显著提升关键信息抽取精度,并新增**对昆仑芯、昇腾等国产硬件**的支持。完整使用文档请参考 [PaddleOCR 3.0 文档](https://paddlepaddle.github.io/PaddleOCR/latest/)
2727

2828
PaddleOCR 3.0**新增**三大特色能力:
2929
- 全场景文字识别模型[PP-OCRv5](docs/version3.x/algorithm/PP-OCRv5/PP-OCRv5.md):单模型支持五种文字类型和复杂手写体识别;整体识别精度相比上一代**提升13个百分点**[在线体验](https://aistudio.baidu.com/community/app/91660/webUI)
@@ -39,6 +39,14 @@ PaddleOCR 3.0除了提供优秀的模型库外,还提供好学易用的工具
3939

4040

4141
## 📣 最新动态
42+
🔥🔥2025.06.05: **PaddleOCR 3.0.1** 发布,包含:
43+
44+
- **优化部分模型和模型配置:**
45+
- 更新 PP-OCRv5默认模型配置,检测和识别均由mobile改为server模型。为了改善大多数的场景默认效果,配置中的参数`limit_side_len`由736改为64
46+
- 新增文本行方向分类`PP-LCNet_x1_0_textline_ori`模型,精度99.42%,OCR、PP-StructureV3、PP-ChatOCRv4产线的默认文本行方向分类器改为该模型
47+
- 优化文本行方向分类`PP-LCNet_x0_25_textline_ori`模型,精度提升3.3个百分点,当前精度98.85%
48+
- **优化和修复3.0.0版本部分存在的问题,[详情](https://paddlepaddle.github.io/PaddleOCR/latest/update/update.html)**
49+
4250
🔥🔥2025.05.20: **PaddleOCR 3.0** 正式发布,包含:
4351
- **PP-OCRv5**: 全场景高精度文字识别
4452

@@ -70,7 +78,7 @@ PaddleOCR 3.0除了提供优秀的模型库外,还提供好学易用的工具
7078

7179
```bash
7280
# 安装 paddleocr
73-
pip install paddleocr==3.0.0
81+
pip install paddleocr
7482
```
7583

7684
### 3. 命令行方式推理

README_en.md

Lines changed: 12 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@
2424
## 🚀 Introduction
2525
Since its initial release, PaddleOCR has gained widespread acclaim across academia, industry, and research communities, thanks to its cutting-edge algorithms and proven performance in real-world applications. It’s already powering popular open-source projects like Umi-OCR, OmniParser, MinerU, and RAGFlow, making it the go-to OCR toolkit for developers worldwide.
2626

27-
On May 20, 2025, the PaddlePaddle team unveiled PaddleOCR 3.0, fully compatible with the official release of the **PaddlePaddle 3.0** framework. This update further **boosts text-recognition accuracy**, adds support for **multiple text-type recognition** and **handwriting recognition**, and meets the growing demand from large-model applications for **high-precision parsing of complex documents**. When combined with the **ERNIE 4.5T**, it significantly enhances key-information extraction accuracy. PaddleOCR 3.0 also introduces support for domestic hardware platforms such as **KUNLUNXIN** and **Ascend**.
27+
On May 20, 2025, the PaddlePaddle team unveiled PaddleOCR 3.0, fully compatible with the official release of the **PaddlePaddle 3.0** framework. This update further **boosts text-recognition accuracy**, adds support for **multiple text-type recognition** and **handwriting recognition**, and meets the growing demand from large-model applications for **high-precision parsing of complex documents**. When combined with the **ERNIE 4.5T**, it significantly enhances key-information extraction accuracy. PaddleOCR 3.0 also introduces support for domestic hardware platforms such as **KUNLUNXIN** and **Ascend**. For the complete usage documentation, please refer to the [PaddleOCR 3.0 Documentation](https://paddlepaddle.github.io/PaddleOCR/latest/en/index.html).
2828

2929
Three Major New Features in PaddleOCR 3.0:
3030
- Universal-Scene Text Recognition Model [PP-OCRv5](./docs/version3.x/algorithm/PP-OCRv5/PP-OCRv5.en.md): A single model that handles five different text types plus complex handwriting. Overall recognition accuracy has increased by 13 percentage points over the previous generation. [Online Demo](https://aistudio.baidu.com/community/app/91660/webUI)
@@ -43,6 +43,16 @@ In addition to providing an outstanding model library, PaddleOCR 3.0 also offers
4343

4444

4545
## 📣 Recent updates
46+
47+
#### **🔥🔥 2025.06.05: Release of PaddleOCR 3.0.1, includes:**
48+
49+
- **Optimisation of certain models and model configurations:**
50+
- Updated the default model configuration for PP-OCRv5, changing both detection and recognition from mobile to server models. To improve default performance in most scenarios, the parameter `limit_side_len` in the configuration has been changed from 736 to 64.
51+
- Added a new text line orientation classification model `PP-LCNet_x1_0_textline_ori` with an accuracy of 99.42%. The default text line orientation classifier for OCR, PP-StructureV3, and PP-ChatOCRv4 pipelines has been updated to this model.
52+
- Optimised the text line orientation classification model `PP-LCNet_x0_25_textline_ori`, improving accuracy by 3.3 percentage points to a current accuracy of 98.85%.
53+
54+
- **Optimizations and fixes for some issues in version 3.0.0, [details](https://paddlepaddle.github.io/PaddleOCR/latest/en/update/update.html)**
55+
4656
🔥🔥2025.05.20: Official Release of **PaddleOCR v3.0**, including:
4757
- **PP-OCRv5**: High-Accuracy Text Recognition Model for All Scenarios - Instant Text from Images/PDFs.
4858
1. 🌐 Single-model support for **five** text types - Seamlessly process **Simplified Chinese, Traditional Chinese, Simplified Chinese Pinyin, English** and **Japanse** within a single model.
@@ -87,7 +97,7 @@ Install PaddlePaddle refer to [Installation Guide](https://www.paddlepaddle.org.
8797

8898
```bash
8999
# Install paddleocr
90-
pip install paddleocr==3.0.0
100+
pip install paddleocr
91101
```
92102

93103
### 3. Run inference by CLI

docs/quick_start.en.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ python -m pip install paddlepaddle-gpu==3.0.0 -i https://www.paddlepaddle.org.cn
2323
#### 2. Install `paddleocr`
2424

2525
```bash
26-
pip install paddleocr==3.0.0
26+
pip install paddleocr
2727
```
2828

2929
### Command Line Usage
@@ -66,11 +66,11 @@ pip install paddleocr==3.0.0
6666
# ocr = PaddleOCR(use_doc_orientation_classify=True, use_doc_unwarping=True) # text image preprocessing + text detection + textline orientation classification + text recognition
6767
# ocr = PaddleOCR(use_doc_orientation_classify=False, use_doc_unwarping=False) # text detection + textline orientation classification + text recognition
6868
# ocr = PaddleOCR(
69-
# text_detection_model_name="PP-OCRv5_server_det",
70-
# text_recognition_model_name="PP-OCRv5_server_rec",
69+
# text_detection_model_name="PP-OCRv5_mobile_det",
70+
# text_recognition_model_name="PP-OCRv5_mobile_rec",
7171
# use_doc_orientation_classify=False,
7272
# use_doc_unwarping=False,
73-
# use_textline_orientation=False) # Switch to PP-OCRv5_server models
73+
# use_textline_orientation=False) # Switch to PP-OCRv5_mobile models
7474
result = ocr.predict("./general_ocr_002.png")
7575
for res in result:
7676
res.print()

docs/quick_start.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ python -m pip install paddlepaddle-gpu==3.0.0 -i https://www.paddlepaddle.org.cn
2323
#### 2. 安装`paddleocr`
2424

2525
```bash
26-
pip install paddleocr==3.0.0
26+
pip install paddleocr
2727
```
2828

2929
### 命令行使用
@@ -66,11 +66,11 @@ pip install paddleocr==3.0.0
6666
# ocr = PaddleOCR(use_doc_orientation_classify=True, use_doc_unwarping=True) # 文本图像预处理+文本检测+方向分类+文本识别
6767
# ocr = PaddleOCR(use_doc_orientation_classify=False, use_doc_unwarping=False) # 文本检测+文本行方向分类+文本识别
6868
# ocr = PaddleOCR(
69-
# text_detection_model_name="PP-OCRv5_server_det",
70-
# text_recognition_model_name="PP-OCRv5_server_rec",
69+
# text_detection_model_name="PP-OCRv5_mobile_det",
70+
# text_recognition_model_name="PP-OCRv5_mobile_rec",
7171
# use_doc_orientation_classify=False,
7272
# use_doc_unwarping=False,
73-
# use_textline_orientation=False) # 更换 PP-OCRv5_server 模型
73+
# use_textline_orientation=False) # 更换 PP-OCRv5_mobile 模型
7474
result = ocr.predict("./general_ocr_002.png")
7575
for res in result:
7676
res.print()

docs/update/update.en.md

Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,26 @@ hide:
77

88
### Recently Update
99

10+
#### **🔥🔥 2025.06.05: Release of PaddleOCR v3.0.1, which includes:**
11+
12+
- **Optimisation of certain models and model configurations:**
13+
- Updated the default model configuration for PP-OCRv5, changing both detection and recognition from mobile to server models. To improve default performance in most scenarios, the parameter `limit_side_len` in the configuration has been changed from 736 to 64.
14+
- Added a new text line orientation classification model `PP-LCNet_x1_0_textline_ori` with an accuracy of 99.42%. The default text line orientation classifier for OCR, PP-StructureV3, and PP-ChatOCRv4 pipelines has been updated to this model.
15+
- Optimised the text line orientation classification model `PP-LCNet_x0_25_textline_ori`, improving accuracy by 3.3 percentage points to a current accuracy of 98.85%.
16+
17+
- **Optimisation of issues present in version 3.0.0:**
18+
- **Improved CLI usage experience:** When using the PaddleOCR CLI without passing any parameters, a usage prompt is now provided.
19+
- **New parameters added:** PP-ChatOCRv3 and PP-StructureV3 now support the `use_textline_orientation` parameter.
20+
- **CPU inference speed optimisation:** All pipeline CPU inferences now enable MKL-DNN by default.
21+
- **Support for C++ inference:** The detection and recognition concatenation part of PP-OCRv5 now supports C++ inference.
22+
23+
- **Fixes for issues present in version 3.0.0:**
24+
- Fixed an issue where PP-StructureV3 encountered CPU inference errors due to the inability to use MKL-DNN with formula and table recognition models.
25+
- Fixed an issue where GPU environments encountered the error `FatalError: Process abort signal is detected by the operating system` during inference.
26+
- Fixed type hint issues in some Python 3.8 environments.
27+
- Fixed the issue where the method `PPStructureV3.concatenate_markdown_pages` was missing.
28+
- Fixed an issue where specifying both `lang` and `model_name` when instantiating `paddleocr.PaddleOCR` resulted in `model_name` being ineffective.
29+
1030
#### **🔥🔥 2025.05.20: PaddleOCR 3.0 Official Release Highlights**
1131

1232
- **PP-OCRv5: All-Scene Text Recognition Model**

docs/update/update.md

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,23 @@ hide:
77

88
### 更新
99

10+
#### **🔥🔥2025.06.05: PaddleOCR v3.0.1 版本发布,包含:**
11+
- **优化部分模型和模型配置:**
12+
- 更新 PP-OCRv5默认模型配置,检测和识别均由mobile改为server模型。为了改善大多数的场景默认效果,配置中的参数`limit_side_len`由736改为64
13+
- 新增文本行方向分类`PP-LCNet_x1_0_textline_ori`模型,精度99.42%,OCR、PP-StructureV3、PP-ChatOCRv4产线的默认文本行方向分类器改为该模型
14+
- 优化文本行方向分类`PP-LCNet_x0_25_textline_ori`模型,精度提升3.3个百分点,当前精度98.85%
15+
- **优化3.0.0版本部分存在的问题**
16+
- **优化CLI使用体验:** 当使用PaddleOCR CLI不传入任何参数时,给出用法提示。
17+
- **新增参数:** PP-ChatOCRv3、PP-StructureV3支持`use_textline_orientation`参数。
18+
- **CPU推理速度优化:** 所有产线CPU推理默认开启MKL-DNN。
19+
- **C++推理支持:** PP-OCRv5的检测和识别串联部分支持C++推理
20+
- **修复3.0.0版本部分存在的问题**
21+
- 修复由于公式识别、表格识别模型无法使用MKL-DNN导致PP-StructureV3在部分cpu推理报错的问题
22+
- 修复在部分GPU环境中推理报`FatalError: Process abort signal is detected by the operating system`错误的问题
23+
- 修复部分Python3.8环境的type hint的问题
24+
- 修复`PPStructureV3.concatenate_markdown_pages`方法不存在的问题。
25+
- 修复实例化`paddleocr.PaddleOCR`时同时指定`lang``model_name``model_name`不生效的问题。
26+
1027
#### **🔥🔥2025.05.20: PaddleOCR 3.0 正式发布,包含:**
1128

1229
- **发布全场景文字识别模型PP-OCRv5:** 单模型支持五种文字类型和复杂手写体识别;整体识别精度相比上一代提升13个百分点。

0 commit comments

Comments
 (0)