update for 3.0.1 (#15587)

cuicheng01 · web-flow · commit 18936e0fb78f · 2025-06-05T11:24:29.000+08:00
diff --git a/README.md b/README.md
@@ -23,7 +23,7 @@
 </div>
 
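 ## 🚀 Introduction
-Since its release, PaddleOCR has won favor across industry, academia, and research thanks to its state-of-the-art algorithms and real-world industrial deployments. It is widely used in well-known open-source projects such as Umi-OCR, OmniParser, MinerU, and RAGFlow, and has become the open-source OCR toolkit of choice for many developers. On May 20, 2025, the PaddlePaddle team released **PaddleOCR 3.0**, which is fully adapted to the **official release of the PaddlePaddle 3.0 framework**, further **improves text recognition accuracy**, supports **recognition of multiple text types** and **handwriting recognition**, and meets the strong demand from large-model applications for **high-precision parsing of complex documents**. Combined with **ERNIE 4.5 Turbo**, it significantly improves key information extraction accuracy, and it adds **support for domestic hardware such as KUNLUNXIN and Ascend**.
+Since its release, PaddleOCR has won favor across industry, academia, and research thanks to its state-of-the-art algorithms and real-world industrial deployments. It is widely used in well-known open-source projects such as Umi-OCR, OmniParser, MinerU, and RAGFlow, and has become the open-source OCR toolkit of choice for many developers. On May 20, 2025, the PaddlePaddle team released **PaddleOCR 3.0**, which is fully adapted to the **official release of the PaddlePaddle 3.0 framework**, further **improves text recognition accuracy**, supports **recognition of multiple text types** and **handwriting recognition**, and meets the strong demand from large-model applications for **high-precision parsing of complex documents**. Combined with **ERNIE 4.5 Turbo**, it significantly improves key information extraction accuracy, and it adds **support for domestic hardware such as KUNLUNXIN and Ascend**. For the complete usage documentation, see the [PaddleOCR 3.0 documentation](https://paddlepaddle.github.io/PaddleOCR/latest/).
 
 PaddleOCR 3.0 **adds** three major capabilities:
 - All-scenario text recognition model [PP-OCRv5](docs/version3.x/algorithm/PP-OCRv5/PP-OCRv5.md): a single model supports five text types and complex handwriting recognition; overall recognition accuracy is **13 percentage points higher** than the previous generation. [Online demo](https://aistudio.baidu.com/community/app/91660/webUI)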
@@ -39,6 +39,14 @@ PaddleOCR 3.0除了提供优秀的模型库外，还提供好学易用的工具
 
 
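 ## 📣 Latest updates
+🔥🔥2025.06.05: **PaddleOCR 3.0.1** released, including:
+
+- **Optimized some models and model configurations:**
+  - Updated the default PP-OCRv5 model configuration: detection and recognition both change from mobile to server models. To improve default results in most scenarios, the `limit_side_len` parameter in the configuration changes from 736 to 64.
+  - Added the text line orientation classification model `PP-LCNet_x1_0_textline_ori` with 99.42% accuracy; it is now the default text line orientation classifier for the OCR, PP-StructureV3, and PP-ChatOCRv4 pipelines.
+  - Optimized the text line orientation classification model `PP-LCNet_x0_25_textline_ori`: accuracy improves by 3.3 percentage points to 98.85%.
+- **Optimizations and fixes for some issues in version 3.0.0, [details](https://paddlepaddle.github.io/PaddleOCR/latest/update/update.html)**
+
 🔥🔥2025.05.20: **PaddleOCR 3.0** officially released, including:
 - **PP-OCRv5**: high-accuracy text recognition for all scenarios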
 
@@ -70,7 +78,7 @@ PaddleOCR 3.0除了提供优秀的模型库外，还提供好学易用的工具
 
 ```bash
 # Install paddleocr
-pip install paddleocr==3.0.0
+pip install paddleocr
 ```
 
 ### 3. Run inference from the command line
diff --git a/README_en.md b/README_en.md
@@ -24,7 +24,7 @@
 ## 🚀 Introduction
 Since its initial release, PaddleOCR has gained widespread acclaim across academia, industry, and research communities, thanks to its cutting-edge algorithms and proven performance in real-world applications. It’s already powering popular open-source projects like Umi-OCR, OmniParser, MinerU, and RAGFlow, making it the go-to OCR toolkit for developers worldwide.
 
-On May 20, 2025, the PaddlePaddle team unveiled PaddleOCR 3.0, fully compatible with the official release of the **PaddlePaddle 3.0** framework. This update further **boosts text-recognition accuracy**, adds support for **multiple text-type recognition** and **handwriting recognition**, and meets the growing demand from large-model applications for **high-precision parsing of complex documents**. When combined with the **ERNIE 4.5T**, it significantly enhances key-information extraction accuracy. PaddleOCR 3.0 also introduces support for domestic hardware platforms such as **KUNLUNXIN** and **Ascend**.
+On May 20, 2025, the PaddlePaddle team unveiled PaddleOCR 3.0, fully compatible with the official release of the **PaddlePaddle 3.0** framework. This update further **boosts text-recognition accuracy**, adds support for **multiple text-type recognition** and **handwriting recognition**, and meets the growing demand from large-model applications for **high-precision parsing of complex documents**. When combined with the **ERNIE 4.5T**, it significantly enhances key-information extraction accuracy. PaddleOCR 3.0 also introduces support for domestic hardware platforms such as **KUNLUNXIN** and **Ascend**. For the complete usage documentation, please refer to the [PaddleOCR 3.0 Documentation](https://paddlepaddle.github.io/PaddleOCR/latest/en/index.html).
 
 Three Major New Features in PaddleOCR 3.0:
 - Universal-Scene Text Recognition Model [PP-OCRv5](./docs/version3.x/algorithm/PP-OCRv5/PP-OCRv5.en.md): A single model that handles five different text types plus complex handwriting. Overall recognition accuracy has increased by 13 percentage points over the previous generation. [Online Demo](https://aistudio.baidu.com/community/app/91660/webUI)
@@ -43,6 +43,16 @@ In addition to providing an outstanding model library, PaddleOCR 3.0 also offers
 
 
 ## 📣 Recent updates
+
+#### **🔥🔥 2025.06.05: Release of PaddleOCR 3.0.1, which includes:**
+
+- **Optimisation of certain models and model configurations:**
+  - Updated the default model configuration for PP-OCRv5, changing both detection and recognition from mobile to server models. To improve default performance in most scenarios, the parameter `limit_side_len` in the configuration has been changed from 736 to 64.
+  - Added a new text line orientation classification model `PP-LCNet_x1_0_textline_ori` with an accuracy of 99.42%. The default text line orientation classifier for OCR, PP-StructureV3, and PP-ChatOCRv4 pipelines has been updated to this model.
+  - Optimised the text line orientation classification model `PP-LCNet_x0_25_textline_ori`, improving accuracy by 3.3 percentage points to a current accuracy of 98.85%.
+
+- **Optimizations and fixes for some issues in version 3.0.0, [details](https://paddlepaddle.github.io/PaddleOCR/latest/en/update/update.html)**
+
 🔥🔥2025.05.20: Official Release of **PaddleOCR v3.0**, including:
 - **PP-OCRv5**: High-Accuracy Text Recognition Model for All Scenarios - Instant Text from Images/PDFs.
     1. 🌐 Single-model support for **five** text types - Seamlessly process **Simplified Chinese, Traditional Chinese, Simplified Chinese Pinyin, English** and **Japanese** within a single model.
@@ -87,7 +97,7 @@ Install PaddlePaddle refer to [Installation Guide](https://www.paddlepaddle.org.
 
 ```bash
 # Install paddleocr
-pip install paddleocr==3.0.0
+pip install paddleocr
 ```
 
 ### 3. Run inference by CLI
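
Because the install command is no longer pinned to `3.0.0`, `pip install paddleocr` resolves to whatever release is newest at install time. A minimal sketch of how a script could confirm it is running at least 3.0.1 follows. The helper names (`parse_version`, `is_at_least`, `installed_paddleocr_version`) are hypothetical and not part of PaddleOCR; only the standard-library `importlib.metadata` module is used.

```python
from importlib import metadata


def parse_version(text):
    """Turn a dotted release string such as '3.0.1' into a tuple of ints.

    Parsing stops at the first component that does not start with a digit,
    so a suffix such as 'rc1' is ignored rather than ordered correctly.
    """
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def is_at_least(installed, minimum=(3, 0, 1)):
    """Return True when the installed version is not older than `minimum`."""
    return parse_version(installed) >= minimum


def installed_paddleocr_version():
    """Return the installed paddleocr version string, or None if it is absent."""
    try:
        return metadata.version("paddleocr")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_paddleocr_version()
    if found is None:
        print("paddleocr is not installed")
    elif is_at_least(found):
        print(f"paddleocr {found} includes the 3.0.1 defaults")
    else:
        print(f"paddleocr {found} predates 3.0.1")
```

Environments that need reproducible results can still pin a release explicitly, for example `pip install paddleocr==3.0.1`.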
diff --git a/docs/quick_start.en.md b/docs/quick_start.en.md
@@ -23,7 +23,7 @@ python -m pip install paddlepaddle-gpu==3.0.0 -i https://www.paddlepaddle.org.cn
 #### 2. Install `paddleocr`
 
 ```bash
-pip install paddleocr==3.0.0
+pip install paddleocr
 ```
 
 ### Command Line Usage
@@ -66,11 +66,11 @@ pip install paddleocr==3.0.0
     # ocr = PaddleOCR(use_doc_orientation_classify=True, use_doc_unwarping=True) # text image preprocessing + text detection + textline orientation classification + text recognition
     # ocr = PaddleOCR(use_doc_orientation_classify=False, use_doc_unwarping=False) # text detection + textline orientation classification + text recognition
     # ocr = PaddleOCR(
-    #     text_detection_model_name="PP-OCRv5_server_det",
-    #     text_recognition_model_name="PP-OCRv5_server_rec",
+    #     text_detection_model_name="PP-OCRv5_mobile_det",
+    #     text_recognition_model_name="PP-OCRv5_mobile_rec",
     #     use_doc_orientation_classify=False,
     #     use_doc_unwarping=False,
-    #     use_textline_orientation=False) # Switch to PP-OCRv5_server models
+    #     use_textline_orientation=False) # Switch to PP-OCRv5_mobile models
     result = ocr.predict("./general_ocr_002.png")
     for res in result:
         res.print()
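
Since the default now uses the server models, the commented-out call in the quick start becomes the way to opt back into the lighter mobile models. The sketch below wraps that choice in a small helper. `ocr_kwargs` and `make_ocr` are hypothetical names introduced for illustration; the keyword arguments and model names are the ones shown in the diff above.

```python
def ocr_kwargs(variant="server"):
    """Build PaddleOCR keyword arguments for the PP-OCRv5 'server' or 'mobile' models."""
    if variant not in ("server", "mobile"):
        raise ValueError(f"unknown variant: {variant!r}")
    return {
        "text_detection_model_name": f"PP-OCRv5_{variant}_det",
        "text_recognition_model_name": f"PP-OCRv5_{variant}_rec",
        # The three switches below mirror the quick-start example.
        "use_doc_orientation_classify": False,
        "use_doc_unwarping": False,
        "use_textline_orientation": False,
    }


def make_ocr(variant="server"):
    """Create a PaddleOCR instance for the chosen variant.

    The import is deferred so that ocr_kwargs() stays usable (and testable)
    on machines where paddleocr is not installed.
    """
    from paddleocr import PaddleOCR

    return PaddleOCR(**ocr_kwargs(variant))
```

With paddleocr installed, `make_ocr("mobile").predict("./general_ocr_002.png")` would run the same pipeline as the quick-start snippet, using the mobile detection and recognition models.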
diff --git a/docs/quick_start.md b/docs/quick_start.md
@@ -23,7 +23,7 @@ python -m pip install paddlepaddle-gpu==3.0.0 -i https://www.paddlepaddle.org.cn
 #### 2. Install `paddleocr`
 
 ```bash
-pip install paddleocr==3.0.0
+pip install paddleocr
 ```
 
 ### Command-line usage
@@ -66,11 +66,11 @@ pip install paddleocr==3.0.0
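     # ocr = PaddleOCR(use_doc_orientation_classify=True, use_doc_unwarping=True) # text image preprocessing + text detection + orientation classification + text recognition
     # ocr = PaddleOCR(use_doc_orientation_classify=False, use_doc_unwarping=False) # text detection + text line orientation classification + text recognition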
     # ocr = PaddleOCR(
-    #     text_detection_model_name="PP-OCRv5_server_det",
-    #     text_recognition_model_name="PP-OCRv5_server_rec",
+    #     text_detection_model_name="PP-OCRv5_mobile_det",
+    #     text_recognition_model_name="PP-OCRv5_mobile_rec",
     #     use_doc_orientation_classify=False,
     #     use_doc_unwarping=False,
-    #     use_textline_orientation=False) # Switch to the PP-OCRv5_server models
+    #     use_textline_orientation=False) # Switch to the PP-OCRv5_mobile models
     result = ocr.predict("./general_ocr_002.png")
     for res in result:
         res.print()
diff --git a/docs/update/update.en.md b/docs/update/update.en.md
@@ -7,6 +7,26 @@ hide:
 
 ### Recently Update
 
+#### **🔥🔥 2025.06.05: Release of PaddleOCR v3.0.1, which includes:**
+
+- **Optimisation of certain models and model configurations:**
+  - Updated the default model configuration for PP-OCRv5, changing both detection and recognition from mobile to server models. To improve default performance in most scenarios, the parameter `limit_side_len` in the configuration has been changed from 736 to 64.
+  - Added a new text line orientation classification model `PP-LCNet_x1_0_textline_ori` with an accuracy of 99.42%. The default text line orientation classifier for OCR, PP-StructureV3, and PP-ChatOCRv4 pipelines has been updated to this model.
+  - Optimised the text line orientation classification model `PP-LCNet_x0_25_textline_ori`, improving accuracy by 3.3 percentage points to a current accuracy of 98.85%.
+
+- **Optimisation of issues present in version 3.0.0:**
+  - **Improved CLI usage experience:** When using the PaddleOCR CLI without passing any parameters, a usage prompt is now provided.
+  - **New parameters added:** PP-ChatOCRv3 and PP-StructureV3 now support the `use_textline_orientation` parameter.
+  - **CPU inference speed optimisation:** CPU inference for all pipelines now enables MKL-DNN by default.
+  - **Support for C++ inference:** The chained detection and recognition stage of PP-OCRv5 now supports C++ inference.
+
+- **Fixes for issues present in version 3.0.0:**
+  - Fixed an issue where PP-StructureV3 encountered CPU inference errors due to the inability to use MKL-DNN with formula and table recognition models.
+  - Fixed an issue where inference in some GPU environments failed with the error `FatalError: Process abort signal is detected by the operating system`.
+  - Fixed type hint issues in some Python 3.8 environments.
+  - Fixed the issue where the method `PPStructureV3.concatenate_markdown_pages` was missing.
+  - Fixed an issue where specifying both `lang` and `model_name` when instantiating `paddleocr.PaddleOCR` resulted in `model_name` being ineffective.
+
 #### **🔥🔥 2025.05.20: PaddleOCR 3.0 Official Release Highlights**
 
 - **PP-OCRv5: All-Scene Text Recognition Model**  
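
The accuracy figures quoted for the text line orientation classifiers are easier to compare as error rates. The sketch below works through the arithmetic, assuming the 3.3-point gain for `PP-LCNet_x0_25_textline_ori` was measured on the same evaluation set as the 98.85% figure (the notes do not say so explicitly). The function names are hypothetical.

```python
def previous_accuracy(current_pct, gain_points):
    """Accuracy before an improvement stated in percentage points."""
    return round(current_pct - gain_points, 2)


def error_rate(accuracy_pct):
    """Share of misclassified samples, in percent."""
    return round(100.0 - accuracy_pct, 2)


# PP-LCNet_x0_25_textline_ori: 98.85% after a gain of 3.3 percentage points.
before = previous_accuracy(98.85, 3.3)   # implied earlier accuracy
old_err = error_rate(before)             # error rate before the update
new_err = error_rate(98.85)              # error rate after the update

# Relative reduction in errors, which is a different quantity from the
# 3.3 percentage-point gain in accuracy.
relative_reduction = (old_err - new_err) / old_err

# PP-LCNet_x1_0_textline_ori, the new default, is quoted at 99.42%.
default_err = error_rate(99.42)
```

Under that assumption the smaller model went from 95.55% to 98.85% accuracy, so its error rate fell from 4.45% to 1.15%, roughly a 74% relative reduction. The new default model's quoted 99.42% corresponds to a 0.58% error rate.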
diff --git a/docs/update/update.md b/docs/update/update.md
@@ -7,6 +7,23 @@ hide:
 
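 ### Updates
 
+#### **🔥🔥2025.06.05: PaddleOCR v3.0.1 released, including:**
+- **Optimized some models and model configurations:**
+  - Updated the default PP-OCRv5 model configuration: detection and recognition both change from mobile to server models. To improve default results in most scenarios, the `limit_side_len` parameter in the configuration changes from 736 to 64.
+  - Added the text line orientation classification model `PP-LCNet_x1_0_textline_ori` with 99.42% accuracy; it is now the default text line orientation classifier for the OCR, PP-StructureV3, and PP-ChatOCRv4 pipelines.
+  - Optimized the text line orientation classification model `PP-LCNet_x0_25_textline_ori`: accuracy improves by 3.3 percentage points to 98.85%.
+- **Optimizations for some issues in version 3.0.0**
+  - **Improved CLI experience:** running the PaddleOCR CLI without any arguments now prints a usage hint.
+  - **New parameter:** PP-ChatOCRv3 and PP-StructureV3 support the `use_textline_orientation` parameter.
+  - **Faster CPU inference:** CPU inference for all pipelines enables MKL-DNN by default.
+  - **C++ inference support:** the chained detection and recognition stage of PP-OCRv5 supports C++ inference.
+- **Fixes for some issues in version 3.0.0**
+  - Fixed CPU inference errors in PP-StructureV3 in some cases, caused by the formula recognition and table recognition models being unable to use MKL-DNN.
+  - Fixed the `FatalError: Process abort signal is detected by the operating system` error raised during inference in some GPU environments.
+  - Fixed type hint issues in some Python 3.8 environments.
+  - Fixed the missing `PPStructureV3.concatenate_markdown_pages` method.
+  - Fixed `model_name` having no effect when both `lang` and `model_name` are specified while instantiating `paddleocr.PaddleOCR`.
+
 #### **🔥🔥2025.05.20: PaddleOCR 3.0 officially released, including:**
 
 - **Released the all-scenario text recognition model PP-OCRv5:** a single model supports five text types and complex handwriting recognition; overall recognition accuracy is 13 percentage points higher than the previous generation.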
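
The release notes do not explain why lowering `limit_side_len` from 736 to 64 helps. One plausible reading, stated here as an assumption rather than as the library's documented behaviour, is that the limit acts as a minimum on the shorter image side: inputs smaller than the limit are upscaled before detection, so a limit of 64 leaves most images at their native size. The function below is a hypothetical illustration of that reading, including an assumed rounding of each side to a multiple of 32; it is not PaddleOCR code.

```python
def det_resize_shape(height, width, limit_side_len=64, limit_type="min", stride=32):
    """Return the (height, width) an image would be resized to before detection.

    Assumed semantics (not confirmed by the release notes):
      - "min": upscale only when the shorter side is below the limit
      - "max": downscale only when the longer side exceeds the limit
    Each side is then rounded to a multiple of `stride`, with `stride` as a floor.
    """
    if limit_type == "min":
        side = min(height, width)
        ratio = limit_side_len / side if side < limit_side_len else 1.0
    elif limit_type == "max":
        side = max(height, width)
        ratio = limit_side_len / side if side > limit_side_len else 1.0
    else:
        raise ValueError(f"unknown limit_type: {limit_type!r}")

    new_h = max(int(round(height * ratio / stride) * stride), stride)
    new_w = max(int(round(width * ratio / stride) * stride), stride)
    return new_h, new_w


# A 480x640 image under the old and new limits (min-side semantics assumed).
old_shape = det_resize_shape(480, 640, limit_side_len=736)
new_shape = det_resize_shape(480, 640, limit_side_len=64)
```

Under these assumptions the old limit enlarges a 480x640 input to 736x992, while the new limit keeps it at 480x640 and only upscales images whose shorter side is under 64 pixels.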