You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+15-12
Original file line number
Diff line number
Diff line change
@@ -47,20 +47,23 @@ Easier to use: Just grab MinerU Desktop. No coding, no login, just a simple inte
47
47
</div>
48
48
49
49
# Changelog
50
-
- 2025/04/03 Release of version 1.3.0, with many changes in this version:
50
+
- 2025/04/03 Release of 1.3.0, in this version we made many optimizations and improvements:
51
51
- Installation and compatibility optimization
52
-
- By using paddleocr2torch, completely replaced the paddle framework and paddleocr used in the project, resolving conflicts between paddle and torch.
53
-
-Removed the use of layoutlmv3 in layout, solving compatibility issues caused by `detectron2`.
54
-
-Extended torch version compatibility to 2.2~2.6.
55
-
-CUDA compatibility extended to 11.8~12.6 (CUDA version determined by torch), addressing compatibility issues for some users with 50-series and H-series Nvidia GPUs.
56
-
-Python compatible versions extended to 3.10~3.12, resolving the issue of automatic downgrade to 0.6.1 during installation in non-3.10 environments.
57
-
- Performance optimization (compared to version 1.0.1, formula parsing speed improved by over 1400%, and overall parsing speed improved by over 500%)
58
-
-Improved parsing speed for batch processing of multiple small PDF files ([script example](demo/batch_demo.py)).
59
-
- Optimized the loading and usage of the mfr model, reducing memory usage and improving parsing speed. (requires re-executing the [model download process](docs/how_to_download_models_en.md) to obtain incremental updates of model files)
60
-
- Optimized memory usage, allowing the project to run with as little as 6GB.
61
-
- Improved running speed on mps devices.
52
+
- By removing the use of `layoutlmv3`in layout, resolved compatibility issues caused by `detectron2`.
53
+
-Torch version compatibility extended to 2.2~2.6 (excluding 2.5).
54
+
-CUDA compatibility supports 11.8/12.4/12.6 (CUDA version determined by torch), resolving compatibility issues for some users with 50-series and H-series GPUs.
55
+
-Python compatible versions expanded to 3.10~3.12, solving the problem of automatic downgrade to 0.6.1 during installation in non-3.10 environments.
56
+
-Offline deployment process optimized; no internet connection required after successful deployment to download any model files.
57
+
- Performance optimization
58
+
-By supporting batch processing of multiple PDF files ([script example](demo/batch_demo.py)), improved parsing speed for small files in batches (compared to version 1.0.1, formula parsing speed increased by over 1400%, overall parsing speed increased by over 500%).
59
+
- Optimized loading and usage of the mfr model, reducing GPU memory usage and improving parsing speed (requires re-execution of the [model download process](docs/how_to_download_models_en.md) to obtain incremental updates of model files).
60
+
- Optimized GPU memory usage, requiring only a minimum of 6GB to run this project.
61
+
- Improved running speed on MPS devices.
62
62
- Parsing effect optimization
63
-
- Updated the mfr model to unimernet(2503), solving the issue of missing line breaks in multi-line formulas.
63
+
- Updated the mfr model to `unimernet(2503)`, solving the issue of lost line breaks in multi-line formulas.
64
+
- Usability Optimization
65
+
- By using `paddleocr2torch`, completely replaced the use of the `paddle` framework and `paddleocr` in the project, resolving conflicts between `paddle` and `torch`, as well as thread safety issues caused by the `paddle` framework.
66
+
- Added a real-time progress bar during the parsing process to accurately track progress, making the wait less painful.
64
67
- 2025/03/03 1.2.1 released, fixed several bugs:
65
68
- Fixed the impact on punctuation marks during full-width to half-width conversion of letters and numbers
66
69
- Fixed caption matching inaccuracies in certain scenarios
0 commit comments