Open
Description
Description of the bug
I am unable to convert the attached PDF to DOCX.
I am using the parse method to simply convert all pages.
parse("sample.pdf", "sample.docx")
Here is the output that I get:
[INFO] Start to convert sample.pdf
[INFO] [1/4] Opening document...
[INFO] [2/4] Analyzing document...
[INFO] [3/4] Parsing pages...
[INFO] (1/1) Page 1
[INFO] [4/4] Creating pages...
[INFO] (1/1) Page 1
[ERROR] Ignore page 1 due to making page error: invalid literal for int() with base 16: 'x1'
[INFO] Terminated in 0.21s.
How to reproduce the bug
See my description. It has the sample PDF file and the code snippet causing the error.
pdf2docx version
0.5.8
Operating system
MacOS
Python version
3.10
Metadata
Metadata
Assignees
Labels
No labels
Activity