Commit c305d10
authored
Fix PDFMiner bug (#253)
Issue:
In some cases, PDFMiner identifies an image document as a full page and
in other installations not. It is difficult to find out when PDFMiner
behaves in one way or another. In either case tested, the version is
`pdfminer.six v20221105`. The solution is to ignore any annotation
coming from Chipper in case the full page clearing code is activated.
Not sure if this is relevant to other models.
---------
Co-authored-by: Antonio Jimeno Yepes <[email protected]>1 parent 2493089 commit c305d10
File tree
4 files changed
+9
-2
lines changed- unstructured_inference
- inference
- models
4 files changed
+9
-2
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
| 1 | + | |
2 | 2 | | |
| 3 | + | |
| 4 | + | |
3 | 5 | | |
4 | 6 | | |
5 | 7 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
| 1 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
125 | 125 | | |
126 | 126 | | |
127 | 127 | | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
128 | 131 | | |
129 | 132 | | |
130 | 133 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
5 | 5 | | |
6 | 6 | | |
7 | 7 | | |
| 8 | + | |
8 | 9 | | |
9 | 10 | | |
10 | 11 | | |
| |||
134 | 135 | | |
135 | 136 | | |
136 | 137 | | |
| 138 | + | |
137 | 139 | | |
138 | 140 | | |
139 | 141 | | |
| |||
0 commit comments