Commit a501616
Table processing (#72)
* First commit
* Table processing in document layout
* Platform x86_64 check'
* PaddleOCR integrated
* Deactivate show_log in paddleocr
* Utilize layout updates
* Formatting and linting
* Correct how linting ignores are accumulated
* Removed fitz
* Bug fixed in intersect_rect
* Bump to default 200 dpi
* Updated README with instructions to install paddleocr
* Fixed typo
* Deal with empty case
* Formatting
* Updates to pass flake8
* Added table test
* Fixed test
* Typing changes
* formatting for large fixture
* Up pixel to reflect new dpi
* Make table extraction opt-in
* Change content to check for
* Add install targets for paddleocr
* Add optional pip install for paddleocr
* Update README.md
Updated paddleocr installation instructions
* Remove unused functions
* New image for table testing
* Remove non-unique assignment case
* Correct slot_into_contains arguments
* Test for nms
* update fixtures
* Added test for nms
* fix for disable table extraction by default
* Revised test
* Update old tests
* reuse postprocess
* Additional tests
* More rect tests, extract_text_from_spans
* Remove unused code
* Align supercells test
* Updated removal supercell test
* header_supercell_tree test
* name change to forked paddleocr
* Updated installation and removel of a print statement
* tidied file
* Version update
* linting
* Updated test
* Changed Makefile
---------
Co-authored-by: Antonio Jimeno Yepes <[email protected]>
Co-authored-by: Alan Bertl <[email protected]>1 parent 4a52922 commit a501616
File tree
14 files changed
+1894
-13
lines changed- sample-docs
- test_unstructured_inference
- inference
- models
- unstructured_inference
- inference
- models
14 files changed
+1894
-13
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
| 1 | + | |
2 | 2 | | |
| 3 | + | |
3 | 4 | | |
4 | 5 | | |
5 | 6 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
20 | 20 | | |
21 | 21 | | |
22 | 22 | | |
23 | | - | |
| 23 | + | |
24 | 24 | | |
25 | 25 | | |
26 | 26 | | |
| |||
31 | 31 | | |
32 | 32 | | |
33 | 33 | | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
34 | 38 | | |
35 | 39 | | |
36 | 40 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
34 | 34 | | |
35 | 35 | | |
36 | 36 | | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
37 | 48 | | |
38 | 49 | | |
39 | 50 | | |
| |||
Loading
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3 | 3 | | |
4 | 4 | | |
5 | 5 | | |
6 | | - | |
| 6 | + | |
7 | 7 | | |
8 | 8 | | |
9 | 9 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
18 | 18 | | |
19 | 19 | | |
20 | 20 | | |
| 21 | + | |
21 | 22 | | |
22 | 23 | | |
23 | 24 | | |
| |||
60 | 61 | | |
61 | 62 | | |
62 | 63 | | |
63 | | - | |
| 64 | + | |
64 | 65 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
186 | 186 | | |
187 | 187 | | |
188 | 188 | | |
189 | | - | |
| 189 | + | |
190 | 190 | | |
191 | 191 | | |
192 | 192 | | |
193 | 193 | | |
| 194 | + | |
194 | 195 | | |
195 | 196 | | |
196 | 197 | | |
| |||
0 commit comments