Commit 8f05857
feat!: Expose DoclingDocument as main type, move old typing to legacy (docling-project#41)
* Fix area method of BoundingBox
Signed-off-by: Christoph Auer <[email protected]>
* add image placeholder
Signed-off-by: Michele Dolfi <[email protected]>
* enable picture label
Signed-off-by: Michele Dolfi <[email protected]>
* refactor captions and markdown
Signed-off-by: Michele Dolfi <[email protected]>
* add logic to skip repeated caption
Signed-off-by: Michele Dolfi <[email protected]>
* use DocItemLabel
Signed-off-by: Michele Dolfi <[email protected]>
* Extend default export labels, add convenience mehtods
Signed-off-by: Christoph Auer <[email protected]>
* Introduce ListItem API, with marker and enumerated properties
Signed-off-by: Christoph Auer <[email protected]>
* add classification and description in PictureData
Signed-off-by: Michele Dolfi <[email protected]>
* add molecule picture data
Signed-off-by: Michele Dolfi <[email protected]>
* Fixes for DoclingDocument and aligned methods on legacy doc
Signed-off-by: Christoph Auer <[email protected]>
* add advanced picture data content
Signed-off-by: Michele Dolfi <[email protected]>
* Many markdown export fixes, renaming BaseTableData
Signed-off-by: Christoph Auer <[email protected]>
* Rename module paths doc->legacy_doc, experimental->doc
Signed-off-by: Christoph Auer <[email protected]>
* feat: imageref with pil_image
Signed-off-by: Michele Dolfi <[email protected]>
* Small fixes
Signed-off-by: Christoph Auer <[email protected]>
* docs: remove documentation in markdown to support python 3.13 (docling-project#43)
Since json-schema-for-humans dependency does not support python 3.13,
remove the generation of documentation in markdown of main docling types.
Remove 'ds' prefix from documentation scripts.
Update README.
Add python 3.13 in CI/CD workflow checks.
Signed-off-by: Cesar Berrospi Ramis <[email protected]>
* Fix TableCell model validator
Signed-off-by: Christoph Auer <[email protected]>
* store list of classes in classification
Signed-off-by: Michele Dolfi <[email protected]>
* Fixes for DocumentOrigin mimetype validation
Signed-off-by: Christoph Auer <[email protected]>
* introduce picturedata as list of annotations
Signed-off-by: Michele Dolfi <[email protected]>
* feat: adapt hierarchical chunker to v2 DoclingDocument
[skip-ci]
Signed-off-by: Panos Vagenas <[email protected]>
* feat: add table support in chunker, incl. captions
Signed-off-by: Panos Vagenas <[email protected]>
* use Field constraints instead of conlist, refactor chunking types
Signed-off-by: Panos Vagenas <[email protected]>
* revert unnecessary doc module change
Signed-off-by: Panos Vagenas <[email protected]>
* align test data with upstream changes
Signed-off-by: Panos Vagenas <[email protected]>
* Update __init__.py on docling_core.types.doc
Signed-off-by: Christoph Auer <[email protected]>
* Remove DescriptionItem
Signed-off-by: Christoph Auer <[email protected]>
---------
Signed-off-by: Christoph Auer <[email protected]>
Signed-off-by: Michele Dolfi <[email protected]>
Signed-off-by: Cesar Berrospi Ramis <[email protected]>
Signed-off-by: Panos Vagenas <[email protected]>
Co-authored-by: Michele Dolfi <[email protected]>
Co-authored-by: Cesar Berrospi Ramis <[email protected]>
Co-authored-by: Panos Vagenas <[email protected]>1 parent 2b5bbd1 commit 8f05857
File tree
87 files changed
+28617
-16884
lines changed- .github/workflows
- docling_core
- transforms
- chunker
- id_generator
- metadata_extractor
- types
- doc
- experimental
- legacy_doc
- rec
- utils
- docs
- test
- data
- chunker
- docling_document/unit
- doc
- legacy_doc
- intermediates
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
87 files changed
+28617
-16884
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
6 | 6 | | |
7 | 7 | | |
8 | 8 | | |
9 | | - | |
| 9 | + | |
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
52 | 52 | | |
53 | 53 | | |
54 | 54 | | |
55 | | - | |
| 55 | + | |
56 | 56 | | |
57 | 57 | | |
58 | 58 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
3 | 3 | | |
4 | | - | |
| 4 | + | |
5 | 5 | | |
6 | 6 | | |
7 | 7 | | |
| |||
21 | 21 | | |
22 | 22 | | |
23 | 23 | | |
24 | | - | |
| 24 | + | |
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
| |||
45 | 45 | | |
46 | 46 | | |
47 | 47 | | |
48 | | - | |
| 48 | + | |
49 | 49 | | |
50 | 50 | | |
51 | 51 | | |
52 | | - | |
| 52 | + | |
53 | 53 | | |
54 | 54 | | |
55 | | - | |
| 55 | + | |
56 | 56 | | |
57 | 57 | | |
58 | 58 | | |
| |||
61 | 61 | | |
62 | 62 | | |
63 | 63 | | |
64 | | - | |
| 64 | + | |
65 | 65 | | |
66 | 66 | | |
67 | | - | |
| 67 | + | |
68 | 68 | | |
69 | | - | |
| 69 | + | |
70 | 70 | | |
71 | 71 | | |
72 | 72 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
5 | 5 | | |
6 | 6 | | |
7 | 7 | | |
8 | | - | |
9 | | - | |
10 | | - | |
11 | | - | |
12 | | - | |
13 | | - | |
14 | | - | |
15 | | - | |
| 8 | + | |
| 9 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
4 | 4 | | |
5 | 5 | | |
6 | 6 | | |
7 | | - | |
8 | 7 | | |
9 | | - | |
| 8 | + | |
10 | 9 | | |
11 | | - | |
| 10 | + | |
12 | 11 | | |
13 | | - | |
14 | | - | |
| 12 | + | |
15 | 13 | | |
16 | | - | |
17 | | - | |
18 | 14 | | |
| 15 | + | |
| 16 | + | |
19 | 17 | | |
20 | | - | |
21 | | - | |
| 18 | + | |
| 19 | + | |
22 | 20 | | |
| 21 | + | |
| 22 | + | |
23 | 23 | | |
24 | | - | |
25 | | - | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
26 | 28 | | |
27 | | - | |
28 | | - | |
29 | | - | |
30 | 29 | | |
31 | | - | |
32 | | - | |
33 | | - | |
34 | | - | |
35 | | - | |
36 | | - | |
37 | | - | |
38 | | - | |
39 | | - | |
40 | | - | |
41 | | - | |
| 30 | + | |
| 31 | + | |
42 | 32 | | |
| 33 | + | |
| 34 | + | |
43 | 35 | | |
44 | | - | |
45 | | - | |
| 36 | + | |
| 37 | + | |
46 | 38 | | |
47 | | - | |
48 | | - | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
49 | 43 | | |
50 | 44 | | |
51 | 45 | | |
52 | | - | |
| 46 | + | |
53 | 47 | | |
54 | 48 | | |
55 | | - | |
| 49 | + | |
56 | 50 | | |
57 | 51 | | |
58 | 52 | | |
59 | | - | |
| 53 | + | |
60 | 54 | | |
61 | 55 | | |
62 | 56 | | |
63 | 57 | | |
64 | 58 | | |
65 | | - | |
| 59 | + | |
66 | 60 | | |
67 | 61 | | |
68 | | - | |
69 | | - | |
70 | | - | |
71 | | - | |
72 | | - | |
73 | | - | |
74 | | - | |
0 commit comments