cau-git commented Oct 9, 2024

No description provided.

cau-git and others added 30 commits October 9, 2024 19:26
Signed-off-by: Christoph Auer <[email protected]>
Signed-off-by: Michele Dolfi <[email protected]>
Signed-off-by: Michele Dolfi <[email protected]>
Signed-off-by: Michele Dolfi <[email protected]>
Signed-off-by: Michele Dolfi <[email protected]>
Signed-off-by: Michele Dolfi <[email protected]>
Signed-off-by: Michele Dolfi <[email protected]>
Signed-off-by: Christoph Auer <[email protected]>
Since json-schema-for-humans dependency does not support python 3.13,
remove the generation of documentation in markdown of main docling types.
Remove 'ds' prefix from documentation scripts.
Update README.
Add python 3.13 in CI/CD workflow checks.

Signed-off-by: Cesar Berrospi Ramis <[email protected]>
Signed-off-by: Christoph Auer <[email protected]>
cau-git marked this pull request as ready for review October 16, 2024 13:47
cau-git changed the title from "fix: Various improvements and fixes backported from docling" to "feat!: Expose DoclingDocument as main type, move old typing to legacy" Oct 16, 2024
cau-git requested review from PeterStaar-IBM, ceberam, dolfim-ibm and vagenas and removed request for ceberam and vagenas October 16, 2024 13:49

dolfim-ibm left a comment

LGTM

cau-git merged commit 03df97f into main Oct 16, 2024
6 checks passed
cau-git deleted the cau/improvements branch October 16, 2024 13:53
muhark added a commit to muhark/docling-core that referenced this pull request Mar 19, 2025
…docling-project#41)

* Fix area method of BoundingBox

Signed-off-by: Christoph Auer <[email protected]>

* add image placeholder

Signed-off-by: Michele Dolfi <[email protected]>

* enable picture label

Signed-off-by: Michele Dolfi <[email protected]>

* refactor captions and markdown

Signed-off-by: Michele Dolfi <[email protected]>

* add logic to skip repeated caption

Signed-off-by: Michele Dolfi <[email protected]>

* use DocItemLabel

Signed-off-by: Michele Dolfi <[email protected]>

* Extend default export labels, add convenience methods

Signed-off-by: Christoph Auer <[email protected]>

* Introduce ListItem API, with marker and enumerated properties

Signed-off-by: Christoph Auer <[email protected]>

* add classification and description in PictureData

Signed-off-by: Michele Dolfi <[email protected]>

* add molecule picture data

Signed-off-by: Michele Dolfi <[email protected]>

* Fixes for DoclingDocument and aligned methods on legacy doc

Signed-off-by: Christoph Auer <[email protected]>

* add advanced picture data content

Signed-off-by: Michele Dolfi <[email protected]>

* Many markdown export fixes, renaming BaseTableData

Signed-off-by: Christoph Auer <[email protected]>

* Rename module paths doc->legacy_doc, experimental->doc

Signed-off-by: Christoph Auer <[email protected]>

* feat: imageref with pil_image

Signed-off-by: Michele Dolfi <[email protected]>

* Small fixes

Signed-off-by: Christoph Auer <[email protected]>

* docs: remove documentation in markdown to support python 3.13 (docling-project#43)

Since json-schema-for-humans dependency does not support python 3.13,
remove the generation of documentation in markdown of main docling types.
Remove 'ds' prefix from documentation scripts.
Update README.
Add python 3.13 in CI/CD workflow checks.

Signed-off-by: Cesar Berrospi Ramis <[email protected]>

* Fix TableCell model validator

Signed-off-by: Christoph Auer <[email protected]>

* store list of classes in classification

Signed-off-by: Michele Dolfi <[email protected]>

* Fixes for DocumentOrigin mimetype validation

Signed-off-by: Christoph Auer <[email protected]>

* introduce picturedata as list of annotations

Signed-off-by: Michele Dolfi <[email protected]>

* feat: adapt hierarchical chunker to v2 DoclingDocument

[skip-ci]

Signed-off-by: Panos Vagenas <[email protected]>

* feat: add table support in chunker, incl. captions

Signed-off-by: Panos Vagenas <[email protected]>

* use Field constraints instead of conlist, refactor chunking types

Signed-off-by: Panos Vagenas <[email protected]>

* revert unnecessary doc module change

Signed-off-by: Panos Vagenas <[email protected]>

* align test data with upstream changes

Signed-off-by: Panos Vagenas <[email protected]>

* Update __init__.py on docling_core.types.doc

Signed-off-by: Christoph Auer <[email protected]>

* Remove DescriptionItem

Signed-off-by: Christoph Auer <[email protected]>

---------

Signed-off-by: Christoph Auer <[email protected]>
Signed-off-by: Michele Dolfi <[email protected]>
Signed-off-by: Cesar Berrospi Ramis <[email protected]>
Signed-off-by: Panos Vagenas <[email protected]>
Co-authored-by: Michele Dolfi <[email protected]>
Co-authored-by: Cesar Berrospi Ramis <[email protected]>
Co-authored-by: Panos Vagenas <[email protected]>

5 participants