Investigate why DoclingDocument.load_from_doctags requires image for bbox extraction #435

Several places in the code guard bounding-box extraction behind an image check, for example: https://github.com/docling-project/docling-core/blob/main/docling_core/types/doc/document.py#L5779

There, the `extract_bounding_box` method is called only if `image is not None`. This constraint appears unnecessary, since the method works without it at other call sites.

One consequence: when parsing doctags that include list items, the list_item provenance is not populated.
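To make the question concrete, here is a minimal standalone sketch, not the docling-core implementation. It assumes that location tokens have the form `<loc_N>` on a 500-step grid; the names `parse_loc_tokens`, `to_page_coords`, `NormBox`, and `LOC_GRID` are hypothetical. It illustrates that the tokens themselves can be turned into a normalized box without an image, and that a page size is only needed for the optional scaling step.

```python
import re
from typing import NamedTuple, Optional, Tuple

LOC_GRID = 500  # assumption: <loc_N> values lie on a 0..500 grid


class NormBox(NamedTuple):
    """Bounding box; coordinates are normalized to 0..1 unless scaled."""
    l: float
    t: float
    r: float
    b: float


def parse_loc_tokens(chunk: str, grid: int = LOC_GRID) -> Optional[NormBox]:
    """Read the first four <loc_N> tokens of a chunk as a normalized box."""
    coords = re.findall(r"<loc_(\d+)>", chunk)
    if len(coords) < 4:
        return None  # no complete box in this chunk
    l, t, r, b = (int(c) / grid for c in coords[:4])
    return NormBox(l, t, r, b)


def to_page_coords(
    box: NormBox, page_size: Optional[Tuple[float, float]] = None
) -> NormBox:
    """Scale to page units if a size is known, else keep normalized values."""
    if page_size is None:
        return box
    width, height = page_size
    return NormBox(box.l * width, box.t * height, box.r * width, box.b * height)


# Hypothetical doctags fragment for a list item, with no image available.
item = "<list_item><loc_50><loc_100><loc_450><loc_125>First entry</list_item>"
box = parse_loc_tokens(item)
scaled = to_page_coords(box, page_size=(1000, 2000))
```

If the image is only used to obtain the page size for scaling, a fallback like the one in `to_page_coords` would allow provenance to be populated in normalized coordinates when no image is supplied. Whether that matches the intent of the existing check is exactly what this issue asks to investigate.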
