Skip to content

Conversation

@vagenas
Copy link
Collaborator

@vagenas vagenas commented Oct 21, 2025

Main goal has been to establish metadata model that:

  • allows both prescribed and custom fields, with a way to distinguish the two
  • allows both common and type-specific fields, so that we can further specify the metadata reqs e.g. for PictureItem
  • accounts for serialization and deserialization, e.g. ideally avoiding serializing additional data if no meta info available

@mergify
Copy link

mergify bot commented Oct 21, 2025

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

🟢 Require two reviewer for test updates

Wonderful, this rule succeeded.

When test data is updated, we require two reviewers

  • #approved-reviews-by >= 2

@codecov
Copy link

codecov bot commented Oct 21, 2025

Codecov Report

❌ Patch coverage is 94.32624% with 16 lines in your changes missing coverage. Please review.

Files with missing lines Patch % Lines
docling_core/types/doc/document.py 93.07% 9 Missing ⚠️
docling_core/transforms/serializer/doctags.py 80.00% 4 Missing ⚠️
docling_core/transforms/serializer/html.py 92.30% 2 Missing ⚠️
docling_core/transforms/serializer/markdown.py 96.96% 1 Missing ⚠️

📢 Thoughts on this report? Let us know!

@github-actions
Copy link
Contributor

github-actions bot commented Oct 21, 2025

DCO Check Passed

Thanks @vagenas, all your commits are properly signed off. 🎉

Signed-off-by: Panos Vagenas <[email protected]>
Signed-off-by: Panos Vagenas <[email protected]>
Signed-off-by: Panos Vagenas <[email protected]>
Signed-off-by: Panos Vagenas <[email protected]>
Signed-off-by: Panos Vagenas <[email protected]>
Signed-off-by: Panos Vagenas <[email protected]>
Signed-off-by: Panos Vagenas <[email protected]>
Signed-off-by: Panos Vagenas <[email protected]>
…amless chunking integration

Signed-off-by: Panos Vagenas <[email protected]>
@vagenas vagenas marked this pull request as ready for review October 28, 2025 22:36
@dosubot
Copy link

dosubot bot commented Oct 28, 2025

Documentation Updates

Checked 3 published document(s). No updates required.

How did I do? Any feedback?  Join Discord

PeterStaar-IBM
PeterStaar-IBM previously approved these changes Oct 29, 2025
Copy link
Contributor

@PeterStaar-IBM PeterStaar-IBM left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm!

However, it would be good to update the init.py with the newly introduced classes.

Copy link
Contributor

@PeterStaar-IBM PeterStaar-IBM left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm!

@vagenas vagenas merged commit 2ee3cac into main Oct 30, 2025
13 checks passed
@vagenas vagenas deleted the redesign-metadata branch October 30, 2025 10:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants