@pbrady pbrady commented Oct 10, 2025

The VLM pipeline previously crashed on nested Markdown lists. This fix preserves the list hierarchy by tracking parent-child relationships through ListGroup containers.

Issue resolved by this Pull Request:
Resolves #2301

Checklist:

  • Tests have been added, if necessary.

github-actions bot commented Oct 10, 2025

DCO Check Passed

Thanks @pbrady, all your commits are properly signed off. 🎉

dosubot bot commented Oct 10, 2025

Related Documentation

Checked 2 published document(s). No updates required.

mergify bot commented Oct 10, 2025

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

Fixes docling-project#2301

- Use iterate_items(with_groups=True) to reveal ListGroup containers
- Track parent relationships with level-based stack
- Copy and clear children before append to satisfy constraints
- Add type annotation for level_stack (mypy compliance)
- Include comprehensive test suite for nested list structures

The VLM pipeline previously crashed on nested Markdown lists and
flattened the document structure. This fix preserves the hierarchy
by tracking parent-child relationships through ListGroup containers.

Signed-off-by: Peter Brady <[email protected]>
@pbrady pbrady force-pushed the fix/issue-2301-vlm-nested-list-structure branch from d93427a to abac00a Compare October 10, 2025 16:46
@dolfim-ibm dolfim-ibm requested review from cau-git and vagenas October 13, 2025 07:03
codecov bot commented Oct 13, 2025

Codecov Report

❌ Patch coverage is 93.75000% with 1 line in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| docling/pipeline/vlm_pipeline.py | 93.75% | 1 Missing ⚠️ |

Development

Successfully merging this pull request may close these issues.

Error processing Markdown nested list with VLM pipeline

3 participants