feat(serializer): add MsExcelMarkdownDocSerializer for sheet-name headings#587
Conversation
…dings Add `MsExcelMarkdownFallbackSerializer` and `MsExcelMarkdownDocSerializer` to the serializer package so that `GroupLabel.SHEET` groups are rendered as level-2 Markdown headings when exporting Excel-sourced documents. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Merge ProtectionsYour pull request matches the following merge protections and will not be merged until they are valid. 🟢 Enforce conventional commitWonderful, this rule succeeded.Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
|
|
✅ DCO Check Passed Thanks @Smeet23, all your commits are properly signed off. 🎉 |
|
@Smeet23 Please run the DCO and make sure all checks work! |
I, Smeet23 <smeetagrawal2003@gmail.com>, hereby add my Signed-off-by to this commit: 2a3808e Signed-off-by: Smeet23 <smeetagrawal2003@gmail.com>
|
@PeterStaar-IBM Done! Added a DCO remediation commit and all checks are now passing — DCO, dco_advisor, and Mergify merge protections are all green. Thanks for the review and the approval! |
|
@Smeet23 Please run the |
Signed-off-by: Smeet23 <smeetagrawal2003@gmail.com>
|
Hi @PeterStaar-IBM — thanks for the earlier LGTM! I've pushed a small follow-up fix: the |
Codecov Report❌ Patch coverage is
📢 Thoughts on this report? Let us know! |
Summary
MsExcelMarkdownFallbackSerializerandMsExcelMarkdownDocSerializertodocling_core/transforms/serializer/markdown_excel.pyGroupLabel.SHEETgroups as level-2 Markdown headings (## <sheet_name>) when exporting Excel-sourcedDoclingDocumentsdoclingrepo but belongs here per reviewer feedbackTest plan
MsExcelMarkdownDocSerializerrenders sheet names as## <name>headingsfrom docling_core.transforms.serializer.markdown_excel import MsExcelMarkdownDocSerializerworks correctly🤖 Generated with Claude Code