feat: Add uspto backend meta-data extraction #2284

vku-ibm · 2025-09-18T08:57:32Z

Adds extraction of the meta-data for uspto-backend that handles parsing of uspto patents in xml form.

Issue resolved by this Pull Request:
Resolves #2273

Checklist:

Documentation has been updated, if necessary.
Examples have been added, if necessary.
Tests have been added, if necessary.

Signed-off-by: Viktor Kuropiatnyk <[email protected]>

github-actions · 2025-09-18T08:57:40Z

✅ DCO Check Passed

Thanks @vku-ibm, all your commits are properly signed off. 🎉

mergify · 2025-09-18T08:58:06Z

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

cau-git · 2025-09-19T10:46:04Z

docling/backend/abstract_backend.py

    def supported_formats(cls) -> Set["InputFormat"]:
        pass

+    @abstractmethod


This @abstractmethod tag should not be necessary since you are providing a default implementation.

PeterStaar-IBM · 2025-09-20T10:57:34Z

@vku-ibm This is a really good addition! Could you, for the PDF pipelines,

Extract the meta-data through the docling-parse metadata extraction methods
Add the Table-of-contents from the pdf (if it has any) via docling-parse?

Stub for implementing uspto backend meta-data extraction

6455579

Signed-off-by: Viktor Kuropiatnyk <[email protected]>

cau-git reviewed Sep 19, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Add uspto backend meta-data extraction #2284

feat: Add uspto backend meta-data extraction #2284

Uh oh!

vku-ibm commented Sep 18, 2025

Uh oh!

github-actions bot commented Sep 18, 2025

Uh oh!

mergify bot commented Sep 18, 2025

Uh oh!

cau-git Sep 19, 2025 •

edited

Loading

Uh oh!

PeterStaar-IBM commented Sep 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat: Add uspto backend meta-data extraction #2284

Are you sure you want to change the base?

feat: Add uspto backend meta-data extraction #2284

Uh oh!

Conversation

vku-ibm commented Sep 18, 2025

Uh oh!

github-actions bot commented Sep 18, 2025

Uh oh!

mergify bot commented Sep 18, 2025

Merge Protections

🟢 Enforce conventional commit

Uh oh!

cau-git Sep 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

PeterStaar-IBM commented Sep 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cau-git Sep 19, 2025 •

edited

Loading