Skip to content

DocumentParser class refactoring #114004

Open
@salvatore-campagna

Description

@salvatore-campagna

Description

The DocumentParser class has undergone significant changes to enhance synthetic source support, which has introduced added complexity and difficult-to-understand logic into the code.

To address these issues, a more effective design could involve creating a base class that consolidates shared generic code, along with two specific implementations: one for parsing documents when the source is stored and another for parsing documents when the source is synthetic. This approach would not only simplify the overall document parsing logic but also allow for specialization in the parsing behavior, making it easier to manage and understand.

By separating the parsing logic based on the type of source, we can achieve a cleaner and more maintainable codebase, ultimately improving the functionality of the DocumentParser.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions