Skip to content

Ignored text before tables #1251

Open
Open
@panagiotis-tsolakis

Description

@panagiotis-tsolakis

In a sample of 2 pdfs that I converted into TEI with Grobid, I noticed that text lines preceding a table were sometimes dropped and did not appear in the final TEI file. The following screenshots come from two consecutive pages of a pdf file:

Image

Image

The text underlined in yellow was ignored by Grobid, as you can see in the TEI file. The text of the table was ignored as well by Grobid.

Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions