Open
Description
Hi, When I use partition_type(file=io.BytesIO(file.file.read()),languages=["chi_sim"])
to parse Chinese pdf documents, I found the result was to split the paragraph text into a line text as a elemet. And another problem is element type isn't accurate, should be UncategorizedText but actually is Title