Skip to content

0.10.10

Compare
Choose a tag to compare
@cragwolfe cragwolfe released this 31 Aug 02:14
· 1035 commits to main since this release
a4ec43a

0.10.10

Enhancements

  • Adds text as an input parameter to partition_xml.
  • partition_xml no longer runs through partition_text, avoiding incorrect splitting
    on carriage returns in the XML. Since partition_xml no longer calls partition_text,
    min_partition and max_partition are no longer supported in partition_xml.
  • Bump unstructured-inference==0.5.18, change non-default detectron2 classification threshold
  • Upgrade base image from rockylinux 8 to rockylinux 9
  • Serialize IngestDocs to JSON when passing to subprocesses

Features

Fixes

  • Fix a bug where mismatched elements and bboxes are passed into add_pytesseract_bbox_to_elements