Skip to content

0.6.0

Compare
Choose a tag to compare
@qued qued released this 21 Apr 17:11
· 1381 commits to main since this release
dc4147d

0.6.0

Enhancements

  • Adds an ssl_verify kwarg to partition and partition_html to enable turning off
    SSL verification for HTTP requests. SSL verification is on by default.
  • Allows users to pass in ocr language to partition_pdf and partition_image through
    the ocr_language kwarg. ocr_language corresponds to the code for the language pack
    in Tesseract. You will need to install the relevant Tesseract language pack to use a
    given language.

Features

  • Table extraction is now possible for pdfs from partition and partition_pdf.
  • Adds support for extracting attachments from .msg files

Fixes