Skip to content

0.7.7

Choose a tag to compare

@MthwRobinson MthwRobinson released this 20 Jun 19:13
· 1347 commits to main since this release
c53ce11

0.7.7

Enhancements

  • Adds functionality to replace the MIME encodings for eml files with one of the common encodings if a unicode error occurs
  • Adds missed file-like object handling in detect_file_encoding
  • Adds functionality to extract charset info from eml files

Features

  • Added coordinate system class to track coordinate types and convert to different coordinate

Fixes

  • Adds an html_assemble_articles kwarg to partition_html to enable users to capture
    control whether content outside of <article> tags is captured when
    <article> tags are present.
  • Check for the xml attribute on element before looking for pagebreaks in partition_docx.