Skip to content

0.5.12

Compare
Choose a tag to compare
@MthwRobinson MthwRobinson released this 12 Apr 18:48
· 1406 commits to main since this release
e2e473d

0.5.12

Enhancements

  • Add OS mimetypes DB to docker image, mainly for unstructured-api compat.
  • Use the image registry as a cache when building Docker images.
  • Adds the ability for partition_text to group together broken paragraphs.

Features

  • Add --partition-by-api parameter to unstructured-ingest
  • Added partition_rtf for processing rich text files.
  • partition now accepts a url kwarg in addition to file and filename.

Fixes

  • Allow encoding to be passed into replace_mime_encodings.
  • unstructured-ingest connector-specific dependencies are imported on demand.
  • unstructured-ingest --flatten-metadata supported for local connector.
  • unstructured-ingest fix runtime error when using --metadata-include.