Skip to content

Releases: Unstructured-IO/unstructured-ingest

0.0.4

19 Aug 15:42
78af563
Compare
Choose a tag to compare

Enhancements

  • Add Couchbase Destination Connector Adds support for storing artifacts in Couchbase DB for Vector Search
  • Leverage pydantic base models All user-supplied configs are now derived from pydantic base models to leverage better type checking and add built in support for sensitive fields.
  • Autogenerate click options from base models Leverage th pydantic base models for all configs to autogenerate teh cli options exposed when running ingest as a CLI.
  • Drop required Unstructured dependency Unstructured was moved to an extra dependency to only be imported when needed for functionality such as local partitioning/chunking.
  • Rebrand Astra to Astra DB The Astra DB integration was re-branded to be consistent with DataStax standard branding.

0.0.3

08 Aug 12:47
2989a45
Compare
Choose a tag to compare

Enhancements

  • Improve documentation Update the README's.
  • Explicit Opensearch classes For the connector registry entries for opensearch, use only opensearch specific classes rather than any elasticsearch ones.
  • Add missing fsspec destination precheck check connection in precheck for all fsspec-based destination connectors

0.0.2

01 Aug 20:29
8d37b4e
Compare
Choose a tag to compare

Enhancements

  • Use uuid for s3 identifiers Update unique id to use uuid derived from file path rather than the filepath itself.
  • V2 connectors precheck support All steps in the v2 pipeline support an optional precheck call, which encompasses the previous check connection functionality.
  • Filter Step Support dedicated step as part of the pipeline to filter documents.

0.0.1

31 Jul 14:49
71d226f
Compare
Choose a tag to compare
Remove references to old repo (#6)