Releases: Unstructured-IO/unstructured-ingest
Releases · Unstructured-IO/unstructured-ingest
0.0.4
Enhancements
- Add Couchbase Destination Connector Adds support for storing artifacts in Couchbase DB for Vector Search
- Leverage pydantic base models All user-supplied configs are now derived from pydantic base models to leverage better type checking and add built in support for sensitive fields.
- Autogenerate click options from base models Leverage th pydantic base models for all configs to autogenerate teh cli options exposed when running ingest as a CLI.
- Drop required Unstructured dependency Unstructured was moved to an extra dependency to only be imported when needed for functionality such as local partitioning/chunking.
- Rebrand Astra to Astra DB The Astra DB integration was re-branded to be consistent with DataStax standard branding.
0.0.3
Enhancements
- Improve documentation Update the README's.
- Explicit Opensearch classes For the connector registry entries for opensearch, use only opensearch specific classes rather than any elasticsearch ones.
- Add missing fsspec destination precheck check connection in precheck for all fsspec-based destination connectors
0.0.2
Enhancements
- Use uuid for s3 identifiers Update unique id to use uuid derived from file path rather than the filepath itself.
- V2 connectors precheck support All steps in the v2 pipeline support an optional precheck call, which encompasses the previous check connection functionality.
- Filter Step Support dedicated step as part of the pipeline to filter documents.