Commit 389dbb6
MthwRobinson, qued, and ahmetmeleq authored
fix: add missing dep files to manifest (#2516)
### Summary

Closes #2484. Adds missing dependency files to `MANIFEST.in` so they are included in the Python distribution. Also updates the manifest to look for ingest dependencies in the `requirements/ingest` subdirectory.

---------

Co-authored-by: qued <[email protected]>
Co-authored-by: Ahmet Melek <[email protected]>
1 parent ccf0477 commit 389dbb6
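
The path change described in the summary follows one pattern in the `MANIFEST.in` diff: each flat `requirements/ingest-<name>.in` entry that is removed reappears as `requirements/ingest/<name>.in`. A minimal Python sketch of that mapping is shown here for illustration only; the helper name `to_ingest_subdir` is hypothetical and is not part of the repository.

```python
def to_ingest_subdir(old_path: str) -> str:
    """Map a flat ingest requirements path to the subdirectory layout.

    Hypothetical helper: it mirrors the renaming visible in the diff,
    not any function in the project itself.
    """
    prefix = "requirements/ingest-"
    # Paths that are not flat ingest entries are returned unchanged.
    if not old_path.startswith(prefix):
        return old_path
    return "requirements/ingest/" + old_path[len(prefix):]


if __name__ == "__main__":
    for path in ("requirements/ingest-s3.in", "requirements/base.in"):
        print(path, "->", to_ingest_subdir(path))
```

Entries such as `requirements/ingest/hubspot.in` have no flat counterpart in the old manifest; those are the newly added files rather than renamed ones.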

2 files changed, +57 -22 lines changed

Diff for: CHANGELOG.md

+3 -1
@@ -19,10 +19,12 @@
 * **Handle common incorrect arguments for `languages` and `ocr_languages`** Users are regularly receiving errors on the API because they are defining `ocr_languages` or `languages` with additional quotationmarks, brackets, and similar mistakes. This update handles common incorrect arguments and raises an appropriate warning.
 * **Default `hi_res_model_name` now relies on `unstructured-inference`** When no explicit `hi_res_model_name` is passed into `partition` or `partition_pdf_or_image` the default model is picked by `unstructured-inference`'s settings or os env variable `UNSTRUCTURED_HI_RES_MODEL_NAME`; it now returns the same model name regardless of `infer_table_structure`'s value; this function will be deprecated in the future and the default model name will simply rely on `unstructured-inference` and will not consider os env in a future release.
 * **Fix remove Vectara requirements from setup.py - there are no dependencies **
+* **Add missing dependency files to package manifest**. Updates the file path for the ingest
+dependencies and adds missing extra dependencies.
+* **Fix remove Vectara requirements from setup.py - there are no dependencies **
 * **Add title to Vectara upload - was not separated out from initial connector **
 * **Fix change OpenSearch port to fix potential conflict with Elasticsearch in ingest test**
 
-
 ## 0.12.3
 
 ### Enhancements

Diff for: MANIFEST.in

+54 -21
@@ -1,23 +1,56 @@
+# Base requirements and constraints
 include requirements/base.in
+include requirements/constraints.in
+
+# Unstructured library extras
+include requirements/extra-csv.in
+include requirements/extra-docx.in
+include requirements/extra-epub.in
+include requirements/extra-markdown.in
+include requirements/extra-msg.in
+include requirements/extra-odt.in
+include requirements/extra-paddleocr.in
+include requirements/extra-pandoc.in
+include requirements/extra-pdf-image.in
+include requirements/extra-pptx.in
+include requirements/extra-xlsx.in
 include requirements/huggingface.in
-include requirements/ingest-s3.in
-include requirements/ingest-azure.in
-include requirements/ingest-discord.in
-include requirements/ingest-github.in
-include requirements/ingest-gitlab.in
-include requirements/ingest-reddit.in
-include requirements/ingest-notion.in
-include requirements/ingest-slack.in
-include requirements/ingest-wikipedia.in
-include requirements/ingest-google-drive.in
-include requirements/ingest-gcs.in
-include requirements/ingest-elasticsearch.in
-include requirements/ingest-opensearch.in
-include requirements/ingest-dropbox.in
-include requirements/ingest-box.in
-include requirements/ingest-onedrive.in
-include requirements/ingest-outlook.in
-include requirements/ingest-confluence.in
-include requirements/ingest-airtable.in
-include requirements/ingest-sharepoint.in
-include requirements/ingest-mongodb.in
+
+# Ingest extras
+include requirements/ingest/airtable.in
+include requirements/ingest/azure-cognitive-search.in
+include requirements/ingest/azure.in
+include requirements/ingest/biomed.in
+include requirements/ingest/box.in
+include requirements/ingest/chroma.in
+include requirements/ingest/confluence.in
+include requirements/ingest/databricks-volumes.in
+include requirements/ingest/delta-table.in
+include requirements/ingest/discord.in
+include requirements/ingest/dropbox.in
+include requirements/ingest/elasticsearch.in
+include requirements/ingest/embed-aws-bedrock.in
+include requirements/ingest/embed-huggingface.in
+include requirements/ingest/embed-openai.in
+include requirements/ingest/gcs.in
+include requirements/ingest/github.in
+include requirements/ingest/gitlab.in
+include requirements/ingest/google-drive.in
+include requirements/ingest/hubspot.in
+include requirements/ingest/jira.in
+include requirements/ingest/mongodb.in
+include requirements/ingest/notion.in
+include requirements/ingest/onedrive.in
+include requirements/ingest/opensearch.in
+include requirements/ingest/outlook.in
+include requirements/ingest/pinecone.in
+include requirements/ingest/postgres.in
+include requirements/ingest/qdrant.in
+include requirements/ingest/reddit.in
+include requirements/ingest/s3.in
+include requirements/ingest/salesforce.in
+include requirements/ingest/sftp.in
+include requirements/ingest/sharepoint.in
+include requirements/ingest/slack.in
+include requirements/ingest/weaviate.in
+include requirements/ingest/wikipedia.in
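
Because every entry in this manifest is an explicit `include` line, a stale path (such as the old flat `requirements/ingest-*.in` names) can go unnoticed until a file is missing from the distribution. One way to catch that early is a small check that each listed path exists. The following is a sketch under stated assumptions, not something this commit adds: the function name `missing_manifest_includes` is hypothetical, and it handles only plain `include` commands, ignoring other `MANIFEST.in` commands.

```python
from __future__ import annotations

from pathlib import Path


def missing_manifest_includes(manifest_text: str, root: Path) -> list[str]:
    """Return patterns from plain `include` lines that match no file under root."""
    missing = []
    for raw_line in manifest_text.splitlines():
        line = raw_line.strip()
        # Skip blank lines and comments such as "# Ingest extras".
        if not line or line.startswith("#"):
            continue
        command, *patterns = line.split()
        # Only plain `include` is checked in this sketch.
        if command != "include":
            continue
        for pattern in patterns:
            # Path.glob handles literal relative paths as well as wildcards.
            if not any(root.glob(pattern)):
                missing.append(pattern)
    return missing


if __name__ == "__main__":
    manifest = Path("MANIFEST.in")
    if manifest.exists():
        for entry in missing_manifest_includes(manifest.read_text(), Path(".")):
            print(f"missing: {entry}")
```

Run from the repository root, this prints any `include` entry that no longer points at a real file.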
