Skip to content

Commit 3954503

Browse files
authored
Update Pinecone index embeddings behaviors, remove outdated how-to video (#777)
1 parent f0bb85a commit 3954503

File tree

3 files changed

+43
-42
lines changed

3 files changed

+43
-42
lines changed

examplecode/tools/vectorshift.mdx

Lines changed: 0 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -13,16 +13,6 @@ enables you to use GPT-4o-mini to chat in real time with a PDF document that is
1313

1414
## Prerequisites
1515

16-
<iframe
17-
width="560"
18-
height="315"
19-
src="https://www.youtube.com/embed/Li0yhaeguYQ"
20-
title="YouTube video player"
21-
frameborder="0"
22-
allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture"
23-
allowfullscreen
24-
></iframe>
25-
2616
import PineconeShared from '/snippets/general-shared-text/pinecone.mdx';
2717
import GetStartedSimpleAPIOnly from '/snippets/general-shared-text/get-started-simple-api-only.mdx';
2818

Lines changed: 42 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -1,29 +1,28 @@
11
- A Pinecone account. [Get an account](https://app.pinecone.io/).
2+
- A Pinecone API key in your Pinecone account. To create an API key, do the following:
23

3-
<iframe
4-
width="560"
5-
height="315"
6-
src="https://www.youtube.com/embed/uPoa7PLcuZ4"
7-
title="YouTube video player"
8-
frameborder="0"
9-
allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture"
10-
allowfullscreen
11-
></iframe>
4+
1. [Sign in to your Pinecone account and open the API keys page](https://app.pinecone.io/organizations/-/projects/-/keys).
5+
2. Click **+ API key**.
6+
3. For **API key name**, enter some descriptive name for the API key.
7+
4. Click **Create key**.
8+
5. Copy the generated API key to some secure location. You will not be able to access this API key again after you close the dialog.
129

13-
- A Pinecone API key. [Get an API key](https://docs.pinecone.io/guides/get-started/authentication#find-your-pinecone-api-key).
14-
- A Pinecone serverless index. [Create a serverless index](https://docs.pinecone.io/guides/indexes/create-an-index).
10+
- A Pinecone serverless index in your Pinecone account.
1511

16-
An existing index is not required. At runtime, the index behavior is as follows:
12+
Creating a serverless index on your own is optional.
13+
An index is not required to exist in advance.
14+
15+
When you set up the connector, at runtime, the index behavior is as follows:
1716

1817
For the [Unstructured UI](/ui/overview) and [Unstructured API](/api-reference/overview):
19-
20-
- If an existing index name is specified, and Unstructured generates embeddings,
21-
but the number of dimensions that are generated does not match the existing index's embedding settings, the run will fail.
22-
You must change your Unstructured embedding settings or your existing index's embedding settings to match, and try the run again.
23-
- If an index name is not specified, Unstructured creates a new index in your Pinecone account. If Unstructured generates embeddings,
24-
the new index's name will be `u<short-workflow-id>-<short-embedding-model-name>-<number-of-dimensions>`.
25-
If Unstructured does not generate embeddings, the new index's name will be `u<short-workflow-id`.
2618

19+
- Your workflow must contain an embedder node, and the embedder node must specify the embedding model that Unstructured will use to generate the embeedings.
20+
- If an existing index name is specified,
21+
and the number of dimensions that Unstructured generates does not match the number of dimensions that is specified in the existing index's embedding settings, the run will fail.
22+
You must change the number of dimensions in your workflow's embedder node or your existing index's embedding settings to match, and try the run again.
23+
- If an index name is not specified, Unstructured creates a new index in your Pinecone account. The
24+
new index's name will be `u<short-workflow-id>-<short-embedding-model-name>-<number-of-dimensions>`.
25+
2726
For [Unstructured Ingest](/open-source/ingestion/overview):
2827

2928
- If an existing index name is specified, and Unstructured generates embeddings,
@@ -38,4 +37,28 @@
3837
might be written to the index or, in some cases, the operation could fail altogether.
3938
</Note>
4039

40+
To create a serverless index on your own, do the following:
41+
42+
1. [Sign in to your Pinecone account and open the Create a new index page](https://app.pinecone.io/organizations/-/projects/-/create-index/serverless).
43+
2. For **Enter index name**, enter some descriptive name for the index.
44+
3. For **Configuration**, select the check box labelled **Custom settings**, or click the tile labelled **Manual configuration**.
45+
46+
<Warning>
47+
Do not click any of the other tiles, such as **text-embedding-3-large**. Clicking any of these other tiles will cause Pinecone to generate embeddings instead of
48+
having Unstructured generate them. If Pinecone generates embeddings instead of Unstructured, this could cause any related Unstructured workflows to fail.
49+
</Warning>
50+
51+
4. For **Vector type**, select **Dense**.
52+
5. For **Dimension**, enter the number of dimensions for the embeddings that Unstructured will generate.
53+
54+
<Warning>
55+
The number of dimensions that you enter here must match the number of dimensions for the embedding model that you use in any related Unstructured workflows or ingestion pipelines. If these numbers do not
56+
match in both places, this could cause any related Unstructured workflows or ingestion pipelines to fail.
57+
</Warning>
58+
59+
6. For **Metric**, select **cosine**.
60+
7. Leave **Capacity mode** set to **Serverless**.
61+
8. You can leave **Cloud provider** and **Region** set to their default values, or you can select a cloud provider and region that is closest to you, if available.
62+
9. Click **Create index**.
63+
4164
- Within a Pinecone serverless index, custom [namespaces](https://docs.pinecone.io/guides/index-data/indexing-overview#namespaces) are supported but are not required.

ui/destinations/pinecone.mdx

Lines changed: 1 addition & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -8,19 +8,7 @@ import FirstTimeUIDestinationConnector from '/snippets/general-shared-text/first
88

99
Send processed data from Unstructured to Pinecone.
1010

11-
The following video shows how to fulfill the minimum set of Pinecone requirements:
12-
13-
<iframe
14-
width="560"
15-
height="315"
16-
src="https://www.youtube.com/embed/Li0yhaeguYQ"
17-
title="YouTube video player"
18-
frameborder="0"
19-
allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture"
20-
allowfullscreen
21-
></iframe>
22-
23-
Here are some more details about the requirements:
11+
The requirements are as follows.
2412

2513
import PineconePrerequisites from '/snippets/general-shared-text/pinecone.mdx';
2614

0 commit comments

Comments
 (0)