Skip to content

Commit eb6cf2b

Browse files
caitlinwheelesscaitlinwheeless
and
caitlinwheeless
authored
docs: DOC-263: Flexible labeling config for Prompts (#6639)
Co-authored-by: caitlinwheeless <[email protected]>
1 parent 7e2dc79 commit eb6cf2b

File tree

3 files changed

+24
-26
lines changed

3 files changed

+24
-26
lines changed

docs/source/guide/prompts_create.md

+15-9
Original file line numberDiff line numberDiff line change
@@ -108,25 +108,18 @@ From the Prompts page, click **Create Prompt** in the upper right and then compl
108108
| --- | --- |
109109
| Name | Enter a name for the Prompt. |
110110
| Description | Enter a description for the Prompt. |
111-
| Type | Select the Prompt model type: [Text Classification](#Text-classification) or [Named Entity Recognition](#Named-entity-recognition-NER) |
112-
| Target Project| Select the project you want to use. If you don't have any eligible projects, you will see an error message. <br><br>See the note below. |
113-
| Classes | This list is automatically generated from the labeling configuration of the target project. |
111+
| Target Project| Select the project you want to use. If you don't have any eligible projects, you will see an error message. <br>See the note below. <br><br>When you select a project, additional information about the labeling config appears. This includes the classes that will be used when applying the prompt. |
114112

115113
</div>
116114

117115
!!! note Eligible projects
118116
Target projects must meet the following criteria:
119-
* The labeling configuration for the project must be compatible with the **Type** you selected above:
120-
* For text classification, this means that the labeling configuration for the project must use `Choice` tags.
121-
* For NER, this means that the labeling configuration for the project must use `Label` tags.
122-
* The project must have one output type (`Choice` or `Label`) and not a mix of both.
123-
* The project cannot include multiple `Choices` or `Labels` blocks in its labeling configuration.
124117
* The project must include text data. While it can include other data types such as images or video, it must include `<Text>`.
125118
* You must have access to the project. If you are in the Manager role, you need to be added to the project to have access.
126119
* The project cannot be located in your Personal Sandbox workspace.
127120
* While projects connected to an ML backend will still appear in the list of eligible projects, we do not recommend using Prompts with an ML backend as this can interfere with how accuracy and score are calculated when evaluating the prompt.
128121

129-
## Types
122+
## Example project types
130123

131124
### Text classification
132125

@@ -154,3 +147,16 @@ Some examples include:
154147
* **Intelligence and Risk Analysis**: By extracting entities such as personal names, organizations, IP addresses, and financial transactions from suspicious activity reports or communications, organizations can better assess risks and detect fraud or criminal activity.
155148
* **Specialized Document Review**: Once trained, NER can help extract industry-specific key entities for better document review, searching, and classification.
156149
* **Customer Feedback and Product Review**: Extract named entities like product names, companies, or services from customer feedback or reviews. This allows businesses to categorize and analyze feedback based on specific products, people, or regions, helping them make data-driven improvements.
150+
151+
### Text summarization
152+
153+
Text summarization involves condensing large amounts of information into concise, meaningful summaries.
154+
155+
Models can be trained or fine-tuned to recognize essential information within a document and generate summaries that retain the core ideas while omitting less critical details. This capability is especially valuable in today’s information-heavy landscape, where professionals across various fields are often overwhelmed by the sheer volume of text data.
156+
157+
Some examples include:
158+
159+
* **Customer Support and Feedback Analysis**: Companies receive vast volumes of customer support tickets, reviews, and feedback that are often repetitive or lengthy. Auto-labeling can help summarize these inputs, focusing on core issues or themes, such as “billing issues” or “technical support.”
160+
* **News Aggregation and Media Monitoring**: News organizations and media monitoring platforms need to process and distribute news stories efficiently. Auto-labeling can summarize articles while tagging them with labels like “politics,” “economy,” or “health,” making it easier for users to find relevant stories.
161+
* **Document Summarization**: Professionals often need to quickly understand the key points in lengthy contracts, research papers, and files.
162+
* **Educational Content Summarization**: EEducators and e-learning platforms need to distill complex material into accessible summaries for students. Auto-labeling can summarize key topics and categorize them under labels like “concept,” “example,” or “important fact.”

docs/source/guide/prompts_draft.md

+6-14
Original file line numberDiff line numberDiff line change
@@ -60,7 +60,7 @@ When you evaluate a prompt, you will see the following metrics:
6060
<tr>
6161
<th>Metric</th>
6262
<th>Tasks</th>
63-
<th>Type</th>
63+
<th>Config</th>
6464
<th>Description</th>
6565
</tr>
6666
</thead>
@@ -76,9 +76,7 @@ Ground Truths
7676
</td>
7777
<td>
7878

79-
Text classification
80-
81-
NER
79+
All labeling configs
8280

8381
</td>
8482
<td>
@@ -101,9 +99,7 @@ All task types
10199
</td>
102100
<td>
103101

104-
Text classification
105-
106-
NER
102+
All labeling configs
107103

108104
</td>
109105
<td>
@@ -124,9 +120,9 @@ Ground Truths
124120
</td>
125121
<td>
126122

127-
Text classification
123+
Single `<Choices>`
128124

129-
NER
125+
Single `<Labels>`
130126

131127
</td>
132128
<td>
@@ -159,9 +155,7 @@ All task types
159155
</td>
160156
<td>
161157

162-
Text classification
163-
164-
NER
158+
All labeling configs
165159

166160
</td>
167161
<td>
@@ -195,8 +189,6 @@ Auto-refinement applies your initial prompt and the Teacher Model to generate pr
195189
Your Teacher Model evaluates the initial prompt’s predictions against the ground truth (or sample task output) and identifies areas for improvement. It then suggests a refined prompt, aimed at achieving closer alignment with the desired outcomes.
196190

197191

198-
199-
200192
## Drafting effective prompts
201193

202194
For a comprehensive guide to drafting prompts, see [The Prompt Report: A Systematic Survey of Prompting Techniques](https://arxiv.org/abs/2406.06608) or OpenAI's guide to [Prompt Engineering](https://platform.openai.com/docs/guides/prompt-engineering).

docs/source/guide/prompts_overview.md

+3-3
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,6 @@ With Prompts, you can:
2626

2727
## Use cases
2828

29-
3029
### Auto-labeling with Prompts
3130

3231
Prompts allows you to leverage LLMs to swiftly generate accurate predictions, enabling instant labeling of thousands of tasks.
@@ -133,9 +132,10 @@ This feedback loop allows you to iteratively fine-tune your prompts, optimizing
133132
| Feature | Support |
134133
| --- | --- |
135134
| **Supported data types** | Text |
136-
| **Supported model types** | Text Classification <br>Named Entity Recognition (NER) |
135+
| **Supported object tags** | `Text` <br>`HyperText` |
136+
| **Supported control tags** | `Choices` <br>`Labels` <br>`TextArea` <br>`Pairwise` |
137137
| **Class selection** | Multi-selection (the LLM can apply multiple labels per task)|
138-
| **Supported base models** | OpenAI gpt-3.5-turbo-16k <br>OpenAI gpt-3.5-turbo-instruct* <br>OpenAI gpt-4-turbo <br>OpenAI gpt-3.5-turbo <br>OpenAI gpt-4o <br>OpenAI gpt-4<br>[Azure OpenAI chat-based models](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models)<br>[Custom LLM](prompts_create#Add-OpenAI-Azure-OpenAI-or-a-custom-model) |
138+
| **Supported base models** | OpenAI gpt-3.5-turbo-16k* <br>OpenAI gpt-3.5-turbo* <br>OpenAI gpt-4 <br>OpenAI gpt-4-turbo <br>OpenAI gpt-4o <br>OpenAI gpt-4o-mini<br>[Azure OpenAI chat-based models](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models)<br>[Custom LLM](prompts_create#Add-OpenAI-Azure-OpenAI-or-a-custom-model)<br><br>**Note:** We recommend against using GPT 3.5 models, as these can sometimes be prone to rate limit errors. |
139139
| **Text compatibility** | Task text must be utf-8 compatible |
140140
| **Task size** | Total size of each task can be no more than 1MB (approximately 200-500 pages of text) |
141141
| **Network access** | If you are using a firewall or restricting network access to your OpenAI models, you will need to allow the following IPs: <br>3.219.3.197 <br>34.237.73.3 <br>4.216.17.242 |

0 commit comments

Comments
 (0)