You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: backend/src/llm/prompts.py
+5-5Lines changed: 5 additions & 5 deletions
Original file line number
Diff line number
Diff line change
@@ -85,11 +85,11 @@ def build_intent_extraction_prompt(user_message: str, conversation_history: list
85
85
- "summarize document" or "summarization" → use_case: summarization_short or long_document_summarization
86
86
87
87
Priority extraction (for scoring weights - use "medium" as baseline, adjust based on context):
88
-
- accuracy_priority: "high" if accuracy/quality seems important for the use case, "low" if user indicates good-enough is fine
89
-
- cost_priority: Default to "medium". Only set "high" if user explicitly mentions budget constraints or cost sensitivity. Only set "low" if user explicitly says cost doesn't matter or budget is unlimited.
90
-
- latency_priority: "high" if the use case requires fast responses (e.g., real-time, interactive), "low" if async/batch is acceptable
91
-
- complexity_priority: "high" if user wants simple deployment, "low" if they're okay with complex setups
92
-
Explicit user statements override inferences (e.g., "cost is critical" → cost_priority: high)
88
+
- accuracy_priority: "high" if user mentions accuracy matters, quality is important, accuracy is critical, best model, or top quality. "low" if user says goodenough or accuracy less important.
89
+
- cost_priority: "high" if user mentions cost-effective, cost-sensitive, budget constrained, minimize cost, cost is important, or budget is tight. "low" if user says cost doesn't matter or budget is unlimited. Default to "medium" if not mentioned.
90
+
- latency_priority: "high" if the use case requires fast responses (e.g., real-time, interactive, instant). "low" if async/batch is acceptable.
91
+
- complexity_priority: "high" if user wants simple deployment, easy setup. "low" if they're okay with complex setups.
0 commit comments