You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG.md
+3Lines changed: 3 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,6 +6,9 @@ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
6
6
7
7
## [Unreleased]
8
8
9
+
### Fixed
10
+
-**Removed the Wikipedia-specific corpus statistic from the default `summarize` prompt.** The `CORPUS AWARENESS` rule shipped a hard-coded "~6,900 Featured Articles" example that described only the Wikipedia demo, reached every site using the default prompt, and taught the model to fabricate corpus counts (observed confabulating stats on unrelated demos). The example is now count-free and frames gaps via the site description's scope, and the rule explicitly forbids inventing statistics (counts, totals, sizes). The compiled WASM must be rebuilt so the client-side AI path picks up the new text. ([tag1consulting/scolta-core#33](https://github.com/tag1consulting/scolta-core/issues/33))
11
+
9
12
### Removed
10
13
-**Reverted query-word-importance scoring weight (#31).** Removed the `incidental_match_weight` config and the per-query-word importance weighting from `score_results`/`batch_score_results`. Validation showed the weighting was inert — it changed result ordering on zero real queries — so the scorer returns to counting every matched query term equally, as it did before #31.
Copy file name to clipboardExpand all lines: src/prompts.rs
+33-1Lines changed: 33 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -61,7 +61,7 @@ METADATA RULES:
61
61
GROUNDING CHECK:
62
62
- Use ONLY information from the provided excerpts. Do not draw on training knowledge to describe, infer, or fill gaps for anything not explicitly in the excerpts.
63
63
- If a detail is not in the excerpts, omit it — never estimate or invent it.
64
-
- CORPUS AWARENESS: You are searching a specific collection described above, not the entire internet or a complete knowledge base. When few or no results match the query, explain this honestly by referencing the collection scope from the site description — e.g., "This collection of ~6,900 Featured Articles doesn't include a dedicated article on [topic]" or "The [site name] covers [scope] — [topic] may fall outside that focus." Do NOT pretend the collection should have the answer. Do NOT redirect to external sources. Suggest related terms the user could try within THIS collection.
64
+
- CORPUS AWARENESS: You are searching a specific collection described above, not the entire internet or a complete knowledge base. When few or no results match the query, explain this honestly by referencing the collection's scope from the site description — e.g., "[site name] focuses on [scope], so it doesn't include a dedicated article on [topic]" or "[topic] may fall outside what this collection covers." Do NOT invent statistics about the collection (article counts, totals, sizes); describe its scope qualitatively from the site description, never with a number you cannot verify. Do NOT pretend the collection should have the answer. Do NOT redirect to external sources. Suggest related terms the user could try within THIS collection.
65
65
- When results are only tangentially related to the query, still try to help — present what the collection DOES have and extract whatever is genuinely useful. But be upfront that the results are indirect: "This collection doesn't have a dedicated article on [topic], but here's what I found in related articles:" is better than presenting tangential results as if they directly answer the question. The attempt to help is valuable; the honesty about the gap is what prevents confusion.
66
66
67
67
Tone: Direct, expert, helpful. Like a knowledgeable friend who has reviewed the options for you."#;
@@ -257,6 +257,38 @@ mod tests {
257
257
assert!(resolved.contains("Do not mention competitors."));
0 commit comments