Commit cfea831

chore(web): wire vercel deploy — static export, tracked generated JSONs
vercel.json runs `next build` directly (skipping the `pnpm content` step) so deploys work from a clean clone where the gitignored eval data is absent. The generated JSONs are now tracked, and the content script preserves them when sources are missing instead of clobbering them with empty fallbacks. Pinned pnpm + node engines for reproducible Vercel builds.
1 parent 17762ce commit cfea831
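
The preserve-on-missing-source behaviour described in the commit message could look roughly like the sketch below. This is not the repository's content script: `writeGenerated`, its parameters, and the `WriteResult` labels are hypothetical names chosen for illustration, and only Node's standard `fs` and `path` modules are used.

```typescript
import * as fs from "node:fs";
import * as path from "node:path";

// Hypothetical outcome labels, for illustration only.
type WriteResult = "generated" | "preserved" | "fallback";

// Hypothetical helper: regenerate outPath from sourcePath when the source
// exists; otherwise keep whatever tracked file is already on disk.
function writeGenerated(
  sourcePath: string,
  outPath: string,
  build: (raw: string) => unknown,
  fallback: unknown = [],
): WriteResult {
  fs.mkdirSync(path.dirname(outPath), { recursive: true });

  if (fs.existsSync(sourcePath)) {
    // Source data is present: rebuild the generated JSON from it.
    const data = build(fs.readFileSync(sourcePath, "utf8"));
    fs.writeFileSync(outPath, JSON.stringify(data, null, 2) + "\n");
    return "generated";
  }

  if (fs.existsSync(outPath)) {
    // Source is missing (e.g. a clean clone): leave the tracked file untouched.
    return "preserved";
  }

  // Neither source nor previous output exists: write the empty fallback.
  fs.writeFileSync(outPath, JSON.stringify(fallback, null, 2) + "\n");
  return "fallback";
}
```

The order of the checks is the point: the empty fallback is written only when there is nothing on disk to lose.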

11 files changed

Lines changed: 3423 additions & 22 deletions

.gitignore

Lines changed: 0 additions & 1 deletion
@@ -82,7 +82,6 @@ Thumbs.db
 web/node_modules/
 web/.next/
 web/out/
-web/data/*.generated.json
 web/playwright-report/
 web/test-results/
 web/lh.json

web/.gitignore

Lines changed: 0 additions & 1 deletion
@@ -2,7 +2,6 @@ node_modules/
 .next/
 out/
 *.tsbuildinfo
-data/*.generated.json
 .env*.local
 playwright-report/
 test-results/
Lines changed: 52 additions & 0 deletions
@@ -0,0 +1,52 @@
+[
+  {
+    "name": "v4_mixed_warm",
+    "temperature": 0.2,
+    "top_p": 0.9,
+    "repetition_penalty": 1.1,
+    "no_repeat_ngram_size": 0,
+    "seed": 4040,
+    "uiLabel": "P-1",
+    "isDefault": false
+  },
+  {
+    "name": "v4_mixed_warmer",
+    "temperature": 0.15,
+    "top_p": 0.9,
+    "repetition_penalty": 1.125,
+    "no_repeat_ngram_size": 0,
+    "seed": 4141,
+    "uiLabel": "P-2",
+    "isDefault": false
+  },
+  {
+    "name": "v5_seed_a",
+    "temperature": 0.1,
+    "top_p": 0.9,
+    "repetition_penalty": 1.15,
+    "no_repeat_ngram_size": 0,
+    "seed": 5151,
+    "uiLabel": "P-3",
+    "isDefault": true
+  },
+  {
+    "name": "v5_seed_b",
+    "temperature": 0.1,
+    "top_p": 0.9,
+    "repetition_penalty": 1.15,
+    "no_repeat_ngram_size": 0,
+    "seed": 5252,
+    "uiLabel": "P-4",
+    "isDefault": false
+  },
+  {
+    "name": "v5_seed_c",
+    "temperature": 0.1,
+    "top_p": 0.9,
+    "repetition_penalty": 1.15,
+    "no_repeat_ngram_size": 0,
+    "seed": 5353,
+    "uiLabel": "P-5",
+    "isDefault": false
+  }
+]
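
For readers consuming this file from the web app, a typed view of the records might look like the sketch below. The field names mirror the JSON in this diff; the `DecoderProfile` interface and the `pickDefault` helper are hypothetical and not part of the commit. The helper assumes exactly one entry carries `"isDefault": true`, which holds for the five entries shown here.

```typescript
// Field names mirror the generated JSON; the type name itself is hypothetical.
interface DecoderProfile {
  name: string;
  temperature: number;
  top_p: number;
  repetition_penalty: number;
  no_repeat_ngram_size: number;
  seed: number;
  uiLabel: string;
  isDefault: boolean;
}

// Hypothetical helper: return the single profile flagged as default,
// and fail loudly if the data has zero or several defaults.
function pickDefault(profiles: DecoderProfile[]): DecoderProfile {
  const [only, ...rest] = profiles.filter((p) => p.isDefault);
  if (only === undefined || rest.length > 0) {
    throw new Error("expected exactly one profile with isDefault: true");
  }
  return only;
}
```

Failing on a malformed default rather than silently picking the first entry keeps a bad regeneration of the JSON from going unnoticed in the UI.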

web/data/failures.generated.json

Lines changed: 62 additions & 0 deletions
@@ -0,0 +1,62 @@
+[
+  {
+    "num": "Case 01",
+    "type": "Hallucinated number",
+    "caughtBy": "Caught by Grounder",
+    "headline": [
+      {
+        "text": "A claim with "
+      },
+      {
+        "text": "no Japanese span",
+        "em": true
+      },
+      {
+        "text": " gets refused."
+      }
+    ],
+    "claim": "Pass-2 drafted a sentence asserting a revenue figure that no Pass-1 span backed. The Citation-Grounder replaces the sentence with [evidence insufficient].",
+    "outputBlock": "draft → \"FY25 revenue is forecast at ¥12.4 trillion.\"\nground → [evidence insufficient]",
+    "customId": "fraud_detection-00467"
+  },
+  {
+    "num": "Case 02",
+    "type": "Ambiguous span",
+    "caughtBy": "Resolved by Critic",
+    "headline": [
+      {
+        "text": "Two candidates disagree — the "
+      },
+      {
+        "text": "judge picks the tighter span",
+        "em": true
+      },
+      {
+        "text": "."
+      }
+    ],
+    "claim": "Two of five decoder profiles cited adjacent Japanese spans that overlap. The bo-5 judge selected the candidate whose citations matched a unique Pass-1 span.",
+    "outputBlock": "v4_mixed_warm → score 3 (overlap)\nv5_seed_a → score 4 ★ (unique span)",
+    "customId": "fraud_detection-00580"
+  },
+  {
+    "num": "Case 03",
+    "type": "Contradictory signal",
+    "caughtBy": "Logged · escalated",
+    "headline": [
+      {
+        "text": "OCF up while DSO stretches — "
+      },
+      {
+        "text": "flagged, not muted",
+        "em": true
+      },
+      {
+        "text": "."
+      }
+    ],
+    "claim": "Pass-1 detected a positive operating-cash-flow swing alongside DSO drift. The memo surfaces the tension as a risk note rather than smoothing it away.",
+    "outputBlock": "OCF +2.1B (positive)\nDSO +6 days (negative)\nRisk note ★ kept",
+    "customId": "industry_prediction_v2-00119"
+  }
+]
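
Each `headline` above is a list of text segments, some flagged with `"em": true`. One way a consumer might flatten such a headline is sketched below. The interfaces mirror the JSON fields in this diff, while the type names and the `headlineToMarkdown` and `summarize` helpers are hypothetical; how the actual site renders these segments is not shown in the commit.

```typescript
// Shapes inferred from failures.generated.json; type names are hypothetical.
interface HeadlineSegment {
  text: string;
  em?: boolean; // present and true only on emphasized segments
}

interface FailureCase {
  num: string;
  type: string;
  caughtBy: string;
  headline: HeadlineSegment[];
  claim: string;
  outputBlock: string;
  customId: string;
}

// Hypothetical helper: join segments, wrapping emphasized ones in asterisks.
function headlineToMarkdown(segments: HeadlineSegment[]): string {
  return segments.map((s) => (s.em ? `*${s.text}*` : s.text)).join("");
}

// Hypothetical helper: one-line summary of a case for logs or listings.
function summarize(c: FailureCase): string {
  return `${c.num} (${c.type}): ${headlineToMarkdown(c.headline)}`;
}
```

Applied to the headline of Case 01, `headlineToMarkdown` yields `A claim with *no Japanese span* gets refused.`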
