Skip to content

Commit 796b333

Browse files
docs: fix benchmark dataset spacing (closes #196)
1 parent 9968cd2 commit 796b333

File tree

4 files changed

+8
-0
lines changed

4 files changed

+8
-0
lines changed

README.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -437,6 +437,7 @@ This benchmark tests **LLM comprehension and data retrieval accuracy** across di
437437
Eleven datasets designed to test different structural patterns and validation capabilities:
438438

439439
**Primary datasets:**
440+
440441
1. **Tabular** (100 employee records): Uniform objects with identical fields – optimal for TOON's tabular format.
441442
2. **Nested** (50 e-commerce orders): Complex structures with nested customer objects and item arrays.
442443
3. **Analytics** (60 days of metrics): Time-series data with dates and numeric values.
@@ -445,6 +446,7 @@ Eleven datasets designed to test different structural patterns and validation ca
445446
6. **Nested Config** (1 configuration): Deeply nested configuration with minimal tabular eligibility.
446447

447448
**Structural validation datasets:**
449+
448450
7. **Control**: Valid complete dataset (baseline for validation)
449451
8. **Truncated**: Array with 3 rows removed from end (tests `[N]` length detection)
450452
9. **Extra rows**: Array with 3 additional rows beyond declared length

benchmarks/results/retrieval-accuracy.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -278,6 +278,7 @@ This benchmark tests **LLM comprehension and data retrieval accuracy** across di
278278
Eleven datasets designed to test different structural patterns and validation capabilities:
279279

280280
**Primary datasets:**
281+
281282
1. **Tabular** (100 employee records): Uniform objects with identical fields – optimal for TOON's tabular format.
282283
2. **Nested** (50 e-commerce orders): Complex structures with nested customer objects and item arrays.
283284
3. **Analytics** (60 days of metrics): Time-series data with dates and numeric values.
@@ -286,6 +287,7 @@ Eleven datasets designed to test different structural patterns and validation ca
286287
6. **Nested Config** (1 configuration): Deeply nested configuration with minimal tabular eligibility.
287288

288289
**Structural validation datasets:**
290+
289291
7. **Control**: Valid complete dataset (baseline for validation)
290292
8. **Truncated**: Array with 3 rows removed from end (tests `[N]` length detection)
291293
9. **Extra rows**: Array with 3 additional rows beyond declared length

benchmarks/src/report.ts

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -284,6 +284,7 @@ This benchmark tests **LLM comprehension and data retrieval accuracy** across di
284284
Eleven datasets designed to test different structural patterns and validation capabilities:
285285
286286
**Primary datasets:**
287+
287288
1. **Tabular** (${tabularSize} employee records): Uniform objects with identical fields – optimal for TOON's tabular format.
288289
2. **Nested** (${nestedSize} e-commerce orders): Complex structures with nested customer objects and item arrays.
289290
3. **Analytics** (${analyticsSize} days of metrics): Time-series data with dates and numeric values.
@@ -292,6 +293,7 @@ Eleven datasets designed to test different structural patterns and validation ca
292293
6. **Nested Config** (${nestedConfigSize} configuration): Deeply nested configuration with minimal tabular eligibility.
293294
294295
**Structural validation datasets:**
296+
295297
7. **Control**: Valid complete dataset (baseline for validation)
296298
8. **Truncated**: Array with 3 rows removed from end (tests \`[N]\` length detection)
297299
9. **Extra rows**: Array with 3 additional rows beyond declared length

docs/guide/benchmarks.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -294,6 +294,7 @@ This benchmark tests **LLM comprehension and data retrieval accuracy** across di
294294
Eleven datasets designed to test different structural patterns and validation capabilities:
295295

296296
**Primary datasets:**
297+
297298
1. **Tabular** (100 employee records): Uniform objects with identical fields – optimal for TOON's tabular format.
298299
2. **Nested** (50 e-commerce orders): Complex structures with nested customer objects and item arrays.
299300
3. **Analytics** (60 days of metrics): Time-series data with dates and numeric values.
@@ -302,6 +303,7 @@ Eleven datasets designed to test different structural patterns and validation ca
302303
6. **Nested Config** (1 configuration): Deeply nested configuration with minimal tabular eligibility.
303304

304305
**Structural validation datasets:**
306+
305307
7. **Control**: Valid complete dataset (baseline for validation)
306308
8. **Truncated**: Array with 3 rows removed from end (tests `[N]` length detection)
307309
9. **Extra rows**: Array with 3 additional rows beyond declared length

0 commit comments

Comments
 (0)