Skip to content

Commit e9c2f60

Browse files
davanstrienclaude
andcommitted
Add screenshots to README: ELO scatter chart and leaderboard viewer
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent 0ab2ca7 commit e9c2f60

File tree

4 files changed

+5
-0
lines changed

4 files changed

+5
-0
lines changed

.gitignore

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,7 @@ dist/
66

77
# Screenshots and temp files from dev/testing
88
*.png
9+
!assets/*.png
910
*.jpeg
1011
*.json
1112
!pyproject.toml

README.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,8 @@ ocr-bench lets you run the same set of OCR models on a sample of _your_ collecti
2222

2323
Rankings can flip completely between collections.
2424

25+
![ELO vs Parameter Count — smaller models can win on the right documents](assets/elo-scatter.png)
26+
2527
## Hub-native by design
2628

2729
The entire evaluation loop lives on the Hugging Face Hub:
@@ -79,6 +81,8 @@ ocr-bench run <dataset> <output> --models glm-ocr lighton-ocr-2
7981

8082
## Example results
8183

84+
![Leaderboard viewer with ELO ratings, confidence intervals, and human validation](assets/leaderboard.png)
85+
8286
Browse these on the Hub:
8387

8488
- [davanstrien/ocr-bench-britannica-results-qwen35](https://huggingface.co/datasets/davanstrien/ocr-bench-britannica-results-qwen35) — Encyclopaedia Britannica 1771, 5 models, 50 samples

assets/elo-scatter.png

86.9 KB
Loading

assets/leaderboard.png

189 KB
Loading

0 commit comments

Comments
 (0)