Commit c41b70c
v0.3.2: PyMuPDF extractor, DBLP always-on for low-scoring refs
- Switch primary PDF text extractor to PyMuPDF (fitz); fixes multi-column
layout garbling (Cinkusz DOI now found, Kostka/Tran titles correct)
- Fall back to pdfminer when PyMuPDF is not installed
- Add pymupdf as a core dependency
- Fix DBLP trigger: query domain sources whenever best candidate score is
below pass threshold, not only when candidates list is empty; fixes
Leviathan (ICML) and ReAct (ICLR) reliably finding via DBLP
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>1 parent ddcc096 commit c41b70c
7 files changed
Lines changed: 36 additions & 5 deletions
File tree
- citesentry
- __pycache__
- checks
- __pycache__
- parse
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
3 | | - | |
| 3 | + | |
Binary file not shown.
Binary file not shown.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
163 | 163 | | |
164 | 164 | | |
165 | 165 | | |
166 | | - | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
| 169 | + | |
167 | 170 | | |
168 | 171 | | |
169 | 172 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
17 | 17 | | |
18 | 18 | | |
19 | 19 | | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
20 | 29 | | |
21 | 30 | | |
22 | 31 | | |
23 | 32 | | |
24 | | - | |
| 33 | + | |
25 | 34 | | |
26 | 35 | | |
27 | 36 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
4 | 4 | | |
5 | 5 | | |
6 | 6 | | |
7 | | - | |
| 7 | + | |
8 | 8 | | |
9 | 9 | | |
10 | 10 | | |
| |||
20 | 20 | | |
21 | 21 | | |
22 | 22 | | |
| 23 | + | |
23 | 24 | | |
24 | 25 | | |
25 | 26 | | |
| |||
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.
0 commit comments