Commit 33bc432
* feat: Refactor assessment logic to separate journal and conference evaluation (fixes #75)
## Key Changes
### 1. Architectural Refactoring
- Split assessment into separate journal and conference pipelines
- Added _analyze_conference_patterns() method for conference-specific logic
- Added _analyze_journal_patterns() method (refactored from original)
- Shared _calculate_base_metrics() for common calculations
### 2. Conference-Specific Assessment Logic
- Created _check_conference_green_flags() with conference-optimized thresholds:
  * Citation ratios: 50+ (excellent), 20+ (good)
  * Impact thresholds: 100k+ citations (high), 20k+ (significant)
  * Publication volume: 1000+ (major), 100+ (established)
- Created _check_conference_red_flags() with appropriate red flags:
  * Very low citation ratios (<0.5) for conferences with 50+ papers
  * Conference discontinued only after 15+ years (vs 3 for journals)
  * Suspicious volume thresholds adjusted for conference patterns
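
The thresholds listed above could be applied roughly as in the sketch below. This is not the code from the commit: the `ConferenceMetrics` container, its field names, the flag strings, and the definition of citation ratio as citations per paper are all assumptions made for illustration. Only the numeric thresholds come from the list above, and the adjusted volume thresholds are left out because the message does not state them.

```python
from dataclasses import dataclass


@dataclass
class ConferenceMetrics:
    # Hypothetical container; field names are assumptions for this sketch.
    total_publications: int
    total_citations: int
    years_since_last_publication: int

    @property
    def citation_ratio(self) -> float:
        # Assumed definition: average citations per paper.
        if self.total_publications == 0:
            return 0.0
        return self.total_citations / self.total_publications


def check_conference_green_flags(m: ConferenceMetrics) -> list[str]:
    """Collect positive signals using the thresholds from the commit message."""
    flags = []
    if m.citation_ratio >= 50:
        flags.append("excellent citation ratio")
    elif m.citation_ratio >= 20:
        flags.append("good citation ratio")
    if m.total_citations >= 100_000:
        flags.append("high impact")
    elif m.total_citations >= 20_000:
        flags.append("significant impact")
    if m.total_publications >= 1000:
        flags.append("major conference")
    elif m.total_publications >= 100:
        flags.append("established conference")
    return flags


def check_conference_red_flags(m: ConferenceMetrics) -> list[str]:
    """Collect negative signals; the 15-year reading is an assumption."""
    flags = []
    # A low ratio only counts once the venue has at least 50 papers.
    if m.total_publications >= 50 and m.citation_ratio < 0.5:
        flags.append("very low citation ratio")
    # Assumed interpretation: no publications for 15 or more years.
    if m.years_since_last_publication >= 15:
        flags.append("discontinued")
    return flags


large_venue = ConferenceMetrics(2000, 150_000, 0)
small_venue = ConferenceMetrics(60, 12, 1)
print(check_conference_green_flags(large_venue))
print(check_conference_red_flags(small_venue))
```

Keeping green and red checks as separate functions that return plain lists makes each threshold easy to test in isolation.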
### 3. Improved OpenAlex Source Matching
- Enhanced _score_source_match() with better conference handling
- Penalizes single-year conference instances less harshly
- Prioritizes high-impact, well-established venues
- Filters out very low-quality sources (≤2 papers)
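
A minimal sketch of how such a match score might be put together follows. The weights, the year-detection regex, and the function shape are hypothetical and are not taken from the commit. The dictionary keys `display_name`, `type`, `works_count`, and `cited_by_count` follow the OpenAlex Source object, and the ≤2 papers filter comes from the list above.

```python
import re
from typing import Optional

# Matches a four-digit year such as 2019, used here to guess that a
# source is a single-year instance of a venue (an assumed heuristic).
YEAR_PATTERN = re.compile(r"\b(19|20)\d{2}\b")


def score_source_match(source: dict, query: str) -> Optional[float]:
    """Return a match score, or None if the source is filtered out."""
    works = source.get("works_count") or 0
    cited = source.get("cited_by_count") or 0
    if works <= 2:
        return None  # very low-quality source, dropped entirely

    name = (source.get("display_name") or "").lower()
    wanted = query.lower()

    score = 0.0
    if name == wanted:
        score += 1.0
    elif wanted in name:
        score += 0.5

    # Prefer well-established, high-impact venues (hypothetical weights).
    if works >= 1000:
        score += 0.3
    elif works >= 100:
        score += 0.15
    if cited >= 100_000:
        score += 0.3
    elif cited >= 20_000:
        score += 0.15

    # Single-year instances are penalized, but less so for conferences.
    if YEAR_PATTERN.search(name):
        score -= 0.1 if source.get("type") == "conference" else 0.3
    return score


candidates = [
    {"display_name": "CVPR", "type": "conference",
     "works_count": 5000, "cited_by_count": 200_000},
    {"display_name": "CVPR 2019 Workshops", "type": "conference",
     "works_count": 40, "cited_by_count": 300},
    {"display_name": "CVPR", "type": "conference",
     "works_count": 2, "cited_by_count": 10},
]
scored = [(score_source_match(c, "CVPR"), c) for c in candidates]
best = max((s for s in scored if s[0] is not None), key=lambda s: s[0])
print(best[1]["display_name"], best[1]["works_count"])
```

Returning `None` for filtered sources, rather than a score of zero, keeps "not a candidate" distinct from "a weak candidate".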
### 4. Source Type Detection
- Added source_type field to OpenAlex data enrichment
- Automatic routing: conferences → conference assessment, others → journal assessment
- Publication type included in backend result data
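
The routing described above can be pictured as a small dispatch on `source_type`. In this sketch the two analyzer functions are stand-in stubs and the `publication_type` result key is a hypothetical name. Only the rule that conferences go to the conference assessment and everything else goes to the journal assessment comes from the commit message.

```python
def analyze_conference_patterns(data: dict) -> dict:
    # Stub standing in for the conference-specific pipeline.
    return {"pipeline": "conference"}


def analyze_journal_patterns(data: dict) -> dict:
    # Stub standing in for the original journal pipeline.
    return {"pipeline": "journal"}


def assess(openalex_data: dict) -> dict:
    """Route enriched OpenAlex data to the matching assessment pipeline."""
    source_type = openalex_data.get("source_type")
    if source_type == "conference":
        result = analyze_conference_patterns(openalex_data)
    else:
        # Journals and any unknown or missing type fall back to the
        # journal assessment, which preserves the earlier behavior.
        result = analyze_journal_patterns(openalex_data)
    # Hypothetical key name for exposing the type in the result data.
    result["publication_type"] = source_type or "journal"
    return result


print(assess({"source_type": "conference"}))
print(assess({"source_type": "repository"}))
print(assess({}))
```

Defaulting to the journal pipeline means that sources without a detected type are assessed as they were before the refactoring.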
## Test Results
### Before Fix:
- CVPR: SUSPICIOUS (0.68)
- ICCV: SUSPICIOUS (0.68)
- NeurIPS: SUSPICIOUS (0.60)
- IJCAI: SUSPICIOUS (0.60)
### After Fix:
- CVPR: LEGITIMATE (0.82) ✅
- ICCV: LEGITIMATE (0.82) ✅
- NeurIPS: UNKNOWN (0.20) ✅ (no longer falsely flagged)
- IJCAI: UNKNOWN (0.20) ✅ (no longer falsely flagged)
## Impact
- ✅ Eliminates false positives for top-tier conferences
- ✅ Preserves predatory detection capabilities
- ✅ Provides foundation for conference-specific data sources
- ✅ Maintains backward compatibility for journal assessments
## Related Issues
- Fixes #75: Fix suspicious classification of legitimate top-tier venues
- Foundation for #76: Conference name normalization improvements
- Foundation for #77: OpenAlex conference scoring enhancements
- Foundation for #78: Conference series matching
- Foundation for #79: Additional conference data source integration
* fix: Address quality check issues
- Remove unused variable in conference green flags function
- Fix OpenAlex test with realistic mock data for new scoring algorithm
- Remove debug files without SPDX headers
- Format code with ruff
* Remove unrelated files from PR
- Remove scripts/post-pr-merge.sh (unrelated script)
- Remove tmp/ files (temporary test data)
- Keep only the core conference assessment refactoring changes
---------
Co-authored-by: florath-ai-assistant[bot] <Andreas.Florath@telekom.de>
1 parent 81405d8 commit 33bc432
3 files changed, +245 -21 lines changed:
- src/aletheia_probe
- backends
- tests/unit