Commit 2c04a9e
Add LaTeX escape sequence cleaning to BibTeX parser to properly handle
special characters in venue names. This fixes cases where journals like
"Computers \& Security" were not matching in backend databases due to
unprocessed escape sequences.
Changes:
- Add _clean_latex_escapes() method to handle common LaTeX escapes
- Integrate escape cleaning into _remove_nested_braces() workflow
- Handle both single and double backslash patterns
- Add comprehensive unit tests for escape sequence cleaning
Handles escape sequences: \& \' \" \{ \} \$ \% \# \_ \^ \~
[AI-assisted]
Co-authored-by: florath-ai-assistant[bot] <Andreas.Florath@telekom.de>
1 parent 13f94c0 commit 2c04a9e
File tree
2 files changed
+164
-6
lines changed- src/aletheia_probe
- tests/unit
2 files changed
+164
-6
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
356 | 356 | | |
357 | 357 | | |
358 | 358 | | |
| 359 | + | |
| 360 | + | |
| 361 | + | |
| 362 | + | |
| 363 | + | |
| 364 | + | |
| 365 | + | |
| 366 | + | |
| 367 | + | |
| 368 | + | |
| 369 | + | |
| 370 | + | |
| 371 | + | |
| 372 | + | |
| 373 | + | |
| 374 | + | |
| 375 | + | |
| 376 | + | |
| 377 | + | |
| 378 | + | |
| 379 | + | |
| 380 | + | |
| 381 | + | |
| 382 | + | |
| 383 | + | |
| 384 | + | |
| 385 | + | |
| 386 | + | |
| 387 | + | |
| 388 | + | |
| 389 | + | |
| 390 | + | |
| 391 | + | |
| 392 | + | |
| 393 | + | |
| 394 | + | |
| 395 | + | |
| 396 | + | |
| 397 | + | |
| 398 | + | |
| 399 | + | |
| 400 | + | |
| 401 | + | |
| 402 | + | |
| 403 | + | |
| 404 | + | |
| 405 | + | |
| 406 | + | |
| 407 | + | |
| 408 | + | |
| 409 | + | |
| 410 | + | |
| 411 | + | |
| 412 | + | |
359 | 413 | | |
360 | 414 | | |
361 | | - | |
| 415 | + | |
362 | 416 | | |
363 | | - | |
364 | | - | |
| 417 | + | |
| 418 | + | |
| 419 | + | |
365 | 420 | | |
366 | 421 | | |
367 | | - | |
| 422 | + | |
368 | 423 | | |
369 | 424 | | |
370 | | - | |
| 425 | + | |
371 | 426 | | |
372 | 427 | | |
373 | 428 | | |
374 | 429 | | |
| 430 | + | |
375 | 431 | | |
376 | 432 | | |
377 | 433 | | |
378 | 434 | | |
379 | | - | |
| 435 | + | |
| 436 | + | |
| 437 | + | |
| 438 | + | |
380 | 439 | | |
381 | 440 | | |
382 | 441 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
817 | 817 | | |
818 | 818 | | |
819 | 819 | | |
| 820 | + | |
| 821 | + | |
| 822 | + | |
| 823 | + | |
| 824 | + | |
| 825 | + | |
| 826 | + | |
| 827 | + | |
| 828 | + | |
| 829 | + | |
| 830 | + | |
| 831 | + | |
| 832 | + | |
| 833 | + | |
| 834 | + | |
| 835 | + | |
| 836 | + | |
| 837 | + | |
| 838 | + | |
| 839 | + | |
| 840 | + | |
| 841 | + | |
| 842 | + | |
| 843 | + | |
| 844 | + | |
| 845 | + | |
| 846 | + | |
| 847 | + | |
| 848 | + | |
| 849 | + | |
| 850 | + | |
| 851 | + | |
| 852 | + | |
| 853 | + | |
| 854 | + | |
| 855 | + | |
| 856 | + | |
| 857 | + | |
| 858 | + | |
| 859 | + | |
| 860 | + | |
| 861 | + | |
| 862 | + | |
| 863 | + | |
| 864 | + | |
| 865 | + | |
| 866 | + | |
| 867 | + | |
| 868 | + | |
| 869 | + | |
| 870 | + | |
| 871 | + | |
| 872 | + | |
| 873 | + | |
| 874 | + | |
| 875 | + | |
| 876 | + | |
| 877 | + | |
| 878 | + | |
| 879 | + | |
| 880 | + | |
| 881 | + | |
| 882 | + | |
| 883 | + | |
| 884 | + | |
| 885 | + | |
| 886 | + | |
| 887 | + | |
| 888 | + | |
| 889 | + | |
| 890 | + | |
| 891 | + | |
| 892 | + | |
| 893 | + | |
| 894 | + | |
| 895 | + | |
| 896 | + | |
| 897 | + | |
| 898 | + | |
| 899 | + | |
| 900 | + | |
| 901 | + | |
| 902 | + | |
| 903 | + | |
| 904 | + | |
| 905 | + | |
| 906 | + | |
| 907 | + | |
| 908 | + | |
| 909 | + | |
| 910 | + | |
| 911 | + | |
| 912 | + | |
| 913 | + | |
| 914 | + | |
| 915 | + | |
| 916 | + | |
| 917 | + | |
| 918 | + | |
0 commit comments