Extract references from related work section (Part 1)#15316
Extract references from related work section (Part 1)#15316subhramit merged 52 commits intoJabRef:mainfrom
Conversation
Review Summary by QodoAdd related work extraction and reference matching functionality
WalkthroughsDescription• Adds related work extraction feature to parse citations from PDF documents • Implements text parsing to extract citation markers and context from related work sections • Matches parsed references against library entries using duplicate detection • Inserts matched references as user-specific comments in target bibliography entries Diagramflowchart LR
A["Related Work Text"] -->|parse| B["RelatedWorkTextParser"]
B -->|extract snippets| C["RelatedWorkSnippet"]
D["PDF File"] -->|parse references| E["RelatedWorkReferenceResolver"]
E -->|extract entries| F["BibEntry References"]
C -->|match against| F
F -->|find duplicates| G["DuplicateCheck"]
G -->|create match results| H["RelatedWorkMatchResult"]
H -->|insert comments| I["RelatedWorkService"]
I -->|generate insertion results| J["RelatedWorkInsertionResult"]
File Changes1. jablib/src/main/java/org/jabref/logic/relatedwork/RelatedWorkSnippet.java
|
Code Review by Qodo
1.
|
This comment has been minimized.
This comment has been minimized.
Co-authored-by: Oliver Kopp <kopp.dev@gmail.com>
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
|
By the way, is it better to continue the work in this pr or create a new one, because I have some ideas about UI and supporting more citation formats |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
calixtus
left a comment
There was a problem hiding this comment.
everything looks really good now. just a few small suggestions / nitpicks
|
Is this all cleared? If so, please check the check boxes. #15316 (review) |
Dumb question: It seems I cannot check the check boxes in #15316 (review). Maybe I have to "copy markdown" and do it myself? |
Expected: "[2]" is found.
|
I will do this Edit: Requirement added.
I did not check it because we still need more tests to check edge cases. (texts contain "et al.", "dr.", etc.) I will work on generating more manual tests for #15351. |
Not dumb, my fault., I forgot that this wasn't your comment and one cannot just alter someone else comments. 😆 |
Leaving this link here for future reference. Edit - okay apparently you are coupling it with this PR itself. Good. |
Head branch was pushed to by a user without write access
|
When I was looking at https://docs.jabref.org/advanced/entryeditor/entrylinks today, I thought wouldn't it makes sense to put some of those references into their an own bibtex field, such as |
|
Please create an issue, so we can track that. As a comment, this will be lost. |



Related issues and pull requests
Closes #14085
PR Description
This PR adds a new feature that lets JabRef find existing BibEntries referenced in related work text and add the corresponding descriptive text to the
comments-{username}field.Users can now extract references from related work section in the attached PDF file, and add comment field to corresponding bib entries.
Change overview
Future work in the next pr(s)
(John 2026),(Lee et al. 2020; John 2026), example3Checklist
CHANGELOG.mdin a way that can be understood by the average user (if change is visible to the user)