This is the documentation for the Reading papers: Extraction of molecular interaction networks with large language models study.
Questions about the study can be addressed to Enio Gjerga, Philipp Wiesenbach or Prof. Christoph Dieterich or feel free to open an Issue for anything related to the code.
This study makes use of the RegulaTome corpus of literature, so we suggest to have a look at this study first.
Distributed under the GNU GPLv3 License.
The code and results have been organized in the following direcotries:
Here we provide the structured JSON files for each case study:
- PPI evaluations over the RegulaTome corpus (regulatome_eval_ppi)
- Synonyms of entities participating in PPI interactions from the RegulaTome corpus (regulatome_synonyms)
- PPI evaluations over the selected Manuscript corpus (manuscript_eval_ppi)
The JSON files being contained in each of these directories have been named according to the type as well as the LLM model used for evaluation (Please refer to the study manuscript for the nomenclature).
Here we provide the scripts that were used to calculate and visualize the precision-recall values for PPI relation extraction in the RegulaTome and the selected manuscript corpus.