data/few_example_data holds a few examples from the Paragraph2Action dataset; it is a subset of https://github.com/rxn4chemistry/paragraph2actions/tree/main/test_data
The full Paragraph2Action dataset (data/hand_annotated) is available upon request and requires a Pistachio license.
The processed dataset is in data/processed_data.
For prompting ChatGPT, see prompt_chatgpt_for_paragraph2action.ipynb
For fine-tuning ChatGPT, see finetune_chatgpt_for_paragraph2action.ipynb
Training code for full fine-tuning of LLMs is in finetune_llms_full_for_paragraph2action.py
Inference code for fully fine-tuned LLMs is in vllm_inference_full_finetuned_llms.ipynb
Training code for PEFT fine-tuning of LLMs is in finetune_llms_peft_for_paragraph2action.py
Inference code for PEFT fine-tuned LLMs is in vllm_inference_peft_finetuned_llms.ipynb
Training code for BART or T5 is in finetune_bart_or_t5_for_paragraph2action.py
All predictions are saved in results/predictions.
Evaluation code is in evaluate_for_paragraph2action.ipynb
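
For a quick sanity check outside the notebook, predictions can be compared with reference action sequences as in the sketch below. This is only an illustration and not the code in evaluate_for_paragraph2action.ipynb: it assumes predictions and references are line-aligned lists of strings, the function names exact_match_accuracy and mean_similarity are hypothetical, the example strings are made up, and the metrics the notebook actually reports may differ.

```python
from difflib import SequenceMatcher


def _check_aligned(predictions, references):
    # Both lists must describe the same examples, in the same order.
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")


def exact_match_accuracy(predictions, references):
    """Fraction of predictions identical to their reference (whitespace-trimmed)."""
    _check_aligned(predictions, references)
    if not references:
        return 0.0
    hits = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
    return hits / len(references)


def mean_similarity(predictions, references):
    """Average character-level similarity ratio from difflib, between 0 and 1."""
    _check_aligned(predictions, references)
    if not references:
        return 0.0
    ratios = [
        SequenceMatcher(None, p.strip(), r.strip()).ratio()
        for p, r in zip(predictions, references)
    ]
    return sum(ratios) / len(ratios)


if __name__ == "__main__":
    # Made-up strings for illustration only; not taken from the dataset.
    refs = ["ADD water; STIR for 1 h", "FILTER keep precipitate"]
    preds = ["ADD water; STIR for 1 h", "FILTER keep filtrate"]
    print("exact match:", exact_match_accuracy(preds, refs))
    print("mean similarity:", round(mean_similarity(preds, refs), 3))
```

To use it on saved outputs, read a prediction file from results/predictions and the matching reference file into two lists (one example per line, assuming that is how they are stored) and pass them to the two functions.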