Skip to content

Commit a7ddc78

Browse files
authored
Update README.md
Added reproduction instruction for the ELN data.
1 parent 2685fc2 commit a7ddc78

File tree

1 file changed

+14
-0
lines changed

1 file changed

+14
-0
lines changed

README.md

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -72,6 +72,20 @@ The library contains the class `DrfpEncoder` with one public method `encode`.
7272
# Reproduce
7373
Want to reproduce the results in our paper? You can find all the data in the `data` folder and encoding and training scripts in the `scripts` folder.
7474

75+
## Electronic Laboratory Notebook (ELN) Experiment
76+
To reproduce the experiments on the electronic laboratory notebook (ELN) data:
77+
78+
1. Encode the data using the script `encode_az_reactions.py`. This will write the files `az-2048-3-true.pkl` and `az-2048-3-true.pkl.gz` to the folder `data/az`.
79+
```bash
80+
cd scripts
81+
python encoding/encode_az_reactions.py
82+
```
83+
2. Train and test the xgboost model using the script `yield_prediction_az.py` (or `yield_prediction_az_rf.py` for the random forest model):
84+
```bash
85+
python training/yield_prediction_az.py
86+
python training/yield_prediction_az_rf.py
87+
```
88+
7589
# Cite Us
7690
```
7791
@article{probst2022reaction,

0 commit comments

Comments
 (0)