You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+21-2Lines changed: 21 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,9 +4,20 @@
4
4
5
5
Contains code of the paper titled _"Learning an Unreferenced Metric for Online Dialogue Evaluation"_ to appear at **ACL 2020**, [Arxiv](https://arxiv.org/abs/2005.00583)
To get the trained models, [download the data from here](https://drive.google.com/file/d/1Ysso9hdzSenK13LjOFombyXYqA_kv-Vy/view?usp=sharing).
14
+
- Get the `convai2` train and test data and pre-trained Distilbert [embeddings here](https://drive.google.com/file/d/1VVcsxmUrDSRIfunPWe9UO1aeCz-lITNy/view?usp=sharing). Download and unzip in the folder `convai2_data`.
15
+
- Get the trained model checkpoints [from here](https://drive.google.com/file/d/1Ysso9hdzSenK13LjOFombyXYqA_kv-Vy/view?usp=sharing). Download and unzip into the folder `full_acl_runs`.
16
+
- For individual licensing reasons we cannot release the train/test data of MultiWoz, Frames and DailyDialog. Please [send me a mail](mailto:[email protected]) if you need them!
17
+
- Run inference using `./run_inference.sh`
18
+
19
+
**N.B.** - For model names and checkpoints, please refer to `run_inference.sh` script.
20
+
10
21
11
22
## Computing Backtranslation
12
23
@@ -38,6 +49,8 @@ For baselines, add the appropriate flag:
38
49
--train_baseline [infersent/ruber/bertnli]
39
50
```
40
51
52
+
An example training script is provided at [`run_training.sh`](run_training.sh)
53
+
41
54
## Inference Script
42
55
43
56
```
@@ -48,7 +61,8 @@ For baselines, add the appropriate flag:
Outputs the results in a `jsonl` file. To measure human correaltion with [See et al 2019](https://parl.ai/projects/controllable_dialogue/), specify `--human_eval` flag and `--human_eval_file` location.
64
+
- Outputs the results in a `jsonl` file. To measure human correaltion with [See et al 2019](https://parl.ai/projects/controllable_dialogue/), specify `--human_eval` flag and `--human_eval_file` location.
65
+
- We have also added the script to run inference on our trained checkpoints - [`run_inference.sh`](run_inference.sh).
52
66
53
67
## Acknowledgements
54
68
@@ -60,6 +74,11 @@ Outputs the results in a `jsonl` file. To measure human correaltion with [See et
60
74
- ParlAI - https://parl.ai/
61
75
- See et al 2019 data - https://parl.ai/projects/controllable_dialogue/
0 commit comments