Code for the paper "Solving the Challenge Set without Solving the Task: On Winograd Schemas as a Test of Pronominal Coreference Resolution" presented at CoNLL 2024.
The precise package versions used are listed in the requirements.txt.
As a starting point, we use the formatted datasets made available from https://github.com/ianporada/coref-data
See preprocessing/ for the scripts used to preprocess these datasets.
The resulting formatted data is available as jsonlines upon request (due to licensing requirements). The permissive license subset of the data is publicly available at coref-data/pcr_public_datasets.
Model predictions (baselines and LMs) are available in jsonlines format at coref-data/pcr_model_preds.
For almost all models, we use the publicly released weights. The finetuned Llama 3.1 8B model weights are available on Hugging Face here: https://huggingface.co/ianporada/llama-3.1-8b-pcr
LM inference and preprocessing code is located at llm/
Baseline inference code (dcoref and Maverick) is available at baselines/
Paper: https://arxiv.org/abs/2410.09448
@misc{porada2024solvingchallengesetsolving,
title={Solving the Challenge Set without Solving the Task: On Winograd Schemas as a Test of Pronominal Coreference Resolution},
author={Ian Porada and Jackie Chi Kit Cheung},
year={2024},
eprint={2410.09448},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2410.09448},
}