SCAI Eval 2024 Metric: Simplicity

Calculates Flesch-Kincaid readability levels for SCAI Eval.

Labels each turn based on the readability score of the response, loosely based on the table in Wikipedia:

if score <= 30:
    labels[turn_id] = ["very-difficult"]
elif score <= 60:
    labels[turn_id] = ["difficult"]
elif score <= 90:
    labels[turn_id] = ["easy"]
else:
    labels[turn_id] = ["very-easy"]

Calculation is based on LFTK.

Local

Setup

python3 -m venv venv
source venv/bin/activate
pip3 install -r requirements.txt
python -m spacy download en_core_web_sm

Run

python3 simplicity.py example.ndjson

Dockerized

docker build -t registry.webis.de/code-research/tira/tira-user-scai-info/scai-eval24-metric-simplicity:1.0.0 .
docker run --rm -it -v $PWD:/data registry.webis.de/code-research/tira/tira-user-scai-info/scai-eval24-metric-simplicity:1.0.0 /data/example.ndjson

Run on SCAI Eval 2024 data (will be downloaded automatically)

tira-run \
  --input-dataset scai-eval-2024-metric-submission/scai-eval24-2023-09-26-20230926-training \
  --image registry.webis.de/code-research/tira/tira-user-scai-info/scai-eval24-metric-simplicity:1.0.0 \
  --evaluate true \
  --command 'python3 /app/simplicity.py $inputDataset/* > $outputDir/run.json'

# view labels
less tira-output/run.json

In step 2 of the "Create New Docker Software" dialog in TIRA, click on "PUSH NEW DOCKER IMAGE" to get instructions to upload your own image. Use the same command as above in TIRA. For this metric, it is:

python3 /app/simplicity.py $inputDataset/* > $outputDir/run.json

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
example.ndjson		example.ndjson
requirements.txt		requirements.txt
simplicity.py		simplicity.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SCAI Eval 2024 Metric: Simplicity

Local

Dockerized

About

Uh oh!

Uh oh!

Languages

License

search-oriented-conversational-ai/scai-eval24-metric-simplicity

Folders and files

Latest commit

History

Repository files navigation

SCAI Eval 2024 Metric: Simplicity

Local

Dockerized

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages