GitHub - ghydsgaaa/Cambridge-AUT-dataset: Repository for the paper at IC2023: A new dataset and method for creativity assessment using the alternate uses task

A New Dataset and Method for Creativity Assessment Using the Alternate Uses Task

add paper url after publication Luning Sun, Hongyi Gu, Rebecca Myers, Zheng Yuan

This repo contains:

The Dataset collected in AUT test using two prompts: "bowl" and "paperclip"
Finetuned binary classification models using BERT and ROBERTA
classification answer provided by gpt3.5-turbo

Paper abstract: Creativity ratings by humans for the alternate uses task (AUT) tend to be subjective and inefficient. To automate the scoring process of the AUT, previous literature suggested using semantic dis- tance from non-contextual models. In this paper, we extend this line of research by including contextual semantic models and more impor- tantly, exploring the feasibility of predicting creativity ratings with su- pervised discriminative machine learning models. Based on a newly col- lected dataset, our results show that supervised models can successfully classify between creative and non-creative responses even with unbal- anced data, and can generalise well to out-of-domain unseen prompts

Cambridge AUT dataset

In order to faciliate future development of auto-evaluation on AUT test, we hereby release our AUT dataset. It is collected as part of a larger project on creativity assessment. The responses arer rated by three raters on their originality, using a Likert scale from 0 to 4, where 0 indicates a not valid or not relevant use, 1 a common use without any originality, 2 an uncommon use with limited originality, and 3 and 4 original uses with moderate and extreme creativity, respectively

cambridge AUT dataset	prompt
bowl AUT dataset.xlsx	bowl
paperclip AUT datset.xlsx	paperclip

pretrained models

We provide the following models mentioned in our paper:

model type	prompt
bert-base-uncased	bowl
bert-base-uncased	paperclip
bert-base-uncased	both
roberta-base-uncased	bowl
roberta-base-uncased	paperclip
roberta-base-uncased	both

Gpt 3.5 turbo classfication

We apply ChatGPT (gpt-3.5-turbo at temperature 0) to the same task as our models for comparison. The prompt used are provided in (https://github.com/ghydsgaaa/Cambridge-AUT-dataset/blob/main/gpt3.5%20turbo/prompts.py) and results are provided in (link to be added)

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
finetuning		finetuning
gpt3.5 turbo		gpt3.5 turbo
model		model
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A New Dataset and Method for Creativity Assessment Using the Alternate Uses Task

Cambridge AUT dataset

pretrained models

Gpt 3.5 turbo classfication

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

A New Dataset and Method for Creativity Assessment Using the Alternate Uses Task

Cambridge AUT dataset

pretrained models

Gpt 3.5 turbo classfication

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors 1

Languages

Packages