Evaluative AI

Evaluative AI is a decision-support tool that provides positive and negative evidence for a given hypothesis. It currently supports two types of dataset, tabular and image, as shown below.

1. Tabular dataset

Figure 1: Example of tabular data analysis showing positive and negative evidence for the hypothesis "low".

2. Image dataset

Figure 2: Example of image data analysis showing positive and negative evidence for a skin cancer diagnosis.

This tool finds high-level human-understandable concepts (e.g., Irregular Pigmentation) in an image and generates the Weight of Evidence (WoE) for each hypothesis in the decision-making process.
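
The Weight of Evidence for a hypothesis h given evidence e is commonly defined as the log-likelihood ratio woe(h : e) = log(P(e | h) / P(e | not h)). The snippet below is a minimal sketch of that definition only and is not the code in woe/woe.py; the function name and the probabilities are hypothetical and chosen purely for illustration.

```python
import math


def weight_of_evidence(p_e_given_h: float, p_e_given_not_h: float) -> float:
    """Log-likelihood ratio log(P(e | h) / P(e | not h)).

    Positive values mean the evidence supports h, negative values mean it
    counts against h, and zero means the evidence is neutral.
    """
    if not (0.0 < p_e_given_h <= 1.0 and 0.0 < p_e_given_not_h <= 1.0):
        raise ValueError("both likelihoods must be in (0, 1]")
    return math.log(p_e_given_h / p_e_given_not_h)


# Hypothetical likelihoods of observing a concept under h and under not-h.
woe_for = weight_of_evidence(0.6, 0.2)      # evidence is more likely under h
woe_against = weight_of_evidence(0.1, 0.4)  # evidence is more likely under not-h
```

Under these assumed numbers, woe_for is positive (evidence for h) and woe_against is negative (evidence against h), which mirrors the positive and negative evidence shown in the figures.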

Prerequisites

Environment

Python 3.10.4
CUDA 12.2.0
UCX-CUDA 1.13.1-CUDA-12.2.0
cuDNN 8.9.3.28-CUDA-12.2.0
Graphviz 5.0.0
torch 2.1.2+cu121
torchvision 0.16.2+cu121

Installation

virtualenv ~/venvs/venv-3.10.4-cuda12.2  # create the env
source ~/venvs/venv-3.10.4-cuda12.2/bin/activate  # activate the env
pip3 install torch torchvision torchaudio
pip3 install -r requirements.txt
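
Because the pip commands above do not pin versions, the installed torch and torchvision may differ from the versions listed under Environment. The following is a small sketch for checking this after installation; it is not part of the repository, and the helper name check_versions is hypothetical. It uses only the standard library, so it also runs when the packages are absent.

```python
from importlib import metadata

# Versions listed in the Environment section; "+cu121" is the CUDA build tag.
EXPECTED = {"torch": "2.1.2", "torchvision": "0.16.2"}


def check_versions(expected: dict[str, str]) -> dict[str, str]:
    """Return a status per package: 'ok', 'missing', or 'found <version>'."""
    report = {}
    for name, wanted in expected.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        # Compare only the part before any local build tag such as "+cu121".
        base = installed.split("+", 1)[0]
        report[name] = "ok" if base == wanted else f"found {installed}"
    return report


if __name__ == "__main__":
    for package, status in check_versions(EXPECTED).items():
        print(f"{package}: {status}")
```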

Code structure

datasets
├── 7-point-criteria
└── HAM10000
save_model
EvaluativeAI
├── Explainers
├── README.md
├── classifiers.py
├── config.py
├── eval.py
├── ice
│   ├── __init__.py
│   ├── channel_reducer.py
│   ├── explainer.py
│   ├── model_wrapper.py
│   └── utils.py
├── learn_concepts_dataset.py
├── main.py
├── pcbm
│   ├── concepts
│   │   ├── __init__.py
│   │   └── concept_utils.py
│   ├── data
│   │   ├── __init__.py
│   │   ├── concept_loaders.py
│   │   ├── constants.py
│   │   └── derma_data.py
│   └── models
│       ├── __init__.py
│       ├── derma_models.py
│       ├── model_zoo.py
│       └── pcbm_utils.py
├── pcbm_output
├── preprocessing
│   ├── cnn_backbones.py
│   ├── data_utils.py
│   ├── initdata.py
│   └── params.py
├── reproducibility
│   └── script
│       ├── pretrained.sh
│       └── scratch.sh
├── results
├── requirements.txt
├── train_cnn.py
├── utils.py
├── woe
│   ├── __init__.py
│   ├── explainers.py
│   ├── woe.py
│   └── woe_utils.py
├── online_data
├── example-image.ipynb
└── example-tabular.ipynb

Datasets

Ames Housing Dataset: Github Link
HAM10000 dataset: Skin lesion classification dataset - Kaggle Link
Derm7pt: Dermatology concepts dataset - Link

Please place each dataset in the corresponding folder, following the code structure above.
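
As a quick sanity check before training, the sketch below reports which dataset folders are missing. It is not part of the repository: the helper name missing_dataset_dirs is hypothetical, it assumes it is run from the directory that contains datasets, and it checks only the two folders named in the code structure (the tree does not show where the Ames Housing data goes, so that dataset is not checked).

```python
from pathlib import Path

# Folder names taken from the code structure above.
EXPECTED_DATASET_DIRS = ("7-point-criteria", "HAM10000")


def missing_dataset_dirs(root: Path) -> list[str]:
    """Return the expected dataset folders that do not exist under root/datasets."""
    datasets = root / "datasets"
    return [name for name in EXPECTED_DATASET_DIRS if not (datasets / name).is_dir()]


if __name__ == "__main__":
    missing = missing_dataset_dirs(Path("."))
    if missing:
        print("Missing dataset folders:", ", ".join(missing))
    else:
        print("All expected dataset folders are present.")
```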

Usage

Reproducibility

To reproduce the results in the paper From Evidence to Decision: Exploring Evaluative AI, either use the pre-trained models or train from scratch as described below. Then run python eval.py > results/computational.txt, which writes the results to results/computational.txt.

Use pre-trained models

Available pre-trained models (Download):
- Pre-trained CNN backbones ResNet50, ResNeXt50 and ResNet152: in the folder save_model
- Pre-trained concept models ICE and PCBM: pickle files in the folder Explainers
- Pre-trained concept bank for PCBM: in the folder pcbm_output
Please refer to reproducibility/script/pretrained.sh for training with the pre-trained models above.

Train from scratch

Steps to train from scratch:
- Train the CNN backbone model.
- For unsupervised concept learning, train the concept model ICE.
- For supervised concept learning, first train the concept bank using the 7pt checklist dataset, then train the concept model PCBM using the HAM10000 dataset.
Please refer to reproducibility/script/scratch.sh for training from scratch.

Paper

@article{le2024evidence,
  title={{From Evidence to Decision: Exploring Evaluative AI}},
  author={Le, Thao and Miller, Tim and Sonenberg, Liz and Singh, Ronal and Soyer, H Peter},
  journal={arXiv preprint arXiv:2402.01292},
  year={2024}
}

Supplementary material: https://thaole25.github.io/aij-supp/
