"BendVLM: Test-Time Debiasing of Vision-Language Embeddings" (NeurIPS 2024)

Demo code for "BendVLM: Test-Time Debiasing of Vision-Language Embeddings" (NeurIPS 2024).

Installing Packages

pip install -r requirements.txt

Preparing Datasets

Please download the Celeba_HQ_dialog and FairFace datasets from their sources.

Next, run the following to process the datasets (precompute the CLIP embeddings):

python create_dataset.py --_MODEL_NAME 'clip-vit-base-patch16' --data_path 'PATH_TO_FAIRFACE' --meta_data_file_name 'fairface_label_train.csv' --dataset_name 'fairface'
python create_dataset.py --_MODEL_NAME 'clip-vit-large-patch14' --data_path 'PATH_TO_FAIRFACE'  --meta_data_file_name 'fairface_label_train.csv' --dataset_name 'fairface'
python create_dataset.py --_MODEL_NAME 'clip-vit-base-patch16' --data_path 'PATH_TO_CELEBA'  --meta_data_file_name 'CelebAMask-HQ-attribute-anno.txt' --dataset_name 'celeba'
python create_dataset.py --_MODEL_NAME 'clip-vit-large-patch14' --data_path 'PATH_TO_CELEBA' --meta_data_file_name 'CelebAMask-HQ-attribute-anno.txt' --dataset_name 'celeba'

Pre-Generating Query Augmentations

Rather than generating gender and race augmentations on-the-fly using our AttributeAugment componenent, for the purposes of our experiments only we pre-generate the augmenations for efficiency. You can pre-generate these augmentations by running:

python get_pregenerated_attribute_augmentations.py --query_type "hair" --att_to_debias "gender"
python get_pregenerated_attribute_augmentations.py --query_type "stereotype" --att_to_debias "gender"
python get_pregenerated_attribute_augmentations.py --query_type "stereotype" --att_to_debias "race"

Experimental Results

You can perform a comparative analysis between Bend_VLM and the compared methods by running the cells in the demo_debias.ipynb notebook. You can change the dataset, protected atttribute (race/gender), query class type (hair or stereotype), and CLIP embedding model by passing in the appropriate config file in config = yaml.safe_load(open("experimental_configs/celeba_hair_gender_clip-vit-base-patch16.yml")). Each config file is named as {dataset}_{protected_attribute}_{query_type}_{model_name}.

Citation

@inproceedings{gerych2024bendvlm,
 title={BendVLM: Test Time Debiasing Of Pretrained Vision-Language Models},
 author={Walter Gerych and Haoran Zhang and Kimia Hamidieh and Eileen Pan and Maanas Sharma and Thomas Hartvigsen and Marzyeh Ghassemi},
 booktitle = {Advances in Neural Information Processing Systems},
 year = {2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
data/fold_indices		data/fold_indices
experimental_configs		experimental_configs
query_templates		query_templates
README.md		README.md
attribute_augment_component.py		attribute_augment_component.py
bend_utils.py		bend_utils.py
create_dataset.py		create_dataset.py
demo_debias.ipynb		demo_debias.ipynb
get_pregenerated_attribute_augmentations.py		get_pregenerated_attribute_augmentations.py
queries.py		queries.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

"BendVLM: Test-Time Debiasing of Vision-Language Embeddings" (NeurIPS 2024)

Installing Packages

Preparing Datasets

Pre-Generating Query Augmentations

Experimental Results

Citation

About

Uh oh!

Releases

Packages

Languages

waltergerych/bend_vlm

Folders and files

Latest commit

History

Repository files navigation

"BendVLM: Test-Time Debiasing of Vision-Language Embeddings" (NeurIPS 2024)

Installing Packages

Preparing Datasets

Pre-Generating Query Augmentations

Experimental Results

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages