Exploring the interplay of label bias with subgroup size and separability

This repository contains code associated with the paper:

E. A. M. Stanley, R. Mehta, M. Roschewitz, N. D. Forkert, B. Glocker. Exploring the interplay of label bias with subgroup size and separability: A case study in mammographic density classification. MICCAI Fairness of AI in Medical Imaging Workshop, 2025.

[arXiv][Proceedings]

Overview

/data_preproc contains notebooks for filtering the metadata associated with the images, and applying label bias (as described in sections 2.1 and 2.2 of the paper).
/models contains python files for training ResNet-18 models for breast density classification (mammo-net-density.py and mammo-net-density-label-bias.py), as well as imaging manufacturer (mammo-net-manufacturer.py) and pseudo-subgroup (mammo-net-pseudo-subgroup.py) classification. run_models.sh contains example commands for running these files.
/analysis contains notebooks for evaluating subgroup separability (model_evaluation_separability.ipynb), evaluating breast density classification performance (model_evaluation_clean.ipynb and model_evaluation_labelbias.ipynb), and performing feature inspection along the first principal component from the penultimate layer of the trained models (model_inspection.ipynb).

The environment.yml file allows for installation of all dependencies in a conda environment.

Data

Data used in this work comes from the EMory BrEast imaging Dataset (EMBED). Data requests and documentation can be accessed here: github.com/Emory-HITI/EMBED_Open_Data/.

Funding acknowledgements

E.A.M.S. and N.D.F. acknowledge support from the Natural Science and Engineering Research Council of Canada, the Killam Trusts, Alberta Innovates, the Canada Research Chairs Program, and the River Fund at Calgary Foundation. M.R. was funded by an Imperial College London President’s PhD Scholarship and a Google PhD Fellowship. B.G. received support from the Royal Academy of Engineering as part of his Kheiron/RAEng Research Chair. The project was supported by the European Union's Horizon Europe research and innovation programme for the AI-POD project under grant agreement 101080302. Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or HaDEA. Neither the European Union nor the granting authority can be held responsible for them.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
analysis		analysis
data_preproc		data_preproc
models		models
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Exploring the interplay of label bias with subgroup size and separability

Overview

Data

Funding acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Exploring the interplay of label bias with subgroup size and separability

Overview

Data

Funding acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages