abstract

booktitle

title

volume

year

layout

series

publisher

issn

id

month

tex_title

firstpage

lastpage

page

order

cycles

bibtex_author

author

date

address

container-title

genre

issued

pdf

extras

A common failure mode of neural networks trained to classify abnormalities in medical images is their reliance on spurious features, which are features that are associated with the class label but are non-generalizable. In this work, we examine if supervising models with increased spatial specificity (i.e., information about the location of the abnormality) impacts model reliance on spurious features. We first propose a data model of spurious features and theoretically analyze the impact of increasing spatial specificity. We find that two properties of the data are impacted when we increase spatial specificity: the variance of the positively-labeled input pixels decreases and the mutual information between abnormal and spurious pixels decreases, both of which contribute to improved model robustness to spurious features. However, supervising models with greater spatial specificity incurs higher annotation costs, since training data must be labeled for the location of the abnormality, leading to a trade-off between annotation costs and model robustness to spurious features. We investigate this trade-off by varying the coarseness of the spatial specificity supplied and sweeping the quantity of training samples that have information about the abnormality location. Further, we assess if semi-supervised and contrastive learning methods improve the cost-robustness trade-off. We empirically examine the impact of supervising models with increased spatial specificity on two medical image datasets known to have spurious features: pneumothorax classification on chest x-rays and melanoma classification from dermoscopic images. We find that while models supervised with binary labels have near-random robust performance (robust AUROC of 0.46), increasing spatial specificity to bounding box detection and image segmentation achieves a robust AUROC of 0.72 and 0.82, respectively, on the pneumothorax classification task. We also observe this trend for the melanoma task, where segmentation models achieve a robust AUROC of 0.73, compared to worse than random performance for models trained with binary labels. Moreover, by leveraging semi-supervised and contrastive methods, models achieve a 5 point gain in robust AUROC when we have access to very few training samples.

Proceedings of the 7th Machine Learning for Healthcare Conference

Reducing Reliance on Spurious Features in Medical Image Classification with Spatial Specificity

182

2022

inproceedings

Proceedings of Machine Learning Research

PMLR

2640-3498

saab22a

0

Reducing Reliance on Spurious Features in Medical Image Classification with Spatial Specificity

760

784

760-784

760

false

Saab, Khaled and Hooper, Sarah and Chen, Mayee and Zhang, Michael and Rubin, Daniel and Re, Christopher

given	family
Khaled	Saab

given	family
Sarah	Hooper

given	family
Mayee	Chen

given	family
Michael	Zhang

given	family
Daniel	Rubin

given	family
Christopher	Re

2022-12-31

Proceedings of the 7th Machine Learning for Healthcare Conference

inproceedings

date-parts

2022

12

31

https://proceedings.mlr.press/v182/saab22a/saab22a.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

2022-12-31-saab22a.md

2022-12-31-saab22a.md

Files

2022-12-31-saab22a.md

Latest commit

History

2022-12-31-saab22a.md

File metadata and controls