Aleatoric uncertainty estimation with test-time augmentation for medical image segmentation with convolutional neural networks

Neurocomputing (Amst). 2019 Sep 3:335:34-45. doi: 10.1016/j.neucom.2019.01.103. Epub 2019 Feb 7.

Authors

Guotai Wang¹ ² ³, Wenqi Li¹ ², Michael Aertsen⁴, Jan Deprest¹ ⁴ ⁵ ⁶, Sébastien Ourselin², Tom Vercauteren¹ ² ⁶

Affiliations

¹ Wellcome / EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK.
² School of Biomedical Engineering and Imaging Sciences, King's College London, London, UK.
³ School of Mechanical and Electrical Engineering, University of Electronic Science and Technology of China, Chengdu, China.
⁴ Department of Radiology, University Hospitals Leuven, Leuven, Belgium.
⁵ Institute for Women's Health, University College London, London, UK.
⁶ Department of Obstetrics and Gynaecology, University Hospitals Leuven, Leuven, Belgium.

Abstract

Despite their state-of-the-art performance in medical image segmentation, deep convolutional neural networks (CNNs) have rarely provided uncertainty estimates for their segmentation outputs, e.g., model (epistemic) and image-based (aleatoric) uncertainties. In this work, we analyze these different types of uncertainty for CNN-based 2D and 3D medical image segmentation tasks at both the pixel level and the structure level. We additionally propose a test-time augmentation-based aleatoric uncertainty to analyze the effect of different transformations of the input image on the segmentation output. Test-time augmentation has previously been used to improve segmentation accuracy, but it has not been formulated in a consistent mathematical framework. Hence, we also propose a theoretical formulation of test-time augmentation, in which a distribution of the prediction is estimated by Monte Carlo simulation with prior distributions of the parameters of an image acquisition model that involves image transformations and noise. We compare and combine our proposed aleatoric uncertainty with model uncertainty. Experiments on segmentation of fetal brains and brain tumors from 2D and 3D magnetic resonance images (MRI) showed that 1) the test-time augmentation-based aleatoric uncertainty provides a better uncertainty estimation than the test-time dropout-based model uncertainty alone and helps to reduce overconfident incorrect predictions, and 2) our test-time augmentation outperforms a single-prediction baseline and dropout-based multiple predictions.

Keywords: Convolutional neural networks; Data augmentation; Medical image segmentation; Uncertainty estimation.
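
The Monte Carlo view of test-time augmentation described in the abstract can be illustrated with a short Python sketch. It is a minimal illustration under stated assumptions, not the authors' implementation: the function names `toy_model` and `tta_predict` are hypothetical, the transformation priors (random 90-degree rotations, random horizontal flips, additive Gaussian noise) are chosen only because they are exactly invertible with NumPy, and the per-pixel uncertainty is taken as the entropy of the foreground vote frequency across samples.

```python
import numpy as np

def toy_model(image):
    """Hypothetical stand-in for a trained CNN: per-pixel foreground probability."""
    return 1.0 / (1.0 + np.exp(-10.0 * (image - 0.5)))

def tta_predict(image, model, n_samples=20, noise_std=0.05, seed=0):
    """Monte Carlo test-time augmentation for a 2D binary segmentation.

    Assumed priors (illustrative only): rotation by k * 90 degrees with k
    uniform in {0, 1, 2, 3}, horizontal flip with probability 0.5, and
    zero-mean Gaussian intensity noise with standard deviation noise_std.
    """
    rng = np.random.default_rng(seed)
    probs = []
    for _ in range(n_samples):
        k = int(rng.integers(0, 4))
        flip = bool(rng.integers(0, 2))
        # Forward acquisition model: spatial transform, then additive noise.
        x = np.rot90(image, k)
        if flip:
            x = np.fliplr(x)
        x = x + rng.normal(0.0, noise_std, size=x.shape)
        p = model(x)
        # Map the prediction back to the original image space
        # by undoing the flip first and then the rotation.
        if flip:
            p = np.fliplr(p)
        p = np.rot90(p, -k)
        probs.append(p)
    probs = np.stack(probs)
    mean_prob = probs.mean(axis=0)
    # Fraction of Monte Carlo samples that label each pixel as foreground.
    freq = (probs > 0.5).mean(axis=0)
    # Binary entropy of the vote frequency as a pixel-level uncertainty map.
    f = np.clip(freq, 1e-12, 1.0 - 1e-12)
    entropy = -(f * np.log(f) + (1.0 - f) * np.log(1.0 - f))
    segmentation = freq >= 0.5  # majority vote
    return segmentation, mean_prob, entropy

# Usage: background (0.0), an ambiguous band (0.5), and foreground (1.0).
image = np.zeros((8, 8))
image[:, 3:5] = 0.5
image[:, 5:] = 1.0
segmentation, mean_prob, entropy = tta_predict(image, toy_model)
```

In this toy setting the entropy map should be close to zero where the intensity is unambiguous and larger in the band where the noise can flip the predicted label, which is the qualitative behavior an aleatoric uncertainty map is meant to capture.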
