Estimation of Inferential Uncertainty in Assessing Expert Segmentation Performance from Staple

Olivier Commowick & Simon K. Warfield

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 5636)

Included in the following conference series:

International Conference on Information Processing in Medical Imaging


Abstract

The evaluation of the quality of segmentations of an image, and the assessment of intra- and inter-expert variability in segmentation performance, have long been recognized as difficult tasks. Recently, an Expectation Maximization (EM) algorithm for Simultaneous Truth and Performance Level Estimation (Staple) was developed to compute both an estimate of the reference standard segmentation and performance parameters from a set of segmentations of an image. The performance is characterized by the rate of detection of each segmentation label by each expert in comparison to the estimated reference standard.
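
For readers unfamiliar with Staple, the EM iteration for the binary case can be sketched as below. The notation is an assumption borrowed from the 2004 STAPLE paper by Warfield, Zou and Wells rather than from this abstract: D_ij is the binary decision of expert j at voxel i, T_i the hidden reference label, W_i the posterior probability that T_i = 1, and p_j, q_j the sensitivity and specificity of expert j.

```latex
% E-step: posterior probability of the reference label at voxel i
a_i = f(T_i = 1) \prod_{j : D_{ij} = 1} p_j \prod_{j : D_{ij} = 0} (1 - p_j), \qquad
b_i = f(T_i = 0) \prod_{j : D_{ij} = 0} q_j \prod_{j : D_{ij} = 1} (1 - q_j), \qquad
W_i = \frac{a_i}{a_i + b_i}

% M-step: updated sensitivity and specificity of expert j
p_j = \frac{\sum_{i : D_{ij} = 1} W_i}{\sum_i W_i}, \qquad
q_j = \frac{\sum_{i : D_{ij} = 0} (1 - W_i)}{\sum_i (1 - W_i)}
```

The two steps alternate until the parameters stop changing. The result is a point estimate of each p_j and q_j with no indication of how precise it is.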

This previous work provides estimates of performance parameters, but does not provide any information regarding their uncertainty. An estimate of this inferential uncertainty, if available, would allow estimation of confidence intervals for the values of the parameters, aid in the interpretation of the performance of segmentation generators, and help determine if sufficient data size and number of segmentations have been obtained to accurately characterize the performance parameters.

We present a new algorithm to estimate the inferential uncertainty of the performance parameters for binary segmentations. It is derived for the special case of the Staple algorithm based on established theory for general purpose covariance matrix estimation for EM algorithms. The bounds on performance estimates are estimated by the computation of the observed Information Matrix. We use this algorithm to study the bounds on performance estimates from simulated images with specified performance parameters, and from interactive segmentations of neonatal brain MRIs. We demonstrate that confidence intervals for expert segmentation performance parameters can be estimated with our algorithm. We investigate the influence of the number of experts and of the image size on these bounds, showing that it is possible to determine the number of image segmentations and the size of images necessary to achieve a chosen level of accuracy in segmentation performance assessment.
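
The closed-form derivation is in the full chapter and is not reproduced on this page. As a rough illustration of the idea only, the sketch below approximates the observed information matrix by finite differences of the observed-data log-likelihood of a binary two-class rater model, then inverts it to obtain Wald-type 95% confidence intervals. This numerical approach is a stand-in for, not a copy of, the authors' algorithm. All function names (`simulate`, `fit_em`, `observed_information`, `wald_intervals`), the fixed prior, and the step size `h` are hypothetical choices made for this example.

```python
import math
import random

def simulate(n_vox, p, q, prior, seed=0):
    """Draw binary decisions for each voxel from experts with sensitivity p[j], specificity q[j]."""
    rng = random.Random(seed)
    data = []
    for _ in range(n_vox):
        truth = rng.random() < prior
        data.append([int(rng.random() < (pj if truth else 1.0 - qj)) for pj, qj in zip(p, q)])
    return data

def _class_terms(row, p, q, prior):
    """Joint probability of one voxel's decisions with T=1 (a) and with T=0 (b)."""
    a, b = prior, 1.0 - prior
    for j, d in enumerate(row):
        a *= p[j] if d else 1.0 - p[j]
        b *= (1.0 - q[j]) if d else q[j]
    return a, b

def log_likelihood(theta, decisions, prior):
    """Observed-data log-likelihood; theta = [p_1..p_J, q_1..q_J]."""
    n_exp = len(theta) // 2
    p, q = theta[:n_exp], theta[n_exp:]
    return sum(math.log(sum(_class_terms(row, p, q, prior))) for row in decisions)

def fit_em(decisions, prior, n_iter=60):
    """Plain EM for the binary model with a fixed prior; returns [p..., q...]."""
    n_vox, n_exp = len(decisions), len(decisions[0])
    p, q = [0.9] * n_exp, [0.9] * n_exp
    for _ in range(n_iter):
        w = []
        for row in decisions:  # E-step: posterior probability that T_i = 1
            a, b = _class_terms(row, p, q, prior)
            w.append(a / (a + b))
        sw = sum(w)  # M-step: weighted detection rates
        p = [sum(wi * r[j] for wi, r in zip(w, decisions)) / sw for j in range(n_exp)]
        q = [sum((1 - wi) * (1 - r[j]) for wi, r in zip(w, decisions)) / (n_vox - sw)
             for j in range(n_exp)]
    return p + q

def observed_information(theta, decisions, prior, h=1e-4):
    """Negative Hessian of the log-likelihood at theta, by central differences."""
    n = len(theta)
    info = [[0.0] * n for _ in range(n)]
    for r in range(n):
        for c in range(r, n):
            acc = 0.0
            for sr, sc in ((1, 1), (1, -1), (-1, 1), (-1, -1)):
                shifted = list(theta)
                shifted[r] += sr * h
                shifted[c] += sc * h
                acc += sr * sc * log_likelihood(shifted, decisions, prior)
            info[r][c] = info[c][r] = -acc / (4 * h * h)
    return info

def invert(matrix):
    """Gauss-Jordan inverse with partial pivoting (small dense matrices only)."""
    n = len(matrix)
    aug = [list(row) + [float(i == j) for j in range(n)] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda k: abs(aug[k][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        piv = aug[col][col]
        aug[col] = [v / piv for v in aug[col]]
        for k in range(n):
            if k != col:
                factor = aug[k][col]
                aug[k] = [v - factor * u for v, u in zip(aug[k], aug[col])]
    return [row[n:] for row in aug]

def wald_intervals(theta, info, z=1.96):
    """Intervals theta_k +/- z * sqrt(cov_kk), clipped to [0, 1]."""
    cov = invert(info)
    return [(max(0.0, t - z * math.sqrt(cov[k][k])), min(1.0, t + z * math.sqrt(cov[k][k])))
            for k, t in enumerate(theta)]
```

Calling `fit_em` on simulated decisions, passing the result to `observed_information`, and then to `wald_intervals` gives one interval per sensitivity and specificity. Repeating this for different numbers of voxels or experts is one way to explore how the interval widths depend on the amount of data, which is the question the abstract raises. The sketch assumes the estimates lie strictly inside (0, 1); at the boundary the finite-difference Hessian is not meaningful.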



Author information

Authors and Affiliations

Computational Radiology Laboratory, Department of Radiology, Children’s Hospital, 300 Longwood Avenue, Boston, MA, 02115, USA
Olivier Commowick & Simon K. Warfield

Editor information

Editors and Affiliations

Department of Electrical and Computer Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218, USA
Jerry L. Prince
Department of Radiology, Johns Hopkins University, 600 North Wolfe Street, Baltimore, MD 21287, USA
Dzung L. Pham
Division of Imaging and Applied Mathematics, OSEL, U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993, USA
Kyle J. Myers

About this paper

Cite this paper

Commowick, O., Warfield, S.K. (2009). Estimation of Inferential Uncertainty in Assessing Expert Segmentation Performance from Staple. In: Prince, J.L., Pham, D.L., Myers, K.J. (eds) Information Processing in Medical Imaging. IPMI 2009. Lecture Notes in Computer Science, vol 5636. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02498-6_58

DOI: https://doi.org/10.1007/978-3-642-02498-6_58
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02497-9
Online ISBN: 978-3-642-02498-6
eBook Packages: Computer Science, Computer Science (R0)
