Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3368555.3384458acmconferencesArticle/Chapter ViewAbstractPublication PageschilConference Proceedingsconference-collections
research-article
Open access

An adversarial approach for the robust classification of pneumonia from chest radiographs

Published: 02 April 2020 Publication History

Abstract

While deep learning has shown promise in the domain of disease classification from medical images, models based on state-of-the-art convolutional neural network architectures often exhibit performance loss due to dataset shift. Models trained using data from one hospital system achieve high predictive performance when tested on data from the same hospital, but perform significantly worse when they are tested in different hospital systems. Furthermore, even within a given hospital system, deep learning models have been shown to depend on hospital- and patient-level confounders rather than meaningful pathology to make classifications. In order for these models to be safely deployed, we would like to ensure that they do not use confounding variables to make their classification, and that they will work well even when tested on images from hospitals that were not included in the training data. We attempt to address this problem in the context of pneumonia classification from chest radiographs. We propose an approach based on adversarial optimization, which allows us to learn more robust models that do not depend on confounders. Specifically, we demonstrate improved out-of-hospital generalization performance of a pneumonia classifier by training a model that is invariant to the view position of chest radiographs (anterior-posterior vs. posterior-anterior). Our approach leads to better predictive performance on external hospital data than both a standard baseline and previously proposed methods to handle confounding, and also suggests a method for identifying models that may rely on confounders.

References

[1]
Marcus A Badgeley, John R Zech, Luke Oakden-Rayner, Benjamin S Glicksberg, Manway Liu, William Gale, Michael V McConnell, Bethany Percha, Thomas M Snyder, and Joel T Dudley. 2019. Deep learning predicts hip fracture using confounding patient and healthcare variables. npj Digital Medicine 2, 1 (2019), 31. 2398-6352
[2]
Shai Ben-David, John Blitzer, Koby Crammer, Alex Kulesza, Fernando Pereira, and Jennifer Wortman Vaughan. 2010. A theory of learning from different domains. Machine learning 79, 1-2 (2010), 151--175. 0885-6125
[3]
Bálint Botz. 2017. A Few Thoughts About CheXNet --- And The Way Human Performance Should (And Should Not) Be Measured. Web (2017), 1--4. https://medium.com/@BalintBotz/a-few-thoughts-about-chexnet-and-the-way-human-performance-should-and-should-not-be-measured-68031dca7bf
[4]
Harrison Edwards and Amos Storkey. 2015. Censoring Representations with an Adversary. (11 2015). https://arxiv.org/abs/1511.05897
[5]
Gabriel Erion, Joseph D Janizek, Pascal Sturmfels, Scott Lundberg, and Su-In Lee. 2019. Learning Explainable Models Using Attribution Priors. arXiv preprint arXiv:1906.10670 (2019). https://arxiv.org/abs/1906.10670
[6]
Andre Esteva, Brett Kuprel, Roberto A Novoa, Justin Ko, Susan M Swetter, Helen M Blau, and Sebastian Thrun. 2017. Dermatologist-level classification of skin cancer with deep neural networks. Nature 542, 7639 (2017), 115. 1476-4687
[7]
Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand, and Victor Lempitsky. 2016. Domain-adversarial training of neural networks. The Journal of Machine Learning Research 17, 1 (2016), 2030--2096. 1532-4435
[8]
J Raymond Geis, Adrian P Brady, Carol C Wu, Jack Spencer, Erik Ranschaert, Jacob L Jaremko, Steve G Langer, Andrea Borondy Kitts, Judy Birch, William F Shields, Robert van den Hoven van Genderen, Elmar Kotter, Judy Wawira Gichoya, Tessa S Cook, Matthew B Morgan, An Tang, Nabile M Safdar, and Marc Kohli. 2019. Ethics of Artificial Intelligence in Radiology: Summary of the Joint European and North American Multisociety Statement. Radiology 293, 2 (10 2019), 436--440. 0033-8419
[9]
Varun Gulshan, Lily Peng, Marc Coram, Martin C Stumpe, Derek Wu, Arunachalam Narayanaswamy, Subhashini Venugopalan, Kasumi Widner, Tom Madams, and Jorge Cuadros. 2016. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. Jama 316, 22 (2016), 2402--2410. 0098-7484
[10]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.
[11]
Gao Huang, Zhuang Liu, Laurens van der Maaten, and Kilian Q. Weinberger. 2016. Densely Connected Convolutional Networks. (8 2016). http://arxiv.org/abs/1608.06993
[12]
Eui Jin Hwang, Ju Gang Nam, Woo Hyeon Lim, Sae Jin Park, Yun Soo Jeong, Ji Hee Kang, Eun Kyoung Hong, Taek Min Kim, Jin Mo Goo, Sunggyun Park, Ki Hwan Kim, and Chang Min Park. 2019. Deep Learning for Chest Radiograph Diagnosis in the Emergency Department. Radiology 293, 3 (10 2019), 573--580. 0033-8419
[13]
Jeremy Irvin, Pranav Rajpurkar, Michael Ko, Yifan Yu, Silviana Ciurea-Ilcus, Chris Chute, Henrik Marklund, Behzad Haghgoo, Robyn Ball, and Katie Shpanskaya. 2019. Chexpert: A large chest radiograph dataset with uncertainty labels and expert comparison. arXiv preprint arXiv:1901.07031 (2019).
[14]
Mehran Javanmardi and Tolga Tasdizen. 2018. Domain adaptation for biomedical image segmentation using adversarial training. In 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018). IEEE, 554--558.
[15]
Alistair E W Johnson, Tom J Pollard, Seth Berkowitz, Nathaniel R Greenbaum, Matthew P Lungren, Chih-ying Deng, Roger G Mark, and Steven Horng. 2019. MIMIC-CXR: A large publicly available database of labeled chest radiographs. arXiv preprint arXiv:1901.07042 (2019).
[16]
Max A Little and Reham Badawy. 2019. Causal bootstrapping. arXiv preprint arXiv:1910.09648 (2019).
[17]
Gilles Louppe, Michael Kagan, and Kyle Cranmer. 2017. Learning to pivot with adversarial networks. In Advances in neural information processing systems. 981--990.
[18]
David Madras, Elliot Creager, Toniann Pitassi, and Richard Zemel. 2018. Learning adversarially fair and transferable representations. arXiv preprint arXiv:1802.06309 (2018).
[19]
Faisal Mahmood, Richard Chen, and Nicholas J Durr. 2018. Unsupervised reverse domain adaptation for synthetic medical images via adversarial training. IEEE transactions on medical imaging 37, 12 (2018), 2572--2581. 0278-0062
[20]
M S Niederman, J B Bass, G Douglas Campbell, A M Fein, R F Grossman, L A Mandell, T J Marrie, A Torres, and V L Yu. 1993. Guidelines for the initial management of adults with community-acquired pneumonia: diagnosis, assessment of severity, and initial antimicrobial therapy. American Review of Respiratory Disease 148, 5 (1993), 1418--1426. 0003-0805
[21]
Luke Oakden-Rayner. 2018. CheXNet: an in-depth review. URL: https://lukeoakdenrayner. wordpress. com/2018/01/24/chexnetan-in-depth-review (2018).
[22]
Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. 2017. Automatic differentiation in pytorch. (2017).
[23]
Judea Pearl and Elias Bareinboim. 2011. Transportability across studies: A formal approach. Technical Report.
[24]
Eduardo H. P. Pooch, Pedro L. Ballester, and Rodrigo C. Barros. 2019. Can we trust deep learning models diagnosis? The impact of domain shift in chest radiograph classification. arXiv preprint arXiv:1909.01940 (2019). http://arxiv.org/abs/1909.01940
[25]
Elizabeth Puddy and Catherine Hill. 2007. Interpretation of the chest radiograph. Continuing Education in Anaesthesia, Critical Care and Pain 7, 3 (2007), 71--75. 1743-1816
[26]
Joaquin Quionero-Candela, Masashi Sugiyama, Anton Schwaighofer, and Neil D Lawrence. 2009. Dataset shift in machine learning. The MIT Press.
[27]
Maithra Raghu, Chiyuan Zhang, Jon Kleinberg, and Samy Bengio. 2019. Transfusion: Understanding transfer learning with applications to medical imaging. arXiv preprint arXiv:1902.07208 (2019).
[28]
Pranav Rajpurkar, Jeremy Irvin, Kaylie Zhu, Brandon Yang, Hershel Mehta, Tony Duan, Daisy Ding, Aarti Bagul, Curtis Langlotz, Katie Shpanskaya, Matthew P. Lungren, and Andrew Y. Ng. 2017. CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning. (11 2017). http://arxiv.org/abs/1711.05225
[29]
Anil Rao, Joao M Monteiro, Janaina Mourao-Miranda, and Alzheimer's Disease Initiative. 2017. Predictive modelling using neuroimaging data in the presence of confounds. NeuroImage 150 (2017), 23--49. 1053-8119
[30]
Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, and Dhruv Batra. 2016. Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization. (10 2016).
[31]
Hidetoshi Shimodaira. 2000. Improving predictive inference under covariate shift by weighting the log-likelihood function. Journal of statistical planning and inference 90, 2 (2000), 227--244. 0378-3758
[32]
Daniel Smilkov, Nikhil Thorat, Been Kim, Fernanda Viégas, and Martin Wattenberg. 2017. Smoothgrad: removing noise by adding noise. arXiv preprint arXiv:1706.03825 (2017).
[33]
Adarsh Subbaswamy and Suchi Saria. 2019. From development to deployment: dataset shift, causality, and shift-stable models in health AI. Biostatistics (11 2019). 1465-4644
[34]
Masashi Sugiyama, Taiji Suzuki, Shinichi Nakajima, Hisashi Kashima, Paul von Bünau, and Motoaki Kawanabe. 2008. Direct importance estimation for covariate shift adaptation. Annals of the Institute of Statistical Mathematics 60, 4 (2008), 699--746. 0020-3157
[35]
Mukund Sundararajan, Ankur Taly, and Qiqi Yan. 2017. Axiomatic attribution for deep networks. In Proceedings of the 34th International Conference on Machine Learning-Volume 70. JMLR. org, 3319--3328.
[36]
Saskia F van Vugt, Theo J M Verheij, Pim A de Jong, Chris C Butler, Kerenza Hood, Samuel Coenen, Herman Goossens, Paul Little, and Berna D L Broekhuizen. 2013. Diagnosing pneumonia in patients with acute cough: clinical judgment compared to chest radiography. European Respiratory Journal 42, 4 (10 2013), 1076 LP -- 1082.
[37]
Christina Wadsworth, Francesca Vera, and Chris Piech. 2018. Achieving fairness through adversarial learning: an application to recidivism prediction. arXiv preprint arXiv:1807.00199 (2018).
[38]
John R. Zech. 2018. What are radiological deep learning models actually learning? medium.com (2018).
[39]
John R. Zech, Marcus A. Badgeley, Manway Liu, Anthony B. Costa, Joseph J. Titano, and Eric Karl Oermann. 2018. Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study. PLOS Medicine 15, 11 (11 2018), e1002683. 1549-1676

Cited By

View all
  • (2024)MV-MFF: Multi-View Multi-Feature Fusion Model for Pneumonia ClassificationDiagnostics10.3390/diagnostics1414156614:14(1566)Online publication date: 19-Jul-2024
  • (2024)Towards the Generation of Medical Imaging Classifiers Robust to Common PerturbationsBioMedInformatics10.3390/biomedinformatics40200504:2(889-910)Online publication date: 1-Apr-2024
  • (2024)AMIKOMNET: Novel Structure for a Deep Learning Model to Enhance COVID-19 Classification Task PerformanceBig Data and Cognitive Computing10.3390/bdcc80700778:7(77)Online publication date: 9-Jul-2024
  • Show More Cited By

Index Terms

  1. An adversarial approach for the robust classification of pneumonia from chest radiographs

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        CHIL '20: Proceedings of the ACM Conference on Health, Inference, and Learning
        April 2020
        265 pages
        ISBN:9781450370462
        DOI:10.1145/3368555
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 02 April 2020

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Adversarial Training
        2. Chest Radiograph
        3. Covariate Shift
        4. Deep Learning
        5. Distributional Robustness
        6. Domain Adaptation
        7. Domain Shift
        8. Pneumonia
        9. Radiology
        10. Robustness

        Qualifiers

        • Research-article

        Funding Sources

        Conference

        ACM CHIL '20
        Sponsor:

        Acceptance Rates

        Overall Acceptance Rate 27 of 110 submissions, 25%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)127
        • Downloads (Last 6 weeks)13
        Reflects downloads up to 01 Dec 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)MV-MFF: Multi-View Multi-Feature Fusion Model for Pneumonia ClassificationDiagnostics10.3390/diagnostics1414156614:14(1566)Online publication date: 19-Jul-2024
        • (2024)Towards the Generation of Medical Imaging Classifiers Robust to Common PerturbationsBioMedInformatics10.3390/biomedinformatics40200504:2(889-910)Online publication date: 1-Apr-2024
        • (2024)AMIKOMNET: Novel Structure for a Deep Learning Model to Enhance COVID-19 Classification Task PerformanceBig Data and Cognitive Computing10.3390/bdcc80700778:7(77)Online publication date: 9-Jul-2024
        • (2024)Multi-modal deep learning methods for classification of chest diseases using different medical imaging and cough soundsPLOS ONE10.1371/journal.pone.029635219:3(e0296352)Online publication date: 12-Mar-2024
        • (2024)Revolutionizing Pneumonia Diagnosis: AI-Driven Deep Learning Framework for Automated Detection From Chest X-RaysIEEE Access10.1109/ACCESS.2024.349894412(171601-171616)Online publication date: 2024
        • (2024)Transparent medical image AI via an image–text foundation model grounded in medical literatureNature Medicine10.1038/s41591-024-02887-x30:4(1154-1165)Online publication date: 16-Apr-2024
        • (2024)A systematic review of generalization research in medical image classificationComputers in Biology and Medicine10.1016/j.compbiomed.2024.109256183(109256)Online publication date: Dec-2024
        • (2024)Classification of Pneumonia from Chest X-Ray Image Using Convolutional Neural NetworkICT: Innovation and Computing10.1007/978-981-99-9486-1_39(471-480)Online publication date: 18-Apr-2024
        • (2024)Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-label Medical Image ClassificationPattern Recognition and Computer Vision10.1007/978-981-97-8496-7_20(279-298)Online publication date: 3-Nov-2024
        • (2023)A Rest API to Classify Pneumonia Infection From Chest X-ray Images Using Multi-Layer Perceptron and LeNet2023 International Conference on Artificial Intelligence, Big Data, Computing and Data Communication Systems (icABCD)10.1109/icABCD59051.2023.10220479(1-6)Online publication date: 3-Aug-2023
        • Show More Cited By

        View Options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Login options

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media