Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification

Published: 01 November 2022 Publication History

Abstract

Binary classification and anomaly detection face the problem of class imbalance in data sets. The contribution of this paper is to provide an ensemble model that improves image binary classification by reducing the class imbalance between the minority and majority classes in a data set. The ensemble model is a classifier of real images, synthetic images, and metadata associated with the real images. First, we apply a generative model to synthesize images of the minority class from the real image data set. Secondly, we train the ensemble model jointly with synthesized images of the minority class, real images, and metadata. Finally, we evaluate the model performance using a sensitivity metric to observe the difference in classification resulting from the adjustment of class imbalance. Improving the imbalance of the minority class by adding half the size of the majority class we observe an improvement in the classifier’s sensitivity by 12% and 24% for the benchmark pre-trained models of RESNET50 and DENSENet121 respectively.

References

[1]
Pang G, Shen C, Cao L, and Van Den Hengel A Deep learning for anomaly detection: a review ACM Comput Surv (CSUR) 2021 54 2 1-38
[2]
Pneumonia. https://www.who.int/news-room/fact-sheets/detail/pneumonia. Accessed 5 Dec 2021
[3]
Goldenberg JM, Cárdenas-Rodríguez J, and Pagel MD Preliminary results that assess metformin treatment in a preclinical model of pancreatic cancer using simultaneous [18F]FDG PET and acidoCEST MRI Mol Imaging Biol 2018 20 4 575-583
[4]
Noor NM, Rijal OM, Yunus A, and Abu-Bakar SAR A discrimination method for the detection of pneumonia using chest radiograph Comput Med Imaging Graph 2010 34 2 160-166
[5]
Oliveira LL, Silva SA, Ribeiro LH, de Oliveira RM, Coelho CJ, and Andrade ALS Computer-aided diagnosis in chest radiography for detection of childhood pneumonia Int J Med Inform 2008 77 8 555-564
[6]
Rajpurkar P, Irvin J, Zhu K, Yang B, Mehta H, Duan T, Ding D, Bagul A, Langlotz C, Shpanskaya K, Lungren MP, Ng AY (2017) Chexnet: radiologist-level pneumonia detection on chest x-rays with deep learning
[7]
Parveen NRS and Sathik MM Detection of pneumonia in chest x-ray images J X-Ray Sci Technol 2011 19 4 423-428
[8]
Zech JR, Badgeley MA, Liu M, Costa AB, Titano JJ, and Oermann EK Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: a cross-sectional study PLoS Med 2018 15 11 e1002683
[9]
Doshi K (2019) Synthetic image augmentation for improved classification using generative adversarial networks. arXiv:1907.13576
[10]
DeVries T, Taylor GW (2017) Improved regularization of convolutional neural networks with cutout. arXiv:1708.04552
[11]
May P (2019) Improved image augmentation for convolutional neural networks by copyout and CopyPairing. arXiv:1909.00390
[12]
Huang CC, Wu YL, Tang CY (2019) Human face sentiment classification using synthetic sentiment images with deep convolutional neural networks. In: 2019 International Conference on Machine Learning and Cybernetics (ICMLC). IEEE, pp 1–5
[13]
Ha Q, Liu B, Liu F (2020) Identifying melanoma images using EfficientNet ensemble: winning solution to the SIIM-ISIC melanoma classification challenge. arXiv:2010.05351
[14]
Calderisi M, Galatolo G, Ceppa I, Motta T, Vergentini F (2019) Improve image classification tasks using simple convolutional architectures with processed metadata injection. In: 2019 IEEE Second International Conference on Artificial Intelligence and Knowledge Engineering (AIKE). IEEE, pp 223–230
[15]
Zhu JY, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE international conference on computer vision, pp 2223–2232
[16]
Ningrum DNA, Yuan S-P, Kung W-M, Wu C-C, Tzeng I-S, Huang C-Y, Yu-Chuan Li J, and Wang Y-C Deep learning classifier with patient’s metadata of dermoscopic images in malignant melanoma detection J Multidiscip Healthc 2021 14 877-885
[17]
Phua C, Lee V, Smith K, Gayler R (2010) A comprehensive survey of data mining-based fraud detection research. arXiv:1009.6119
[18]
Zunair H and Hamza AB Synthesis of COVID-19 chest x-rays using unpaired image-to-image translation Soc Netw Anal Min 2021 11 1 23
[19]
Lequan Yu, Chen H, Dou Q, Qin J, and Heng P-A Automated melanoma recognition in dermoscopy images via very deep residual networks IEEE Trans Med Imaging 2017 36 4 994-1004
[20]
Shie CK, Chuang CH, Chou CN, Wu MH, Chang EY (2015) Transfer representation learning for medical image analysis. In: 2015 37th annual international conference of the IEEE Engineering in Medicine and Biology Society (EMBC). IEEE, pp 711–714
[21]
Shin H-C Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning IEEE Trans Med Imaging 2016 35 5 1285-1298
[22]
Qasim AB, Ezhov I, Shit S, Schoppe O, Paetzold JC, Sekuboyina A, et al. Red-GAN: attacking class imbalance via conditioned generation. Yet another perspective on medical image synthesis for skin lesion dermoscopy and brain tumor MRI Proc Mach Learn Res 2021 1 14
[23]
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S et al (2014) Generative adversarial nets. Adv Neural Inf Process Syst 27
[24]
Menon AK, Williamson RC (2018) The cost of fairness in binary classification. In: Conference on Fairness, accountability and transparency, pp 107–118. PMLR

Index Terms

  1. Ensemble method using real images, metadata and synthetic images for control of class imbalance in classification
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image Artificial Life and Robotics
        Artificial Life and Robotics  Volume 27, Issue 4
        Nov 2022
        272 pages

        Publisher

        Springer-Verlag

        Berlin, Heidelberg

        Publication History

        Published: 01 November 2022
        Accepted: 05 July 2022
        Received: 24 March 2022

        Author Tags

        1. Image classification
        2. Patient metadata
        3. Chest X-rays
        4. Pneumonia detection
        5. Imbalance data
        6. Image synthesis

        Qualifiers

        • Research-article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 0
          Total Downloads
        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 16 Dec 2024

        Other Metrics

        Citations

        View Options

        View options

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media