CLINICAL: Targeted Active Learning for Imbalanced Medical Image Classification

Suraj Kothawade¹³,
Atharv Savarkar¹⁴,
Venkat Iyer¹⁴,
Ganesh Ramakrishnan¹⁴ &
…
Rishabh Iyer¹³

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13559))

Included in the following conference series:

Workshop on Medical Image Learning with Limited and Noisy Data

818 Accesses

Abstract

Training deep learning models on medical datasets that perform well for all classes is a challenging task. It is often the case that a suboptimal performance is obtained on some classes due to the natural class imbalance issue that comes with medical data. An effective way to tackle this problem is by using targeted active learning, where we iteratively add data points that belong to the rare classes, to the training data. However, existing active learning methods are ineffective in targeting rare classes in medical datasets. In this work, we propose Clinical (targeted aCtive Learning for ImbalaNced medICal imAge cLassification) a framework that uses submodular mutual information functions as acquisition functions to mine critical data points from rare classes. We apply our framework to a wide-array of medical imaging datasets on a variety of real-world class imbalance scenarios - namely, binary imbalance and long-tail imbalance. We show that Clinical outperforms the state-of-the-art active learning methods by acquiring a diverse set of data points that belong to the rare classes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

BPaCo: Balanced Parametric Contrastive Learning for Long-Tailed Medical Image Classification

Balanced-MixUp for Highly Imbalanced Medical Image Classification

Evaluation of Active Learning Techniques on Medical Image Classification with Unbalanced Data Distributions

References

Acevedo, A., Merino, A., Alférez, S., Molina, Á., Boldú, L., Rodellar, J.: A dataset of microscopic peripheral blood cell images for development of automatic recognition systems. Data Brief 30 (2020). ISSN 2352-3409
Google Scholar
Arthur, D., Vassilvitskii, S.: k-means++: the advantages of careful seeding. In: SODA 2007: Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 1027–1035. Society for Industrial and Applied Mathematics, Philadelphia (2007)
Google Scholar
Ash, J.T., Zhang, C., Krishnamurthy, A., Langford, J., Agarwal, A.: Deep batch active learning by diverse, uncertain gradient lower bounds. In: ICLR (2020)
Google Scholar
Codella, N., et al.: Skin lesion analysis toward melanoma detection 2018: a challenge hosted by the international skin imaging collaboration (ISIC). arXiv preprint arXiv:1902.03368 (2019)
Fujishige, S.: Submodular Functions and Optimization. Elsevier, Amsterdam (2005)
MATH Google Scholar
Gupta, A., Levin, R.: The online submodular cover problem. In: ACM-SIAM Symposium on Discrete Algorithms (2020)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Iyer, R., Khargoankar, N., Bilmes, J., Asnani, H.: Submodular combinatorial information measures with applications in machine learning. arXiv preprint arXiv:2006.15412 (2020)
Iyer, R.K.: Submodular optimization and machine learning: theoretical results, unifying and scalable algorithms, and applications. Ph.D. thesis (2015)
Google Scholar
Kaggle: Aptos 2019 blindness detection (2019). https://www.kaggle.com/c/aptos2019- blindness-detection/data
Kather, J.N., et al.: Predicting survival from colorectal cancer histology slides using deep learning: a retrospective multicenter study. PLoS Med. 16(1), e1002730 (2019)
Article Google Scholar
Kermany, D.S., et al.: Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell 172(5), 1122–1131 (2018)
Article Google Scholar
Killamsetty, K., Durga, S., Ramakrishnan, G., De, A., Iyer, R.: Grad-match: gradient matching based data subset selection for efficient deep model training. In: International Conference on Machine Learning, pp. 5464–5474. PMLR (2021)
Google Scholar
Killamsetty, K., Sivasubramanian, D., Ramakrishnan, G., Iyer, R.: Glister: generalization based data subset selection for efficient and robust learning. In: AAAI (2021)
Google Scholar
Kirsch, A., Van Amersfoort, J., Gal, Y.: Batchbald: efficient and diverse batch acquisition for deep Bayesian active learning. arXiv preprint arXiv:1906.08158 (2019)
Kothawade, S., Beck, N., Killamsetty, K., Iyer, R.: Similar: submodular information measures based active learning in realistic scenarios. In: Advances in Neural Information Processing Systems, vol. 34 (2021)
Google Scholar
Kothawade, S., Ghosh, S., Shekhar, S., Xiang, Y., Iyer, R.: Talisman: targeted active learning for object detection with rare classes and slices using submodular mutual information. arXiv preprint arXiv:2112.00166 (2021)
Kothawade, S., Kaushal, V., Ramakrishnan, G., Bilmes, J., Iyer, R.: Prism: a rich class of parameterized submodular information measures for guided subset selection. arXiv preprint arXiv:2103.00128 (2021)
Kothyari, M., Mekala, A.R., Iyer, R., Ramakrishnan, G., Jyothi, P.: Personalizing ASR with limited data using targeted subset selection. arXiv preprint arXiv:2110.04908 (2021)
Li, J., Li, L., Li, T.: Multi-document summarization via submodularity. Appl. Intell. 37(3), 420–430 (2012)
Article Google Scholar
Lin, H.: Submodularity in natural language processing: algorithms and applications. Ph.D. thesis (2012)
Google Scholar
Loshchilov, I., Hutter, F.: SGDR: stochastic gradient descent with warm restarts. arXiv preprint arXiv:1608.03983 (2016)
Mirzasoleiman, B., Badanidiyuru, A., Karbasi, A., Vondrák, J., Krause, A.: Lazier than lazy greedy. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 29 (2015)
Google Scholar
Roth, D., Small, K.: Margin-based active learning for structured output spaces. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 413–424. Springer, Heidelberg (2006). https://doi.org/10.1007/11871842_40
Chapter Google Scholar
Sener, O., Savarese, S.: Active learning for convolutional neural networks: a core-set approach. In: International Conference on Learning Representations (2018)
Google Scholar
Settles, B.: Active learning literature survey. Technical report, University of Wisconsin-Madison, Department of Computer Sciences (2009)
Google Scholar
Vasudevan, A.B., Gygli, M., Volokitin, A., Van Gool, L.: Query-adaptive video summarization via quality-aware relevance estimation. In: Proceedings of the 25th ACM International Conference on Multimedia, pp. 582–590 (2017)
Google Scholar
Wang, D., Shang, Y.: A new active labeling method for deep learning. In: 2014 International Joint Conference on Neural Networks (IJCNN), pp. 112–119. IEEE (2014)
Google Scholar
Yang, J., et al.: MedMNIST v2: A large-scale lightweight benchmark for 2D and 3D biomedical image classification. arXiv preprint arXiv:2008 (2021)

Download references

Author information

Authors and Affiliations

University of Texas at Dallas, Richardson, USA
Suraj Kothawade & Rishabh Iyer
Indian Institute of Technology, Bombay, Mumbai, India
Atharv Savarkar, Venkat Iyer & Ganesh Ramakrishnan

Authors

Suraj Kothawade
View author publications
You can also search for this author in PubMed Google Scholar
Atharv Savarkar
View author publications
You can also search for this author in PubMed Google Scholar
Venkat Iyer
View author publications
You can also search for this author in PubMed Google Scholar
Ganesh Ramakrishnan
View author publications
You can also search for this author in PubMed Google Scholar
Rishabh Iyer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Suraj Kothawade .

Editor information

Editors and Affiliations

National Institutes of Health, Bethesda, MD, USA
Ghada Zamzmi
National Institutes of Health, Bethesda, MD, USA
Sameer Antani
Northwestern University, Chicago, IL, USA
Ulas Bagci
Children's National Hospital, Washington, WA, USA
Marius George Linguraru
National Institutes of Health, Bethesda, MD, USA
Sivaramakrishnan Rajaraman
National Institutes of Health, Bethesda, MD, USA
Zhiyun Xue

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 2147 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kothawade, S., Savarkar, A., Iyer, V., Ramakrishnan, G., Iyer, R. (2022). CLINICAL: Targeted Active Learning for Imbalanced Medical Image Classification. In: Zamzmi, G., Antani, S., Bagci, U., Linguraru, M.G., Rajaraman, S., Xue, Z. (eds) Medical Image Learning with Limited and Noisy Data. MILLanD 2022. Lecture Notes in Computer Science, vol 13559. Springer, Cham. https://doi.org/10.1007/978-3-031-16760-7_12

Download citation

DOI: https://doi.org/10.1007/978-3-031-16760-7_12
Published: 15 September 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-16759-1
Online ISBN: 978-3-031-16760-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)

CLINICAL: Targeted Active Learning for Imbalanced Medical Image Classification

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

BPaCo: Balanced Parametric Contrastive Learning for Long-Tailed Medical Image Classification

Balanced-MixUp for Highly Imbalanced Medical Image Classification

Evaluation of Active Learning Techniques on Medical Image Classification with Unbalanced Data Distributions

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (pdf 2147 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Subscribe and save

Buy Now

Navigation

CLINICAL: Targeted Active Learning for Imbalanced Medical Image Classification

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

BPaCo: Balanced Parametric Contrastive Learning for Long-Tailed Medical Image Classification

Balanced-MixUp for Highly Imbalanced Medical Image Classification

Evaluation of Active Learning Techniques on Medical Image Classification with Unbalanced Data Distributions

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (pdf 2147 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation