Vision Transformer Based Effective Model for Early Detection and Classification of Lung Cancer

Arvind Kumar¹,
Ravishankar Mehta²,
B. Ramachandra Reddy¹ &
…
Koushlendra Kumar Singh¹

Abstract

This study explores the worldwide effects of lung cancer and its early detection and diagnosis. Artificial intelligence (AI)-based models are widely used by researchers for the early detection of lung cancer, and histopathology images are a common basis for its diagnosis and detection. The present work uses a Vision Transformer (ViT)-based model to classify lung cancer from histopathological images. The proposed ViT-based model shows promise in deciphering complex spatial relationships within image data. The model has been validated on a publicly available database, the LC25000 dataset, which contains lung and colon cancer histopathology images with variations in tissue types. The main idea is to freeze the pre-trained ViT layers, use them as a feature extractor, and perform classification by adding a new classification head. The model has been tested with various patch sizes during training. The proposed method achieved its best accuracy of 98.84% when the patch size was set to 16 × 16. Furthermore, the performance of the proposed work has been tabulated and compared with existing work.
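To make the role of the patch size concrete, the following is a minimal sketch of how the patch size determines the number of tokens a ViT processes. It is not the authors' code. The function name `vit_patch_layout`, the 224 × 224 RGB input resolution, and the single class token are assumptions for illustration; the abstract states only that several patch sizes were tried and that 16 × 16 gave the best accuracy.

```python
def vit_patch_layout(image_size: int, patch_size: int, channels: int = 3) -> dict:
    """Describe how a square image is split into square, non-overlapping patches.

    Assumptions (not from the source): square input, patch size that divides
    the image size exactly, and one extra class token prepended to the sequence.
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    patches_per_side = image_size // patch_size
    num_patches = patches_per_side * patches_per_side
    return {
        "patches_per_side": patches_per_side,
        "num_patches": num_patches,
        # One class token is added in front of the patch tokens.
        "sequence_length": num_patches + 1,
        # Length of each flattened patch before the linear projection.
        "patch_dim": patch_size * patch_size * channels,
    }


if __name__ == "__main__":
    # Hypothetical sweep over patch sizes for an assumed 224 x 224 input.
    for patch_size in (8, 16, 32):
        print(patch_size, vit_patch_layout(224, patch_size))
```

Under these assumptions, smaller patches give a longer token sequence (and a higher attention cost), while larger patches give fewer, coarser tokens. This is one plausible reason an intermediate patch size could perform best, though the abstract does not give the authors' explanation.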

Data availability

The datasets used in the current study are available from the corresponding author upon reasonable request.

Acknowledgements

The authors wish to thank all the authors for preparing the manuscript.

Funding

No funding was received.

Author information

Authors and Affiliations

Machine Vision and Intelligence Lab, Department of CSE, National Institute of Technology Jamshedpur, Jamshedpur, Jharkhand, 831014, India
Arvind Kumar, B. Ramachandra Reddy & Koushlendra Kumar Singh
Indian Institute of Information Technology Bhagalpur, Bhagalpur, Bihar, 813210, India
Ravishankar Mehta

Corresponding author

Correspondence to Arvind Kumar.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Research involving human participants

This article does not contain any studies involving human participants performed by any of the authors.

Informed consent

Informed consent was obtained from all the participants included in the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

See Table 9.

Table 9 Abbreviations used in the manuscript and their full forms

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Kumar, A., Mehta, R., Reddy, B.R. et al. Vision Transformer Based Effective Model for Early Detection and Classification of Lung Cancer. SN COMPUT. SCI. 5, 839 (2024). https://doi.org/10.1007/s42979-024-03120-9

Received: 08 March 2024
Accepted: 07 July 2024
Published: 29 August 2024
DOI: https://doi.org/10.1007/s42979-024-03120-9
