Abstract
This study explores the worldwide effects of lung cancer and its early detection and diagnosis. Artificial intelligence (AI)-based models are widely used by researchers in this field for the early detection of lung cancer, and histopathology images are one of the most common means of diagnosing and detecting the disease. The present work uses a Vision Transformer (ViT)-based model to classify lung cancer from histopathological images. The proposed ViT-based model shows promise in deciphering complex spatial relationships within image data. The model has been validated on a publicly available dataset, LC25000, which contains lung and colon cancer histopathology images with variations in tissue types. The core idea is to freeze the pre-trained ViT layers, use them as a feature extractor, and perform classification with a newly added classification head. The model was tested with various patch sizes during training. The proposed method achieved its best accuracy, 98.84%, when the patch size was set to 16 × 16. Furthermore, the efficiency of the proposed work has been tabulated and compared with existing work.
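To make the role of the patch size concrete, the following is a minimal sketch of how a ViT splits an image into non-overlapping patches and how the patch size changes the number of tokens the transformer sees. It is an illustration only, not the authors' implementation: the 224 × 224 input resolution and the three colour channels are assumptions (the abstract does not state them), and the helper names `patch_grid` and `patchify` are hypothetical.

```python
def patch_grid(image_size, patch_size, channels=3):
    """Return (number of patches, flattened length of each patch).

    Assumes a square image whose side is divisible by the patch size.
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    per_side = image_size // patch_size
    return per_side * per_side, patch_size * patch_size * channels


def patchify(image, patch_size):
    """Split a square single-channel image (list of rows) into flattened
    patches, visiting the patch grid in row-major order."""
    size = len(image)
    if any(len(row) != size for row in image):
        raise ValueError("image must be square")
    if size % patch_size != 0:
        raise ValueError("image side must be divisible by patch_size")
    patches = []
    for top in range(0, size, patch_size):
        for left in range(0, size, patch_size):
            patch = [
                image[r][c]
                for r in range(top, top + patch_size)
                for c in range(left, left + patch_size)
            ]
            patches.append(patch)
    return patches


if __name__ == "__main__":
    # Assumed 224 x 224 RGB input; only the 16 x 16 patch size comes from the abstract.
    for p in (8, 16, 32):
        n, d = patch_grid(224, p)
        print(f"patch {p}x{p}: {n} patches, {d} values per patch")
```

Under these assumptions, a 16 × 16 patch size yields a 14 × 14 grid of 196 patches, each flattened to 768 values before the linear embedding. Smaller patches give longer token sequences and finer spatial detail at a higher attention cost, which is the trade-off a patch-size comparison explores.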
Data availability
The datasets used in the current study are available from the corresponding author upon reasonable request.
Acknowledgements
The authors wish to thank all the authors for preparing the manuscript.
Funding
No funding was received.
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Research involving human participants
This article does not contain any studies involving human participants performed by any of the authors.
Informed consent
Informed consent was obtained from all the participants included in the study.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
See Table 9.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kumar, A., Mehta, R., Reddy, B.R. et al. Vision Transformer Based Effective Model for Early Detection and Classification of Lung Cancer. SN COMPUT. SCI. 5, 839 (2024). https://doi.org/10.1007/s42979-024-03120-9
DOI: https://doi.org/10.1007/s42979-024-03120-9