Nothing Special   »   [go: up one dir, main page]

Skip to main content
Log in

Vision Transformer Based Effective Model for Early Detection and Classification of Lung Cancer

  • Original Research
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

This study explores the worldwide effects of lung cancer and its early detection and diagnosis. Artificial intelligence (AI)-based models are quite popular among researchers in this field for early detection of lung cancer. Histopathology images are one of the popular means for diagnosis and detection of lung cancer. The present work encompasses the Vision Transformer (ViT) based model to classify lung cancer based on histopathological images. The proposed ViT-based model has been shown to have a promising impact in deciphering complex spatial relationships within image data. The proposed model has been validated by a publically available database, namely the LC25000 dataset, which contains lung and colon cancer histopathology images with variations in tissue types. The primary intuition behind using this ViT is to freeze the pre-trained ViT layers as a feature extractor and perform classification tasks by adding a new classification head. The model has been tested with various patch sizes during the model training. The proposed method achieved the best accuracy of 98.84% when the patch size was set to 16 × 16. Furthermore, the efficiency of the proposed work has been tabulated and compared work with existing work.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21

Similar content being viewed by others

Data availability

The datasets used in the current study are available from the corresponding author upon reasonable request.

References

  1. Ferlay J, Ervik M, Lam F, Colombet M, Mery L, Piñeros M, Bray F. Global cancer observatory: cancer today. Lyon: International Agency for Research on Cancer; 2021.

  2. https://www.who.int/news-room/fact-sheets/detail/cancer. Accessed June 2021.

  3. Molina JR, Yang P, Cassivi SD, Schild SE, Adjei AA. Non-small cell lung cancer: epidemiology, risk factors, treatment, and survivorship. Mayo Clin Proc. 2008;83(5):584–94. https://doi.org/10.4065/83.5.584.

    Article  Google Scholar 

  4. El-Regaily SA, Salem MA, Abdel Aziz MH, Roushdy MI. Survey of computer-aided detection systems for lung cancer in computed tomography. Curr Med Imaging. 2018;14(1):3–18.

    Article  Google Scholar 

  5. Carion N, Massa F, Synnaeve G, Usunier N, Kirillov A, Zagoruyko S. End-to-end object detection with transformers. 2020. arXiv:2005.12872.

  6. Zhu X, Su W, Lu L, Li B, Wang X, Dai J. Deformable DETR: deformable transformers for end-to-end object detection. 2020. arXiv:2010.04159.

  7. Touvron H, Cord M, Douze M, Massa F, Sablayrolles A, Jégou H. Training data-efficient image transformers & distillation through attention. 2020. arXiv:2012.12877.

  8. Ye l, Rochan M, Liu Z, Wang Y. Cross-modal self-attention network for referring image segmentation. In: 2019 IEEE/CVF conference on computer vision and pattern recognition (CVPR), Long Beach, CA, USA. 2019. p. 10494–503.

  9. Yu KH, Zhang C, Berry GJ, Altman RB, Ré C, Rubin DL, Snyder M. Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features. Nat Commun. 2016;7:12474.

    Article  Google Scholar 

  10. Cancer [online]. 2020. https://www.who.int/news-room/fact-sheets/detail/cancer.

  11. Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, Thrun S. Dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017;542(7639):115–8.

    Article  Google Scholar 

  12. Yamashita R, Nishio M, Do RKG, Togashi K. Convolutional neural networks: an overview and application in radiology. Insights Imaging. 2018;9(4):611–29.

    Article  Google Scholar 

  13. Ott M, Edunov S, Grangier D, Auli M. Scaling neural machine translation. In: Bojar O, Chatterjee R, Federmann C, et al., editors. Proceedings of the third conference on machine translation: research papers. Brussels: Association for Computational Linguistics; 2018.

  14. Fogel AL, Kvedar JC. Artificial intelligence powers digital medicine. NPJ Digit Med. 2018;1(1):5.

    Article  Google Scholar 

  15. Li H, et al. A generalized framework of feature learning enhanced convolutional neural network for pathology-image-oriented cancer diagnosis. Comput Biol Med. 2022;151:106265.

    Article  Google Scholar 

  16. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I. Attention is all you need. In: Guyon I, Von Luxburg U, Bengio S, Wallach H, Fergus R, Vishwanathan S, Garnett R, editors. Advances in neural information processing systems. Curran Associates Inc.; 2017.

    Google Scholar 

  17. Gabralla LA, et al. Automated diagnosis for colon cancer diseases using stacking transformer models and explainable artificial intelligence. Diagnostics. 2023;13(18):2939.

    Article  Google Scholar 

  18. Guo F-M, Fan Y. Zero-shot and few-shot learning for lung cancer multi-label classification using vision transformer. 2022. arXiv preprint arXiv:2205.15290.

  19. Zhang J, et al. Hepatocellular carcinoma histopathological images grading with a novel attention-sharing hybrid network based on multi-feature fusion. Biomed Signal Process Control. 2023;86:105126.

    Article  Google Scholar 

  20. Sun R, Pang Y, Li W. Efficient lung cancer image classification and segmentation algorithm based on an improved swin transformer. Electronics. 2023;12(4):1024.

    Article  Google Scholar 

  21. Singh O, Singh KK. An approach to classify lung and colon cancer of histopathology images using deep feature extraction and an ensemble method. Int J Inf Technol. 2023;15(8):4149–60.

    Google Scholar 

  22. Mehta R, et al. A vision transformer-based automated human identification using ear biometrics. J Inf Security Appl. 2023;78:103599.

    Google Scholar 

  23. Chen C, et al. Identifying primary tumor site of origin for liver metastases via a combination of handcrafted and deep learning features. J Pathol Clin Res. 2024. https://doi.org/10.1002/cjp2.344.

    Article  Google Scholar 

  24. Chhillar I, Singh A. A feature engineering-based machine learning technique to detect and classify lung and colon cancer from histopathological images. Med Biol Eng Comput. 2023: 1–12.

  25. Mehta R, Singh KK. Ear recognition system using averaging ensemble technique. In: Machine learning, image processing, network security and data sciences: 4th international conference, MIND 2022, Virtual Event, January 19–20, 2023, Proceedings, Part II. 2023. p. 220–9.

  26. Malaviya N, et al. Lvit: vision transformer for lung cancer detection. In: 2023 international conference on artificial intelligence and smart communication (AISC). New York: IEEE; 2023.

  27. Abimouloud ML, et al. Vision transformer-convolution for breast cancer classification using mammography images: a comparative study. Int J Hybrid Intell Syst. 1–17 (Preprint).

  28. Thakur SK, Singh DP, Choudhary J. Lung cancer identification: a review on detection and classification. Cancer Metastasis Rev. 2020;39:989–98.

    Article  Google Scholar 

  29. Mehta R, Ujjwal G, Shilpa SJ, Vityazev S, Singh KK. Rotation invariant 2D ear recognition using Gabor filters and ensemble of pre-trained deep convolutional neural network model. In: 2023 25th international conference on digital signal processing and its applications (DSPA). IEEE; 2023. p. 1–6.

  30. Mehta R, Singh KK. Ensemble of transfer learning and lightweight convolutional neural network model for an effective ear recognition system. Evol Syst. 2023;15(1):115–31.

    Article  Google Scholar 

  31. Mehta R, Singh KK. An efficient ear recognition technique based on deep ensemble learning approach. Evol Syst. 2024;15(3):771–87.

    Article  Google Scholar 

  32. Chaudhari S, Polatkan G, Ramanath R, Mithal V. An attentive survey of attention models. 2019. arXiv:1904.02874.

  33. Correia AS, Colombini EL. Attention, please! A survey of neural attention models in deep learning. 2021. arXiv:2103.16775.

  34. Bengio Y, Goodfellow I, Courville A. Deep learning. MIT Press; 2017.

  35. Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9(8):1735–80.

    Article  Google Scholar 

  36. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521:436–44.

    Article  Google Scholar 

  37. Devlin J, Chang M-W, Lee K, Toutanova K. BERT: Pre-training of deep bidirectional transformers for language understanding. 2018. arXiv:1810.04805.

  38. Singh O, Kashyap KL, Singh KK. Mesh-free technique for enhancement of the lung CT image. Biomed Signal Process Control. 2023;81:104452.

    Article  Google Scholar 

  39. Fedus W, Zoph B, Shazeer N. Switch transformers: scaling to trillion parameter models with simple and efficient sparsity. n.d. arXiv:2101.03961.

  40. Lepikhin D, Lee HJ, Xu Y, Chen D, Firat O, Huang Y, Krikun K, Shazeer N, Chen Z. Gshard: scaling giant models with conditional computation and automatic sharding. 2020. arXiv:2006.16668.

  41. Devlin J, Chang M-W, Lee K, Toutanova K. BERT: pre-training of deep bidirectional transformers for language understanding. 2018. arXiv:1810.04805.

  42. Zhao X, Xu J, Lin Z, Xue X. BiCFormer: Swin Transformer based model for classification of benign and malignant pulmonary nodules. Meas Sci Technol. 2024;35(7):075402.

    Article  Google Scholar 

  43. Mehta R, Sheikh-Akbari A, Singh KK. A noble approach to 2D ear recognition system using hybrid transfer learning. In: 2023 12th Mediterranean conference on embedded computing (MECO). New York: IEEE; 2023.

  44. Mehta R, Singh KK. An efficient ear recognition technique based on deep ensemble learning approach. Evol Syst. 2023. https://doi.org/10.1007/s12530-023-09505-0.

    Article  Google Scholar 

  45. Mehta R, Singh KK. Ear recognition system using averaging ensemble technique. In: International conference on machine learning, image processing, network security and data sciences. Cham: Springer Nature Switzerland; 2022.

  46. Goswami J, Singh KK. Pulmonary lung cancer classification using deep neural networks. In: Machine vision and augmented intelligence: select proceedings of MAI 2022 1007. 2023; p. 395.

  47. Saikia T, et al. Classification of lung nodules based on transfer learning with K-Nearest Neighbor (KNN). In: 2022 IEEE international conference on imaging systems and techniques (IST). New York: IEEE; 2022.

Download references

Acknowledgements

The authors wish to thank all the authors for preparing the manuscript.

Funding

No funding was received.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Arvind Kumar.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Research involving human participants

This article does not contain any studies involving human participants performed by any of the authors.

Informed consent

Informed consent was obtained from all the participants included in the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

See Table 9.

Table 9 Abbreviation used in the manuscript and their full form

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kumar, A., Mehta, R., Reddy, B.R. et al. Vision Transformer Based Effective Model for Early Detection and Classification of Lung Cancer. SN COMPUT. SCI. 5, 839 (2024). https://doi.org/10.1007/s42979-024-03120-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-024-03120-9

Keywords

Navigation