Abstract
Nowadays, the most frequent cancer in women is breast cancer (malignant tumor). If breast cancer is detected at the beginning stage, it can often be cured. Many researchers proposed numerous methods for early prediction of this Cancer. In this paper, we proposed feature ensemble learning based on Sparse Autoencoders and Softmax Regression for classification of Breast Cancer into benign (non-cancerous) and malignant (cancerous). We used Breast Cancer Wisconsin (Diagnostic) medical data sets from the UCI machine learning repository. The proposed method is assessed using various performance indices like true classification accuracy, specificity, sensitivity, recall, precision, f measure, and MCC. Simulation and result proved that the proposed approach gives better results in terms of different parameters. The prediction results obtained by the proposed approach were very promising (98.60% true accuracy). In addition, the proposed method outperforms the Stacked Sparse Autoencoders and Softmax Regression based (SSAE-SM) model and other State-of-the-art classifiers in terms of various performance indices. Experimental simulations, empirical results, and statistical analyses are also showing that the proposed model is an efficient and beneficial model for classification of Breast Cancer. It is also comparable with the existing machine learning and soft computing approaches present in the related literature.
Similar content being viewed by others
References
Breast Cancer Awareness Month. National Health Portal Of India, 10 Apr. 2015, (www.nhp.gov.in/breast-cancer-awareness-month_pg)
Hashemi, S.H.B., Karimi, S., and Mahboobi, H., Lifestyle changes for prevention of breast cancer. Electron. Physician 6(3):894–905, 2014.
Yu, Y.H., Wei, W., and Liu, J.L., Diagnostic value of fine-needle aspiration biopsy for breast mass: A systematic review and meta-analysis. BMC Cancer 12:41, 2012. https://doi.org/10.1186/1471-2407-12-41.
Madubogwu, C.I., Ukah, C.O., Onyiaorah, I.V., Anyiam, D.C.D., Anyanwu, S.N.C., and Chianakwana, G.U.: Cost effectiveness of fine needle aspiration cytology for breast masses. Orient J. Med. 27(1–2), 2015
Iranpour, M., Almassi, S., and Analoui, M.: Breast cancer detection from FNA using SVM and RBF classifier. In: First Joint Congress on Fuzzy and Intelligent Systems, Ferdowsi University of Mashhad, Iran, 29–31 Aug 2007, 2007.
Mert, A., et al.: Breast cancer detection with reduced feature set computational and mathematical methods in medicine Volume 2015. https://doi.org/10.1155/2015/265138
Luo, Z., Wu, X., Guo, S., Ye, B., Guo, S., and Ye, B.: Diagnosis of breast cancer tumor based on manifold learning and support vector machine. In: IEEE International Conference on Information and Automation June 20–23, 2008, Zhangjiajie, China, 2008.
Muthu Rama Krishnan, M., Banerjee, S., Chakraborty, C., Chakraborty, C., and Ray, A.K., Statistical analysis of mammographic features and its classification using support vector machine. Expert Syst. Appl. 37:470–478, 2010.
Zheng, B., Yoon, S.W., and Lam, S.S., Breast cancer diagnosis based on feature extraction using a hybrid of K-means and support vector machine algorithms. Expert Syst. Appl. 41(4):1476–1482, 2014. https://doi.org/10.1016/j.eswa.2013.08.044.
Bamakan, S.M.H., and Gholami, P., A novel feature selection method based on an integrated data envelopment analysis and entropy model. Prog. Comput. Sci. 31:632–638, 2014.
Guo, H., and Nandi, A.K.: Breast cancer diagnosis using genetic programming generated feature. In: 2005 IEEE Workshop on Machine Learning for Signal Processing, Mystic, CT. https://doi.org/10.1109/MLSP.2005.1532902, pp. 215–220, 2005.
Prasad, Y., Biswas, K.K., and Jain, C.K.: SVM classifier based feature selection using GA, ACO and PSO for siRNA design. In: Tan, Y., Shi, Y., and Tan, K.C (Eds.) , Advances in Swarm Intelligence. ICSI 2010. Lecture Notes in Computer Science, Vol. 6146. Springer, Berlin, 2010.
Maldonado, S., Weber, R., and Basak, J., Simultaneous feature selection and classification using kernel-penalized support vector machines. Inf. Sci. 181:115–128, 2011.
Jafari-Marandi, R., Davarzani, S., Gharibdousti, M.S., and Smith, B.K., An optimum ANN-based breast cancer diagnosis: Bridging gaps between ANN learning and decision-making goals. Appl. Soft Comput. 72:108–120, 2018.
Naga RamaDevi, G., Usha Rani, K., and Lavanya, D.: Ensemble-based hybrid approach for breast cancer data. In: International Conference on Communications and Cyber Physical Engineering 2018. Springer, Singapore, 2018.
Wang, H., Zheng, B., Yoon, S.W., and Ko, H.S., A support vector machine-based ensemble algorithm for breast cancer diagnosis. Eur. J. Oper. Res. 267(2):687–699, 2018.
Salama, G.I., Abdelhalim, M.B., and Abd-elghany Zeid, M., Breast cancer diagnosis on three different datasets using multi-classifiers. Int. J. Comput. Inf. Technol. 1(Issue 01):2277–0764, 2012.
Luukka, P., and Leppalampi, T., Similarity classifier with generalized mean applied to medical data. Comput. Biol. Med. 36:1026–1040, 2006.
Lavanya, D., and Usha Rani, K.: Analysis of feature selection with classification: Breast cancer datasets. Indian J. Comput. Sci. Eng. 2(5), 2011. ISSN : 0976-5166
Zhao, J.Y., and Zhang, Z.L.: Fuzzy rough neural network and its application to feature selection. In: The Fourth International Workshop on Advanced Computational Intelligence, Wuhan. pp 684–687, 2011. https://doi.org/10.1109/IWACI.2011.6160094
Mert, A., Kılıç, N., and Akan, A., An improved hybrid feature reduction for increased breast cancer diagnostic performance. Biomed. Eng. Lett. 4(3):285–291, 2014.
Lim, C.K., and Chan, C.S., A weighted inference engine based on interval-valued fuzzy relational theory. Expert Syst. Appl. 42:3410–3419, 2015.
Emami, N., and Pakzad, A., A new knowledge-based system for diagnosis of breast cancer by a combination of affinity propagation clustering and firefly algorithm. J. AI Data Min. 7:59–68, 2018.
Sheikhpour, R., Sarram, M.A., and Sheikhpour, R., Particle swarm optimization for bandwidth determination and feature selection of kernel density estimation based classifiers in diagnosis of breast cancer. Appl. Soft Comput. 40:113–131, 2016.
Liu, N., Qi, E.-S., Xu, M., Gao, B., and Liu, G.-Q., A novel intelligent classification model for breast cancer diagnosis. Inf. Process. Manag. 56(3):609–623, 2019.
Xue, B., Zhang, M., and Browne, W.N.: New fitness functions in binary particle swarm optimisation for feature selection. In: WCCI 2012 IEEE World Congress on Computational Intelligence June, 10–15, 2012 - Brisbane, Australia, 2012.
Xue, B., Zhang, M., and Browne, W.N., Particle swarm optimisation for feature selection in classification: Novel initialisation and updating mechanisms. Appl. Soft Comput. 18:261–276, 2014.
Kim, S., Kavuri, S., and Lee, M.: Deep network with support vector machines. In: Lee, M., Hirose, A., Hou, Z. G., and Kil, R. M. (Eds.) , Neural Information Processing. ICONIP 2013. Lecture Notes in Computer Science, Vol. 8226. Springer, Berlin, 2013.
Abdel-Zaher, A.M., and Eldeib, A.M.: Breast cancer classification using deep belief networks. Expert Syst. Appl., 2015. https://doi.org/10.1016/j.eswa.2015.10.015
Xu, J., et al., Stacked Sparse Autoencoder (SSAE) for nuclei detection on breast cancer histopathology images. IEEE Trans. Med. Imaging 35(1):119–130, 2016. https://doi.org/10.1109/TMI.2015.2458702.
Zhang, Q., et al., Deep learning based classification of breast tumors with shear-wave elastography. Ultrasonics 72:150–157, 2016.
Cano, F., Madabhushi, A., and Cruz-Roa, A.: A comparative analysis of sensitivity of convolutional neural networks for histopathology image classification in breast cancer. In: Proceedings Volume 10975, 14th International Symposium on Medical Information Processing and Analysis; 109750W. Mazatlán, Mexico. https://doi.org/10.1117/12.2511647, 2018.
Ragab, D.A., et al., Breast cancer detection using deep convolutional neural networks and support vector machines. PeerJ 7:e6201, 2018.
Kooi, T., et al., Large scale deep learning for computer aided detection of mammographic lesions. Med. Image Anal. 35:303–312, 2017.
Chougrad, H., Zouaki, H., and Alheyane, O., Deep convolutional neural networks for breast cancer screening. Comput Methods Programs Biomed. 157:19–30, 2018. https://doi.org/10.1016/j.cmpb.2018.01.011. Epub 2018 Jan 11.
Xiao, T., Liu, L., Li, K., Qin, W., Yu, S., and Li, Z., Comparison of transferred deep neural networks in ultrasonic breast masses discrimination. Biomed. Res. Int. 2018:Article ID 4605191, 2018. https://doi.org/10.1155/2018/4605191.
Cruz-Roa, A., et al., Accurate and reproducible invasive breast cancer detection in whole-slide images: A deep learning approach for quantifying tumor extent. Sci. Rep. 7:46450, 2017.
Jadoon, M.M., Zhang, Q., Ul Haq, I., Butt, S., and Jadoon, A.: Three-class mammogram classification based on descriptive CNN features. Biomed. Res. Int. 2017, Article ID 3640901, 11 pages, 2017. https://doi.org/10.1155/2017/3640901
Agrawal, S., Rangnekar, R., Gala, D., Paul, S., and Kalbande, D: Detection of breast cancer from mammograms using a hybrid approach of deep learning and linear classification. In: 2018 International Conference on Smart City and Emerging Technology (ICSCET), Mumbai. https://doi.org/10.1109/ICSCET.2018.8537250, pp. 1–6, 2018.
Liu, K., Kang, G., Zhang, N., and Hou, B., Breast cancer classification based on fully-connected layer first convolutional neural networks. IEEE Access 6:23722–23732, 2018. https://doi.org/10.1109/ACCESS.2018.2817593.
Xiao, Y., Wu, J., Lin, Z., and Zhao, X.: Breast cancer diagnosis using an unsupervised feature extraction algorithm based on deep learning. In: Proceedings of the 37th Chinese Control Conference July 25–27. Wuhan, 2018.
Vijayakumar, K., and Arun, C.: Automated risk identification using NLP in cloud based development environments. J. Ambient Intell. Human Comput., 2017. https://doi.org/10.1007/s12652-017-0503-7
Kadam, V.J., Yadav, S.S., and Jadhav, S.M.: Soft-margin SVM incorporating feature selection using improved elitist GA for arrhythmia classification. In: Abraham, A., Cherukuri, A., Melin, P., and Gandhi, N. (Eds.) , Intelligent Systems Design and Applications. ISDA 2018. Advances in Intelligent Systems and Computing, Vol. 941. Springer, Cham, 2018.
Lu, Y., Zhang, L., Wang, B., and Yang, J.: Feature ensemble learning based on sparse autoencoders for image classification. In: 2014 International Joint Conference on Neural Networks (IJCNN), Beijing, pp. 1739–1745, 2014. https://doi.org/10.1109/IJCNN.2014.6889415.
Kadam, V.J., and Jadhav, S.M.: Feature Ensemble Learning Based on Sparse Autoencoders for Diagnosis of Parkinson’s Disease. In: Iyer, B., Nalbalwar, S., and Pathak, N. (Eds.) , Computing, Communication and Signal Processing. Advances in Intelligent Systems and Computing, Vol. 810. Springer, Singapore, 2019.
Hinton, G.E., and Salakhutdinov, R.R., Reducing the dimensionality of data with neural networks. Science 28(5786):504–507, 2006.
Bengio, Y., and LeCun, Y.: Scaling learning algorithms towards AI. In: Bottou, L., Chapelle, O., DeCoste, D., and Weston, J. (Eds.) Large-Scale Kernel Machines. MIT Press, 2007.
Ranzato, M.A., Poultney, C., Chopra, S., LeCun, Y., Chopra, S., and LeCun, Y.: Efficient learning of sparse representations with an energy-based model. In: Advances in Neural Information Processing Systems 19 (NIPS’06), pp. 1137–1144, MIT Press, 2007.
Baldi, P.: Autoencoders, unsupervised learning, and deep architectures. In: Proceedings of ICML Workshop on Unsupervised and Transfer Learning, 2012.
Ng, A.: CS294A Lecture notes. Sparse autoencoder. pp. 72, 2011. (https://web.stanford.edu/class/cs294a/sparseAutoencoder.pdf)
Hinton, G.E., Osindero, S., and Teh, Y.-W., A fast learning algorithm for deep belief nets. Neural Comput. 18(7):1527–54, 2006.
Hinton, G.E., Learning multiple layers of representation. Trends Cogn. Sci. 11(10):428–34, 2007.
Erhan, D., Bengio, Y., Courville, A., Manzagol, P.-A., Vincent, P., and Bengio, S., Why does unsupervised pre-training help deep learning? J. Mach. Learn. Res. 11:625–660, 2010.
Kuncheva, L.I., Combining Pattern Classifiers, Methods and Algorithms, p. 544. New York: Wiley, 2004.
Kittler, J., Hatef, M., Duin, R.P.W., and Matas, J., On combining classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 20(3):226–239, 1998. https://doi.org/10.1109/34.667881.
Dua, D., and Graff, C., UCI Machine Learning Repository (http://archive.ics.uci.edu/ml). Irvine: University of California, School of Information and Computer Science, 2019.
Wolberg, W.H., Street, W.N., and Mangasarian, O.L., Machine learning techniques to diagnose breast cancer from image-processed nuclear features of fine needle aspirates. Cancer Lett. 77(2–3):163–171, 1994.
Nick Street, W., Wolberg, W.H., and Mangasarian, O.L., Nuclear feature extraction for breast tumor diagnosis. Proc. SPIE 1905:861–871, 1993.
Street, W.N., and Wolberg, W.H., Breast cancer diagnosis and prognosis via linear programming. Oper. Res. 43(4):570–577, 1995.
Pradeep Mohan Kumar, K., Saravanan, M., Thenmozhi, M., and Vijayakumar, K.: Intrusion detection system based on GA-fuzzy classifier for detecting malicious attacks. Concurr. Comput. Pract. Exp. e5242, 2019. https://doi.org/10.1002/cpe.5242
Joseph Manoj, R., Anto Praveena, M.D., and Vijayakumar, K.: An ACO–ANN based feature selection algorithm for big data. Cluster Comput., 2018. https://doi.org/10.1007/s10586-018-2550-z
Miao, D., Gao, C., Zhang, N., and Zhang, Z., Diverse reduct subspaces based co-training for partially labeled data. Int. J. Approx. Reason. 52:1103–1117, 2011.
Peng, L., Chen, W., Zhou, W., Li, F., Yang, J., and Zhang, J: An immune-inspired semi-supervised algorithm for breast cancer diagnosis. Comput. Methods Prog. Biomed., 2016. https://doi.org/10.1016/j.cmpb.2016.07.020
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflict of interest, financial or otherwise. No funding was received for this study.
Research involving human participants and/or animals
This research paper does not contain any studies with human participants or animals performed by any of the authors.
Informed consent
No humans are involved in this research paper.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the Topical Collection on Image & Signal Processing
Rights and permissions
About this article
Cite this article
Kadam, V.J., Jadhav, S.M. & Vijayakumar, K. Breast Cancer Diagnosis Using Feature Ensemble Learning Based on Stacked Sparse Autoencoders and Softmax Regression. J Med Syst 43, 263 (2019). https://doi.org/10.1007/s10916-019-1397-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10916-019-1397-z