Abstract
Safety in patient decision-making is one of the major health care challenges. Computational support in establishing diagnoses and preventing errors will contribute to an enhancement in doctor-patient communication. This work performs a three-dimensional cluster analysis, using k-means algorithm, to identify patterns in a breast cancer database. The methodology proposed can be useful to identify patterns in the database that are normally difficult to be noted by classical methods, such as statistical methods. The three-dimensional cluster approach was explored combining three variables at once. The k-means algorithm is used to recognize the hidden patterns on the database. Sub-clusters are used to separate the benign and malignant tumors inside the global cluster. The results present effective analyses of three different clusters based on different combinations between variables. Thus, health professionals can obtain a better understanding of the properties of different types of tumor, identifying the mined abstract tumor features, through the cluster data analysis.
This work has been supported by FCT – Fundação para a Ciência e a Tecnologia within the R&D Units Projects Scope: UIDB/00319/2020 and UIDB/05757/2020. Filipe Alves is supported by FCT Grant Reference SFRH/BD/143745/2019.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bressan, G.M., Azevedo, B.C.F., Souza, R.M.: A fuzzy approach for diabetes mellitus type 2 classification. Brazil Arch. Biol. Technol. 63, 1–11 (2020). https://doi.org/10.1590/1678-4324-2020180742
Bressan, G.M., Azevedo, B.F.A., Souza, R.M.: Métodos de classificação automática para predição do perfil clínico de pacientes portadores do diabetes mellitus. Revista Brasileira de Biometria 38(2), 257–273 (2020)
Dougherty, G.: Pattern Recognition and Classification: An Introduction, 1 edn. Springer, New York (2013). https://doi.org/10.1007/978-1-4614-5323-9
Dua, D., Graff, C.: UCI - machine learning repository (2017). http://archive.ics.uci.edu/ml. Accessed Mar 2020
Dubey, A.K., Gupta, U., Jain, S.: Analysis of k-means clustering approach on the breast cancer Wisconsin dataset. Int. J. Comput. Assist. Radiol. Surg. 11(11), 2033–2047 (2016)
Evans, R.H.: An analysis of criterion variable reliability in conjoint analysis. Percept. Mot. Skills 82(3), 988–990 (1996)
Fisher, R.A.: Statistical Methods for Research Workers. Hafner, New York (1958)
Gomes, I., Nunes, C.: Analysis of the breast cancer mortality rate in Portugal over a decade: spatiotemporal clustering analysis. Acta Med. Port. 33(5), 305–310 (2020)
Illowsky, B., et al.: Introductory Statistics (2021). https://openstax.org/details/books/introductory-statistics. Accessed 21 Apr 2021
Jain, A.K.: Data clustering: 50 years beyond k-means. Pattern Recogn. Lett. 31(8), 651–666 (2010)
Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis, vol. 344. Wiley, Hoboken (2009)
Liu, Y., Li, Z., Xiong, H., Gao, X., Wu, J.: Understanding of internal clustering validation measures. In: 2010 IEEE International Conference on Data Mining, pp. 911–916. IEEE (2010)
Materials, N.: Ascent of machine learning in medicine. Nat. Mater. 18(407) (2019). https://doi.org/10.1038/s41563-019-0360-1
Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987)
Rustam, Z., Hartini, S.: Classification of breast cancer using fast fuzzy clustering based on kernel. In: 9th Annual Basic Science International Conference 2019. Conference Series: Materials Science and Engineering, vol. 546, pp. 1–9 (2019)
Steinley, D.: K-means clustering: a half-century synthesis. Br. J. Math. Stat. Psychol. 59(1), 1–34 (2006)
Vidman, L., Källberg, D., Rydén, P.: Cluster analysis on high dimensional RNA-seq data with applications to cancer research - an evaluation study. PLoS ONE 14(12), 1–21 (2019)
Xie, J., Liu, R., Luttrell, J., Zhang, C.: Deep learning based analysis of histopathological images of breast cancer. Front. Genet. 10, 80 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Azevedo, B.F., Alves, F., Rocha, A.M.A.C., Pereira, A.I. (2021). Cluster Analysis for Breast Cancer Patterns Identification. In: Pereira, A.I., et al. Optimization, Learning Algorithms and Applications. OL2A 2021. Communications in Computer and Information Science, vol 1488. Springer, Cham. https://doi.org/10.1007/978-3-030-91885-9_37
Download citation
DOI: https://doi.org/10.1007/978-3-030-91885-9_37
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-91884-2
Online ISBN: 978-3-030-91885-9
eBook Packages: Computer ScienceComputer Science (R0)