Abstract
Advances in information technology and its incorporation into health applications mean that a huge amount of medical data is being produced continuously. Consequently, efficient techniques are required to analyse such large datasets and extract meaningful information and knowledge. Disease diagnosis is an important application domain of data mining and resembles anomaly detection, one of the primary tasks of data mining research. In past decades, heart disease has caused the most deaths worldwide. As a result, heart disease diagnosis is a challenge for both the data mining and health care communities. In this paper, co-clustering is introduced as a powerful data analysis tool to diagnose heart disease and extract the underlying patterns in the data. The performance of the proposed method is evaluated on the Cleveland Clinic Foundation Heart Disease dataset against existing clustering-based anomaly detection techniques. Experimental results show not only better accuracy but also meaningful information about the dataset that is helpful for further analysis of heart disease diagnosis.
Copyright information
© 2015 Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Ahmed, M., Mahmood, A.N., Maher, M.J. (2015). Heart Disease Diagnosis Using Co-clustering. In: Jung, J., Badica, C., Kiss, A. (eds) Scalable Information Systems. INFOSCALE 2014. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 139. Springer, Cham. https://doi.org/10.1007/978-3-319-16868-5_6
DOI: https://doi.org/10.1007/978-3-319-16868-5_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16867-8
Online ISBN: 978-3-319-16868-5
eBook Packages: Computer Science, Computer Science (R0)