Abstract
In fuzzy clustering ensemble, the quality of fuzzy base clustering has an important influence on the performance of the final clustering result. Due to the performance of fuzzy clustering is affected by the initial parameters and fuzzy factors, it may cause unstable clustering results such as unsatisfactory data affiliation, and large differences with the distribution of the real data set. In addition, how to use fuzzy ensemble information to determine the similarity among samples effectively plays a crucial role in the generation of co-association matrix elements. In view of the above problems, combined with the compactness, separation and overlap in the evaluation index of fuzzy clustering, an optimized fuzzy clustering evaluation index is designed to select high quality fuzzy base clustering members to participate the final fusion. Then, the concept of sample attribution clarity is proposed, and the attribution clarity of each sample in the fuzzy base clustering set is learned actively. For samples with different attribution clarity, different full-link similarity measurement methods between samples are designed to further reduce the uncertainty of samples. Finally, the clustering results are obtained by the agglomerative hierarchical clustering. In order to verify the effectiveness of the proposed method, ten data sets are used to conduct experiments. Experiments show that the results obtained by the proposed method are closer to the real distribution structure of the data set in most experimental dataset, and are not sensitive to the diversity of base clustering members, and have good robustness.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
The experimental data sets in this paper are all from the UCI Machine Learning repository.
References
Xu R, Wunsch D (2005) Survey of clustering algorithms. IEEE Trans Neural Net 16(3):645–678
Saxena A, Prasad M, Gupta A et al (2017) A review of clustering techniques and developments. Neurocomputing 267:664–681
Yuan KH, Xu WH, Li WT et al (2022) An incremental learning mechanism for object classificationbased on progressive fuzzy three-way concept. Inf Sci 584(1):127–147
Zhou P, Wang X, Du L et al (2022) Clustering ensemble via structured hypergraph learning. Inform Fusion 78:171–179
Chen Z, Bagherinia A, Minaei-Bidgoli B et al (2021) Fuzzy clustering ensemble considering cluster dependability. Int J Artif Intell Tools 30(2):2150007
Xu WH, Guo DD, Qian YH et al (2022) Two-way concept-cognitive learning method: a fuzzy-based progressive learning. IEEE Trans Fuzzy Syst. https://doi.org/10.1109/TFUZZ.2022.3216110
Xu WH, Yuan KH, Li WT et al (2023) An emerging fuzzy feature selection method using composite entropy-based uncertainty measure and data distribution. IEEE Trans Emerg Top comp Intel 7(1):76–88
Bagherinia A, Minaei-Bidgoli B, Hossinzadeh M et al (2019) Elite fuzzy clustering ensemble based on clustering diversity and quality measures. Appl Intell 49:1724–1747
Banerjee A, Pujari A, Rani Panigrahi C et al (2021) A new method for weighted ensemble clustering and coupled ensemble selection. Connect Sci 33(3):623–644
Mojarad M, Nejatian S, Parvin H et al (2019) A fuzzy clustering ensemble based on cluster clustering and iterative Fusion of base clusters. Appl Intell 49:2567–2581
Bagherinia A, Minaei-Bidgoli B, Hossinzadeh M et al (2021) Reliability-Based Fuzzy Clustering Ensemble. Fuzzy Sets Syst 413:1–28
Li WJ, Wang ZK, Sun W et al (2022) An ensemble clustering framework based on hierarchical clustering ensemble selection and clusters clustering. Cybern Syst. https://doi.org/10.1080/01969722.2022.2073704
Wang YX, Yuan LP, Garg H et al (2021) Information theoretic weighted fuzzy clustering ensemble. Cmc-Comp Mater Cont 67(1):369–392
Bai L, Liang JY, Guo YK (2018) An ensemble clusterer of multiple fuzzy k-means clusterings to recognize arbitrarily shaped clusters. IEEE Trans Fuzzy Syst 26(6):3524–3533
Rathore P, Bezdek JC, Erfani SM et al (2018) Ensemble fuzzy clustering using cumulative aggregation on random projections. IEEE Trans Fuzzy Syst 26(3):1510–1524
Liu HQ, Zhang Q, Zhao F (2018) Interval fuzzy spectral clustering ensemble algorithm for color image segmentation. J Intel Fuzzy Syst 35(5):5467–5476
Iam-On N, Boongoen T, Garrett S (2010) LCE: a link-based cluster ensemble method for improved gene expression data analysis. Bioinformatics 26(12):1513–1519
Bezdek JC, Ehrlich R, Full W (1984) FCM: The fuzzy c-means clustering algorithm. Comput Geosci 10(2):191–203
Jiang CM, Li ZC, Yao JT (2022) A shadowed set-based three-way clustering ensemble approach. Int J Mach Learn Cyber 13(9):2545–2558
Zhang MM (2022) Weighted clustering ensemble: a review. Pattern Recogn 124:108428
Hu J, Li TR, Luo C et al (2017) Incremental fuzzy cluster ensemble learning based on rough set theory. Knowl-Based Syst 132:144–155
Su P, Shang C, Shen Q. 2014 Link-based pairwise similarity matrix approach for fuzzy c-means clustering ensemble. IEEE International Conference on Fuzzy Systems. IEEE 1538–1544
Wu S, Jiang QS, Hong ZL, et al. 2006 A Novel Fuzzy Cluster Validity Index with New Compositions. Proc. of the 6th World Congress on Intelligent Control and Automation, 5967 -5971.
Tang MH, Yang Y, Zhang WB. 2009 An improved clustering validity function for the fuzzy cmeans algorithm. Proc. of the 4th International Conference on Intelligent Systems and Knowledge Engineering, 209–214.
Chen J M. 2012 The improved partition entropy coefficient. Multimedia and Signal Processing: Second International Conference, CMSP 2012, Shanghai, China. Springer Berlin Heidelberg, 1-7
Rashidi F, Nejatian S, Parvin H et al (2019) Diversity based cluster weighting in cluster ensemble: an information theory approach. Artif Intell Rev 52(2):1341–1368
Xu WH, Guo DD, Mi JS et al (2023) Two-way concept-cognitive learning via concept movement viewpoint. IEEE Trans Neural Net Lear Syst. https://doi.org/10.1109/TNNLS.2023.3235800
Xu WH, Pan YZ, Chen XW et al (2022) a novel dynamic fusion approach using information entropy for interval-valued ordered datasets. IEEE Trans Big Data. https://doi.org/10.1109/TBDATA.2022.3215494
Acknowledgements
This work is supported by National Natural Science Foundation of China (No.62206240), the Shandong Provincial Natural Science Foundation of China (No. ZR2020QF110), and Guangxi Key Laboratory of Hybrid Computation and IC Design Analysis (GUANGXI MINZU UNIVERSITY)(No. GXIC20-04).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Xu, L., Yan, X., Huang, J. et al. A fuzzy clustering ensemble selection based on active full-link similarity. Int. J. Mach. Learn. & Cyber. 14, 4325–4337 (2023). https://doi.org/10.1007/s13042-023-01896-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13042-023-01896-5