Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Clusterer ensemble

Published: 01 March 2006 Publication History

Abstract

Ensemble methods that train multiple learners and then combine their predictions have been shown to be very effective in supervised learning. This paper explores ensemble methods for unsupervised learning. Here, an ensemble comprises multiple clusterers, each of which is trained by k-means algorithm with different initial points. The clusters discovered by different clusterers are aligned, i.e. similar clusters are assigned with the same label, by counting their overlapped data items. Then, four methods are developed to combine the aligned clusterers. Experiments show that clustering performance could be significantly improved by ensemble methods, where utilizing mutual information to select a subset of clusterers for weighted voting is a nice choice. Since the proposed methods work by analyzing the clustering results instead of the internal mechanisms of the component clusterers, they are applicable to diverse kinds of clustering algorithms.

References

[1]
Estivill-Castro, V., Why so many clustering algorithms-a position paper. SIGKDD Explorations. v4 i1. 65-75.
[2]
Dietterich, T.G., Ensemble learning. In: Arbib, M.A. (Ed.), The Handbook of Brain Theory and Neural Networks, MIT Press, Cambridge, MA.
[3]
Hansen, L.K. and Salamon, P., Neural network ensembles. IEEE Transactions on Pattern Analysis and Machine Intelligence. v12 i10. 993-1001.
[4]
F.J. Huang, Z.-H. Zhou, H.-J. Zhang, T. Chen, Pose invariant face recognition, Proceedings of the Fourth IEEE International Conference on Automatic Face and Gesture Recognition, Grenoble, France, 2000, pp. 245-250.
[5]
Drucker, H., Schapire, R. and Simard, P., Improving performance in neural networks using a boosting algorithm. In: Hanson, S.J., Cowan, J.D., Lee Giles, C. (Eds.), Advances in Neural Information Processing Systems, 5. Morgan Kaufmann, San Mateo, CA. pp. 42-49.
[6]
K.J. Cherkauer, Human expert level performance on a scientific image analysis task by a system using combined artificial neural networks, Proceedings of the 13th AAAI Workshop on Integrating Multiple Learned Models for Improving and Scaling Machine Learning Algorithms, Portland, OR, 1996, pp. 15-21.
[7]
Zhou, Z.-H., Jiang, Y., Yang, Y.-B. and Chen, S.-F., Lung cancer cell identification based on artificial neural network ensembles. Artificial Intelligence in Medicine. v24 i1. 25-36.
[8]
Y. Freund, R.E. Schapire, A decision-theoretic generalization of on-line learning and an application to boosting, Proceedings of the second European Conference on Computational Learning Theory, Barcelona, Spain, 1995, pp. 23-37.
[9]
J. MacQueen, Some methods for classification and analysis of multivariate observations, Proceedings of the fifth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, vol. 1, 1967, pp. 281-297.
[10]
A. Strehl, J. Ghosh, R.J. Mooney, Impact of similarity measures on web-page clustering, Proceedings of the AAAI2000 Workshop on AI for Web Search, Austin, TX, 2000, pp. 58-64.
[11]
Zhou, Z.-H., Wu, J. and Tang, W., Ensembling neural networks: many could be better than all. Artificial Intelligence. v137 i1-2. 239-263.
[12]
C. Blake, E. Keogh, C.J. Merz, UCI repository of machine learning databases {http://www.ics.uci.edu/~mlearn/MLRepository.htm}, Department of Information and Computer Science, University of California, Irvine, CA, 1998.
[13]
D.S. Modha, W.S. Spangler, Feature weighting in k-means clustering, Machine Learning, 52 (3) (2003) 217-237.
[14]
Breiman, L., Bagging predictors. Machine Learning. v24 i2. 123-140.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Knowledge-Based Systems
Knowledge-Based Systems  Volume 19, Issue 1
March, 2006
104 pages

Publisher

Elsevier Science Publishers B. V.

Netherlands

Publication History

Published: 01 March 2006

Author Tags

  1. Clustering
  2. Ensemble learning
  3. Machine learning
  4. Selective ensemble
  5. Unsupervised learning

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Clustering Ensemble via Diffusion on Adaptive MultiplexIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.331140936:4(1463-1474)Online publication date: 1-Apr-2024
  • (2023)Partial Clustering EnsembleIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.332191336:5(2096-2109)Online publication date: 4-Oct-2023
  • (2023)Adaptive Ensemble Clustering With Boosting BLS-Based AutoencoderIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.327112035:12(12369-12383)Online publication date: 1-Dec-2023
  • (2023)Adaptive Consensus Clustering for Multiple K-Means Via Base Results RefiningIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.326497035:10(10251-10264)Online publication date: 1-Oct-2023
  • (2023)DFECTS: A Deep Fuzzy Ensemble Clusterer for Time SeriesAlgorithms and Architectures for Parallel Processing10.1007/978-981-97-0834-5_5(61-80)Online publication date: 20-Oct-2023
  • (2022)A multiple hierarchical clustering ensemble algorithm to recognize clusters arbitrarily shapedIntelligent Data Analysis10.3233/IDA-21611226:5(1211-1228)Online publication date: 1-Jan-2022
  • (2022)A Generative Clustering Ensemble Model and Its Application in IoT Data AnalysisWireless Communications & Mobile Computing10.1155/2022/80811772022Online publication date: 1-Jan-2022
  • (2022)Self-paced Adaptive Bipartite Graph Learning for Consensus ClusteringACM Transactions on Knowledge Discovery from Data10.1145/356470117:5(1-35)Online publication date: 27-Sep-2022
  • (2021)A novel clustering ensemble model based on granular computingApplied Intelligence10.1007/s10489-020-01979-851:8(5474-5488)Online publication date: 1-Aug-2021
  • (2020)Cluster ensemble of valid small clustersJournal of Intelligent & Fuzzy Systems10.3233/JIFS-19153039:1(525-542)Online publication date: 17-Jul-2020
  • Show More Cited By

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media