Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1109/BCGIn.2011.62guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Hybrid Bisect K-Means Clustering Algorithm

Published: 29 July 2011 Publication History

Abstract

In this paper, we present a hybrid clustering algorithm that combines divisive and agglomerative hierarchical clustering algorithm. Our method uses bisect K-means for divisive clustering algorithm and Unweighted Pair Group Method with Arithmetic Mean (UPGMA) for agglomerative clustering algorithm. First, we cluster the document collection using bisect K-means clustering algorithm with the value K', which is greater than the total number of clusters, K. Second, we calculate the centroids of K' clusters obtained from the previous step. Then we apply the UPGMA agglomerative hierarchical algorithm on these centroids for the given value, K. After the UPGMA finds K clusters in these K' centroids, if two centroids ended up in the same cluster, then all of their documents will belong to the same cluster. We compared the goodness of clusters generated by bisect K-means and the proposed hybrid algorithms, measured on various cluster evaluation metrics. Our experimental results shows that the proposed method outperforms the standard bisect K-means algorithm.

Cited By

View all
  • (2019)Understanding the Predictability of Smartwatch UsageThe 5th ACM Workshop on Wearable Systems and Applications10.1145/3325424.3329661(11-16)Online publication date: 12-Jun-2019
  • (2016)Using Bisect K-Means Clustering Technique in the Analysis of Arabic DocumentsACM Transactions on Asian and Low-Resource Language Information Processing10.1145/281280915:3(1-13)Online publication date: 28-Jan-2016
  • (2015)AvalancheProceedings of the 11th International Conference on Machine Learning and Data Mining in Pattern Recognition - Volume 916610.1007/978-3-319-21024-7_20(296-310)Online publication date: 20-Jul-2015

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
BCGIN '11: Proceedings of the 2011 International Conference on Business Computing and Global Informatization
July 2011
660 pages
ISBN:9780769544649

Publisher

IEEE Computer Society

United States

Publication History

Published: 29 July 2011

Author Tags

  1. Bisect K-means
  2. document clustering
  3. hybrid algorithm

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 20 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Understanding the Predictability of Smartwatch UsageThe 5th ACM Workshop on Wearable Systems and Applications10.1145/3325424.3329661(11-16)Online publication date: 12-Jun-2019
  • (2016)Using Bisect K-Means Clustering Technique in the Analysis of Arabic DocumentsACM Transactions on Asian and Low-Resource Language Information Processing10.1145/281280915:3(1-13)Online publication date: 28-Jan-2016
  • (2015)AvalancheProceedings of the 11th International Conference on Machine Learning and Data Mining in Pattern Recognition - Volume 916610.1007/978-3-319-21024-7_20(296-310)Online publication date: 20-Jul-2015

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media