Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Robust projected clustering

Published: 18 March 2008 Publication History

Abstract

Projected clustering partitions a data set into several disjoint clusters, plus outliers, so that each cluster exists in a subspace. Subspace clustering enumerates clusters of objects in all subspaces of a data set, and it tends to produce many overlapping clusters. Such algorithms have been extensively studied for numerical data, but only a few have been proposed for categorical data. Typical drawbacks of existing projected and subspace clustering algorithms for numerical or categorical data are that they rely on parameters whose appropriate values are difficult to set appropriately or that they are unable to identify projected clusters with few relevant attributes. We present P3C, a robust algorithm for projected clustering that can effectively discover projected clusters in the data while minimizing the number of required parameters. P3C does not need the number of projected clusters as input, and can discover, under very general conditions, the true number of projected clusters. P3C is effective in detecting very low-dimensional projected clusters embedded in high dimensional spaces. P3C positions itself between projected and subspace clustering in that it can compute both disjoint or overlapping clusters. P3C is the first projected clustering algorithm for both numerical and categorical data.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Knowledge and Information Systems
Knowledge and Information Systems  Volume 14, Issue 3
March 2008
143 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 18 March 2008

Author Tags

  1. Clustering numerical and categorical data
  2. Projected clustering
  3. Subspace clustering

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Shared Subscribe Hyper Simulation Optimization (SUBHSO) Algorithm for Clustering Big Data – Using Big Databases of Iran Electricity MarketApplied Computer Systems10.2478/acss-2019-000724:1(49-60)Online publication date: 1-May-2019
  • (2019)Connectedness-based subspace clusteringKnowledge and Information Systems10.1007/s10115-018-1181-258:1(9-34)Online publication date: 1-Jan-2019
  • (2017)Local graph based correlation clusteringKnowledge-Based Systems10.5555/3163580.3163642138:C(155-175)Online publication date: 15-Dec-2017
  • (2016)Soft subspace clustering of categorical data with probabilistic distancePattern Recognition10.1016/j.patcog.2015.09.02751:C(322-332)Online publication date: 1-Mar-2016
  • (2015)Fast Dimension-based Partitioning and Merging clustering algorithmApplied Soft Computing10.1016/j.asoc.2015.05.04936:C(143-151)Online publication date: 1-Nov-2015
  • (2015)Clustering categorical data in projected spacesData Mining and Knowledge Discovery10.1007/s10618-013-0336-829:1(3-38)Online publication date: 1-Jan-2015
  • (2014)Semi-supervised projected model-based clusteringData Mining and Knowledge Discovery10.1007/s10618-013-0323-028:4(882-917)Online publication date: 1-Jul-2014
  • (2013)Projective clustering ensemblesData Mining and Knowledge Discovery10.1007/s10618-012-0266-x26:3(452-511)Online publication date: 1-May-2013
  • (2012)Cluster-based instance selection for machine classificationKnowledge and Information Systems10.5555/3225656.322593630:1(113-133)Online publication date: 1-Jan-2012
  • (2012)A weightless neural network-based approach for stream data clusteringProceedings of the 13th international conference on Intelligent Data Engineering and Automated Learning10.1007/978-3-642-32639-4_40(328-335)Online publication date: 29-Aug-2012
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media