Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

What’s in a distance? Exploring the interplay between distance measures and internal cluster validity in multi-objective clustering

Published: 22 August 2022 Publication History

Abstract

The problem of cluster analysis eludes a unique mathematical definition. Instead, a variety of different instantiations of the problem can be defined using specific measures of internal cluster validity. In turn, such internal cluster validity measures rely on quantifying dissimilarity between entities. This article explores the interaction between dissimilarity measures and internal cluster validity techniques in the context of multi-objective clustering. It does so by contrasting two conceptually different approaches to multi-objective clustering: the multi-criterion clustering algorithm Δ-MOCK, designed to optimise different measures of internal cluster validity over a single dissimilarity space, and the multi-view clustering algorithm MVMC, designed to optimise a single measure of internal cluster validity over distinct dissimilarity spaces. Our comparison highlights the interchangeable roles of distance functions and measures of internal cluster validity, which paves the way for the future design of a flexible, dual-purpose approach to multi-objective clustering.

References

[1]
Aljalbout E, Golkov V, Siddiqui Y, et al (2018) Clustering with deep learning: taxonomy and new methods. arXiv:1801.07648
[2]
Bayá AE and Granitto PM How many clusters: a validation index for arbitrary-shaped clusters IEEE/ACM Trans Comput Biol Bioinf 2013 10 2 401-14
[3]
de Carvalho F, Lechevallier Y, and de Melo FM Partitioning hard clustering algorithms based on multiple dissimilarity matrices Pattern Recogn 2012 45 1 447-464
[4]
de Carvalho F, Lechevallier Y, Despeyroux T, et al., et al. Zighed F, Abdelkader G, Gilles P, et al., et al. Multi-view clustering on relational data Advances in knowledge discovery and management 2014 Heidelberg Springer 37-51
[5]
Delattre M and Hansen P Bicriterion cluster analysis IEEE Trans Pattern Anal Mach Intell 1980 2 4 277-291
[6]
Garza-Fabre M, Handl J, and Knowles J An improved and more scalable evolutionary approach to multiobjective clustering IEEE Trans Evol Comput 2018 22 4 515-535
[7]
Handl J and Knowles J An evolutionary approach to multiobjective clustering IEEE Trans Evol Comput 2007 11 1 56-76
[8]
Hennig C What are the true clusters? Pattern Recogn Lett 2015 64 53-62
[9]
Hubert L and Arabie P Comparing partitions J Classif 1985 2 1 193-218
[10]
José-García A and Gómez-Flores W Automatic clustering using nature-Inspired metaheuristics: a survey Appl Soft Comput 2016 41 192-213
[11]
José-García A, Handl J (2021) On the interaction between distance functions and clustering criteria in multi-objective clustering. In: International conference on evolutionary multi-criterion optimization, Springer, pp 504–515
[12]
José-García A, Handl J, Gómez-Flores W et al (2019) Many-view clustering: An illustration using multiple dissimilarity measures. In: Press ACM (ed) Genetic and Evolutionary Computation Conference - GECCO ’19. Republic Prague, Czech, pp 213–214
[13]
José-García A, Handl J, Gómez-Flores W, et al. An evolutionary many-objective approach to multiview clustering using feature and relational data Appl Soft Comput 2021 108 1-15
[14]
Kanaan-Izquierdo S, Ziyatdinov A, and Perera-Lluna A Multiview and multifeature spectral clustering using common eigenvectors Pattern Recogn Lett 2018 102 30-36
[15]
MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley symposium on mathematical statistics and probability. University of California Press, pp 281–297
[16]
Mukhopadhyay A, Maulik U, and Bandyopadhyay S A survey of multiobjective evolutionary clustering ACM Comput Surv (CSUR) 2015 47 4 1-46
[17]
Park Y, Song M (1998) A genetic algorithm for clustering problems. In: Proceedings of the Third Annual Conference on Genetic Programming, pp 568–575
[18]
Rousseeuw PJ Silhouettes: a graphical aid to the interpretation and validation of cluster analysis J Comput Appl Math 1987 20 53-65
[19]
Santos JM, de Sá JM (2005) Human clustering on bi-dimensional data: an assessment. Tech. rep, INEB -Instituto de Engenharia Biomedica
[20]
Sun S A survey of multi-view machine learning Neural Comput Appl 2013 23 7–8 2031-2038
[21]
Theodoridis S and Koutrumbas K Pattern recognition 2009 4 Amsterdam Elsevier Inc
[22]
Tibshirani R, Walther G, and Hastie T Estimating the number of clusters in a data set via the gap statistic J R Statist Soc Ser B (Statist Methodol) 2001 63 2 411-423
[23]
Von Luxburg U A tutorial on spectral clustering Stat Comput 2007 17 4 395-416
[24]
Zhang Q and Li H MOEA/D: a multiobjective evolutionary algorithm based on decomposition IEEE Trans Evol Comput 2007 11 6 712-731

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Natural Computing: an international journal
Natural Computing: an international journal  Volume 22, Issue 2
Jun 2023
178 pages

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 22 August 2022
Accepted: 25 July 2022

Author Tags

  1. Clustering
  2. Multi-objective optimisation
  3. Multi-view clustering

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Dec 2024

Other Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media