IOS Press Ebooks - The Larger the Better: Analysis of a Scalable Spectral Clustering Algorithm with Cosine Similarity

loading subjects...

The Larger the Better: Analysis of a Scalable Spectral Clustering Algorithm with Cosine Similarity

Authors

Guangliang Chen

Pages

488 - 495

DOI

10.3233/FAIA210280

Category

Research Article

Series

Frontiers in Artificial Intelligence and Applications

Ebook

Volume 341: Modern Management based on Big Data II and Machine Learning and Intelligent Systems III

Abstract

Chen (2018) proposed a scalable spectral clustering algorithm for cosine similarity to handle the task of clustering large data sets. It runs extremely fast, with a linear complexity in the size of the data, and achieves state of the art accuracy. This paper conducts perturbation analysis of the algorithm to understand the effect of discarding a perturbation term in an eigendecomposition step. Our results show that the accuracy of the approximation by the scalable algorithm depends on the connectivity of the clusters, their separation and sizes, and is especially accurate for large data sets.

Contact

IOS Press Copyright 2024

Contact

IOS Press Copyright 2024

This website uses cookies

This website uses cookies