Computer Science > Machine Learning

arXiv:2408.10664 (cs)

[Submitted on 20 Aug 2024]

Title:Federated Clustering: An Unsupervised Cluster-Wise Training for Decentralized Data Distributions

Authors:Mirko Nardi, Lorenzo Valerio, Andrea Passarella

Abstract:Federated Learning (FL) is a pivotal approach in decentralized machine learning, especially when data privacy is crucial and direct data sharing is impractical. While FL is typically associated with supervised learning, its potential in unsupervised scenarios is underexplored. This paper introduces a novel unsupervised federated learning methodology designed to identify the complete set of categories (global K) across multiple clients within label-free, non-uniform data distributions, a process known as Federated Clustering. Our approach, Federated Cluster-Wise Refinement (FedCRef), involves clients that collaboratively train models on clusters with similar data distributions. Initially, clients with diverse local data distributions (local K) train models on their clusters to generate compressed data representations. These local models are then shared across the network, enabling clients to compare them through reconstruction error analysis, leading to the formation of federated this http URL these groups, clients collaboratively train a shared model representing each data distribution, while continuously refining their local clusters to enhance data association accuracy. This iterative process allows our system to identify all potential data distributions across the network and develop robust representation models for each. To validate our approach, we compare it with traditional centralized methods, establishing a performance baseline and showcasing the advantages of our distributed solution. We also conduct experiments on the EMNIST and KMNIST datasets, demonstrating FedCRef's ability to refine and align cluster models with actual data distributions, significantly improving data representation precision in unsupervised federated settings.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2408.10664 [cs.LG]
	(or arXiv:2408.10664v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2408.10664

Submission history

From: Mirko Nardi [view email]
[v1] Tue, 20 Aug 2024 09:05:44 UTC (4,735 KB)

Computer Science > Machine Learning

Title:Federated Clustering: An Unsupervised Cluster-Wise Training for Decentralized Data Distributions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Federated Clustering: An Unsupervised Cluster-Wise Training for Decentralized Data Distributions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators