Computer Science > Machine Learning

arXiv:2405.17897 (cs)

[Submitted on 28 May 2024 (v1), last revised 30 Oct 2024 (this version, v2)]

Title:$C^2M^3$: Cycle-Consistent Multi-Model Merging

Authors:Donato Crisostomi, Marco Fumero, Daniele Baieri, Florian Bernard, Emanuele Rodolà

Abstract:In this paper, we present a novel data-free method for merging neural networks in weight space. Differently from most existing works, our method optimizes for the permutations of network neurons globally across all layers. This allows us to enforce cycle consistency of the permutations when merging $N \geq 3$ models, allowing circular compositions of permutations to be computed without accumulating error along the path. We qualitatively and quantitatively motivate the need for such a constraint, showing its benefits when merging sets of models in scenarios spanning varying architectures and datasets. We finally show that, when coupled with activation renormalization, our approach yields the best results in the task.

Comments:	In Proceedings of the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2405.17897 [cs.LG]
	(or arXiv:2405.17897v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.17897

Submission history

From: Donato Crisostomi [view email]
[v1] Tue, 28 May 2024 07:18:45 UTC (4,370 KB)
[v2] Wed, 30 Oct 2024 07:18:46 UTC (4,476 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2024-05

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:$C^2M^3$: Cycle-Consistent Multi-Model Merging

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:$C^2M^3$: Cycle-Consistent Multi-Model Merging

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators