Computer Science > Machine Learning

arXiv:2402.02362 (cs)

[Submitted on 4 Feb 2024]

Title:Unification of Symmetries Inside Neural Networks: Transformer, Feedforward and Neural ODE

Authors:Koji Hashimoto, Yuji Hirono, Akiyoshi Sannai

Abstract:Understanding the inner workings of neural networks, including transformers, remains one of the most challenging puzzles in machine learning. This study introduces a novel approach by applying the principles of gauge symmetries, a key concept in physics, to neural network architectures. By regarding model functions as physical observables, we find that parametric redundancies of various machine learning models can be interpreted as gauge symmetries. We mathematically formulate the parametric redundancies in neural ODEs, and find that their gauge symmetries are given by spacetime diffeomorphisms, which play a fundamental role in Einstein's theory of gravity. Viewing neural ODEs as a continuum version of feedforward neural networks, we show that the parametric redundancies in feedforward neural networks are indeed lifted to diffeomorphisms in neural ODEs. We further extend our analysis to transformer models, finding natural correspondences with neural ODEs and their gauge symmetries. The concept of gauge symmetries sheds light on the complex behavior of deep learning models through physics and provides us with a unifying perspective for analyzing various machine learning architectures.

Comments:	11 pages, 3 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); High Energy Physics - Theory (hep-th); Computational Physics (physics.comp-ph)
Report number:	KUNS-2992
Cite as:	arXiv:2402.02362 [cs.LG]
	(or arXiv:2402.02362v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.02362

Submission history

From: Koji Hashimoto [view email]
[v1] Sun, 4 Feb 2024 06:11:54 UTC (336 KB)

Computer Science > Machine Learning

Title:Unification of Symmetries Inside Neural Networks: Transformer, Feedforward and Neural ODE

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Unification of Symmetries Inside Neural Networks: Transformer, Feedforward and Neural ODE

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators