Computer Science > Machine Learning

arXiv:2303.01486 (cs)

[Submitted on 2 Mar 2023 (v1), last revised 27 Nov 2023 (this version, v4)]

Title:Understanding plasticity in neural networks

Authors:Clare Lyle, Zeyu Zheng, Evgenii Nikishin, Bernardo Avila Pires, Razvan Pascanu, Will Dabney

View PDF

Abstract:Plasticity, the ability of a neural network to quickly change its predictions in response to new information, is essential for the adaptability and robustness of deep reinforcement learning systems. Deep neural networks are known to lose plasticity over the course of training even in relatively simple learning problems, but the mechanisms driving this phenomenon are still poorly understood. This paper conducts a systematic empirical analysis into plasticity loss, with the goal of understanding the phenomenon mechanistically in order to guide the future development of targeted solutions. We find that loss of plasticity is deeply connected to changes in the curvature of the loss landscape, but that it often occurs in the absence of saturated units. Based on this insight, we identify a number of parameterization and optimization design choices which enable networks to better preserve plasticity over the course of training. We validate the utility of these findings on larger-scale RL benchmarks in the Arcade Learning Environment.

Comments:	Accepted to ICML 2023 (oral presentation)
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2303.01486 [cs.LG]
	(or arXiv:2303.01486v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2303.01486

Submission history

From: Clare Lyle [view email]
[v1] Thu, 2 Mar 2023 18:47:51 UTC (7,230 KB)
[v2] Thu, 11 May 2023 19:05:00 UTC (11,115 KB)
[v3] Wed, 2 Aug 2023 03:50:54 UTC (11,118 KB)
[v4] Mon, 27 Nov 2023 16:36:53 UTC (11,118 KB)

Computer Science > Machine Learning

Title:Understanding plasticity in neural networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Understanding plasticity in neural networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators