Computer Science > Machine Learning

arXiv:2301.12309 (cs)

[Submitted on 28 Jan 2023 (v1), last revised 14 Nov 2023 (this version, v4)]

Title:On the Lipschitz Constant of Deep Networks and Double Descent

Authors:Matteo Gamba, Hossein Azizpour, Mårten Björkman

View PDF

Abstract:Existing bounds on the generalization error of deep networks assume some form of smooth or bounded dependence on the input variable, falling short of investigating the mechanisms controlling such factors in practice. In this work, we present an extensive experimental study of the empirical Lipschitz constant of deep networks undergoing double descent, and highlight non-monotonic trends strongly correlating with the test error. Building a connection between parameter-space and input-space gradients for SGD around a critical point, we isolate two important factors -- namely loss landscape curvature and distance of parameters from initialization -- respectively controlling optimization dynamics around a critical point and bounding model function complexity, even beyond the training data. Our study presents novels insights on implicit regularization via overparameterization, and effective model complexity for networks trained in practice.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2301.12309 [cs.LG]
	(or arXiv:2301.12309v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2301.12309

Submission history

From: Matteo Gamba [view email]
[v1] Sat, 28 Jan 2023 23:22:49 UTC (1,812 KB)
[v2] Thu, 16 Feb 2023 03:32:37 UTC (1,906 KB)
[v3] Thu, 27 Apr 2023 13:39:51 UTC (31,837 KB)
[v4] Tue, 14 Nov 2023 15:48:48 UTC (33,125 KB)

Computer Science > Machine Learning

Title:On the Lipschitz Constant of Deep Networks and Double Descent

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On the Lipschitz Constant of Deep Networks and Double Descent

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators