Computer Science > Machine Learning

arXiv:2310.09163 (cs)

[Submitted on 13 Oct 2023 (v1), last revised 10 May 2024 (this version, v2)]

Title: Jointly-Learned Exit and Inference for a Dynamic Neural Network : JEI-DNN

Authors: Florence Regol, Joud Chataoui, Mark Coates

Abstract: Large pretrained models, coupled with fine-tuning, are slowly becoming established as the dominant architecture in machine learning. Even though these models offer impressive performance, their practical application is often limited by the prohibitive amount of resources required for every inference. Early-exiting dynamic neural networks (EDNN) circumvent this issue by allowing a model to make some of its predictions from intermediate layers (i.e., early-exit). Training an EDNN architecture is challenging as it consists of two intertwined components: the gating mechanism (GM) that controls early-exiting decisions and the intermediate inference modules (IMs) that perform inference from intermediate representations. As a result, most existing approaches rely on thresholding confidence metrics for the gating mechanism and strive to improve the underlying backbone network and the inference modules. Although successful, this approach has two fundamental shortcomings: 1) the GMs and the IMs are decoupled during training, leading to a train-test mismatch; and 2) the thresholding gating mechanism introduces a positive bias into the predictive probabilities, making it difficult to readily extract uncertainty information. We propose a novel architecture that connects these two modules. This leads to significant performance improvements on classification datasets and enables better uncertainty characterization capabilities.
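
For readers unfamiliar with early exiting, the confidence-thresholding gate that the abstract attributes to most existing approaches can be sketched as follows. This is only an illustration of that baseline, not the JEI-DNN method proposed in the paper. The function names (`softmax`, `early_exit_predict`), the use of maximum softmax probability as the confidence metric, and the default threshold of 0.9 are all assumptions made for the example.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over a 1-D array of logits."""
    z = logits - np.max(logits)
    e = np.exp(z)
    return e / e.sum()


def early_exit_predict(exit_logits, threshold=0.9):
    """Hypothetical threshold gate over a stack of inference modules.

    exit_logits: list of 1-D logit vectors, one per inference module,
                 ordered from the shallowest exit to the final layer.
    Returns (predicted_class, exit_index, confidence).
    """
    if len(exit_logits) == 0:
        raise ValueError("at least one exit is required")
    last = len(exit_logits) - 1
    for i, logits in enumerate(exit_logits):
        probs = softmax(np.asarray(logits, dtype=float))
        conf = float(probs.max())
        # Exit as soon as the confidence clears the threshold;
        # the final exit always answers, whatever its confidence.
        if conf >= threshold or i == last:
            return int(probs.argmax()), i, conf


# Three exits for one sample: the first is unsure, the second is confident.
sample = [[0.1, 0.2, 0.0], [5.0, 0.0, 0.0], [0.0, 0.0, 9.0]]
pred, exit_idx, conf = early_exit_predict(sample, threshold=0.9)
print(pred, exit_idx, round(conf, 3))
```

In a real network the later exits would not be computed once an earlier one fires, which is where the savings come from. Note also that the gate here is a fixed rule applied after the inference modules are trained, and that a sample only leaves early when its reported confidence is already high. These are the two properties the abstract identifies as shortcomings of this approach.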

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2310.09163 [cs.LG]
	(or arXiv:2310.09163v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.09163

Submission history

From: Florence Regol
[v1] Fri, 13 Oct 2023 14:56:38 UTC (1,211 KB)
[v2] Fri, 10 May 2024 08:43:52 UTC (2,500 KB)
