Computer Science > Machine Learning

arXiv:2206.00050 (cs)

[Submitted on 31 May 2022 (v1), last revised 19 Dec 2022 (this version, v4)]

Title:FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation

Authors:Mehmet Ozgur Turkoglu, Alexander Becker, Hüseyin Anil Gündüz, Mina Rezaei, Bernd Bischl, Rodrigo Caye Daudt, Stefano D'Aronco, Jan Dirk Wegner, Konrad Schindler

View PDF

Abstract:The ability to estimate epistemic uncertainty is often crucial when deploying machine learning in the real world, but modern methods often produce overconfident, uncalibrated uncertainty predictions. A common approach to quantify epistemic uncertainty, usable across a wide class of prediction models, is to train a model ensemble. In a naive implementation, the ensemble approach has high computational cost and high memory demand. This challenges in particular modern deep learning, where even a single deep network is already demanding in terms of compute and memory, and has given rise to a number of attempts to emulate the model ensemble without actually instantiating separate ensemble members. We introduce FiLM-Ensemble, a deep, implicit ensemble method based on the concept of Feature-wise Linear Modulation (FiLM). That technique was originally developed for multi-task learning, with the aim of decoupling different tasks. We show that the idea can be extended to uncertainty quantification: by modulating the network activations of a single deep network with FiLM, one obtains a model ensemble with high diversity, and consequently well-calibrated estimates of epistemic uncertainty, with low computational overhead in comparison. Empirically, FiLM-Ensemble outperforms other implicit ensemble methods, and it and comes very close to the upper bound of an explicit ensemble of networks (sometimes even beating it), at a fraction of the memory cost.

Comments:	accepted at NeurIPS 2022
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2206.00050 [cs.LG]
	(or arXiv:2206.00050v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.00050

Submission history

From: Mehmet Ozgur Turkoglu [view email]
[v1] Tue, 31 May 2022 18:33:15 UTC (449 KB)
[v2] Sat, 15 Oct 2022 13:34:44 UTC (285 KB)
[v3] Wed, 2 Nov 2022 13:59:18 UTC (285 KB)
[v4] Mon, 19 Dec 2022 07:53:27 UTC (286 KB)

Computer Science > Machine Learning

Title:FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators