Computer Science > Machine Learning

arXiv:2311.16863 (cs)

[Submitted on 28 Nov 2023 (v1), last revised 15 Oct 2024 (this version, v3)]

Title:Power Hungry Processing: Watts Driving the Cost of AI Deployment?

Authors:Alexandra Sasha Luccioni, Yacine Jernite, Emma Strubell

Abstract:Recent years have seen a surge in the popularity of commercial AI products based on generative, multi-purpose AI systems promising a unified approach to building machine learning (ML) models into technology. However, this ambition of ``generality'' comes at a steep cost to the environment, given the amount of energy these systems require and the amount of carbon that they emit. In this work, we propose the first systematic comparison of the ongoing inference cost of various categories of ML systems, covering both task-specific (i.e. finetuned models that carry out a single task) and `general-purpose' models, (i.e. those trained for multiple tasks). We measure deployment cost as the amount of energy and carbon required to perform 1,000 inferences on representative benchmark dataset using these models. We find that multi-purpose, generative architectures are orders of magnitude more expensive than task-specific systems for a variety of tasks, even when controlling for the number of model parameters. We conclude with a discussion around the current trend of deploying multi-purpose generative ML systems, and caution that their utility should be more intentionally weighed against increased costs in terms of energy and emissions. All the data from our study can be accessed via an interactive demo to carry out further exploration and analysis.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2311.16863 [cs.LG]
	(or arXiv:2311.16863v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.16863
Journal reference:	ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT '24), June 3--6, 2024, Rio de Janeiro, Brazil
Related DOI:	https://doi.org/10.1145/3630106.3658542

Submission history

From: Alexandra Sasha Luccioni [view email]
[v1] Tue, 28 Nov 2023 15:09:36 UTC (1,693 KB)
[v2] Thu, 23 May 2024 20:15:44 UTC (10,520 KB)
[v3] Tue, 15 Oct 2024 20:54:08 UTC (10,521 KB)

Computer Science > Machine Learning

Title:Power Hungry Processing: Watts Driving the Cost of AI Deployment?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Power Hungry Processing: Watts Driving the Cost of AI Deployment?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators