Computer Science > Hardware Architecture

arXiv:2205.05826 (cs)

[Submitted on 12 May 2022 (v1), last revised 9 Jan 2023 (this version, v3)]

Title:Sparseloop: An Analytical Approach To Sparse Tensor Accelerator Modeling

Authors:Yannan Nellie Wu, Po-An Tsai, Angshuman Parashar, Vivienne Sze, Joel S. Emer

View PDF

Abstract:In recent years, many accelerators have been proposed to efficiently process sparse tensor algebra applications (e.g., sparse neural networks). However, these proposals are single points in a large and diverse design space. The lack of systematic description and modeling support for these sparse tensor accelerators impedes hardware designers from efficient and effective design space exploration. This paper first presents a unified taxonomy to systematically describe the diverse sparse tensor accelerator design space. Based on the proposed taxonomy, it then introduces Sparseloop, the first fast, accurate, and flexible analytical modeling framework to enable early-stage evaluation and exploration of sparse tensor accelerators. Sparseloop comprehends a large set of architecture specifications, including various dataflows and sparse acceleration features (e.g., elimination of zero-based compute). Using these specifications, Sparseloop evaluates a design's processing speed and energy efficiency while accounting for data movement and compute incurred by the employed dataflow as well as the savings and overhead introduced by the sparse acceleration features using stochastic tensor density models. Across representative accelerators and workloads, Sparseloop achieves over 2000 times faster modeling speed than cycle-level simulations, maintains relative performance trends, and achieves 0.1% to 8% average error. With a case study, we demonstrate Sparseloop's ability to help reveal important insights for designing sparse tensor accelerators (e.g., it is important to co-design orthogonal design aspects).

Comments:	Update website link, update UOP format description
Subjects:	Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as:	arXiv:2205.05826 [cs.AR]
	(or arXiv:2205.05826v3 [cs.AR] for this version)
	https://doi.org/10.48550/arXiv.2205.05826

Submission history

From: Yannan Wu [view email]
[v1] Thu, 12 May 2022 01:28:03 UTC (1,998 KB)
[v2] Thu, 15 Sep 2022 17:47:13 UTC (1,637 KB)
[v3] Mon, 9 Jan 2023 23:38:50 UTC (1,646 KB)

Computer Science > Hardware Architecture

Title:Sparseloop: An Analytical Approach To Sparse Tensor Accelerator Modeling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Hardware Architecture

Title:Sparseloop: An Analytical Approach To Sparse Tensor Accelerator Modeling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators