Research article | Public Access

Tensor-matrix products with a compressed sparse tensor

Published: 15 November 2015 | DOI: 10.1145/2833179.2833183

Abstract

The Canonical Polyadic Decomposition (CPD) of tensors is a powerful tool for analyzing multi-way data and is used extensively to analyze very large and extremely sparse datasets. The bottleneck of computing the CPD is multiplying a sparse tensor by several dense matrices. Algorithms for tensor-matrix products fall into two classes. The first class saves floating-point operations by storing a compressed tensor for each dimension of the data. These methods are fast but suffer high memory costs. The second class uses a single uncompressed tensor at the cost of additional floating-point operations. In this work, we bridge the gap between the two approaches and introduce the compressed sparse fiber (CSF) data structure for sparse tensors, along with a novel parallel algorithm for tensor-matrix multiplication. CSF offers operation reductions similar to those of existing compressed methods while using only a single tensor structure. We validate our contributions with experiments comparing against state-of-the-art methods on a diverse set of datasets. Our approach uses 58% less memory than the state of the art while achieving 81% of its parallel performance on 16 threads.
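
As a concrete illustration of the kind of computation the abstract describes, the sketch below shows a CSF-like structure for a three-mode tensor and a serial MTTKRP-style tensor-matrix kernel over it. This is a minimal sketch under assumed conventions, not the paper's implementation: the type and field names (csf_t, sids, sptr, fids, fptr, mttkrp_csf) are invented for this example, and the actual algorithm in the paper is parallel and more general.

#include <stdlib.h>

/* A three-mode tensor in compressed-sparse-fiber form: nonzeros are grouped
 * into fibers (fixed mode-0 and mode-1 indices), and fibers are grouped into
 * slices (fixed mode-0 index), so each higher-level index is stored once per
 * group rather than once per nonzero. */
typedef struct {
  int nslices;   /* number of non-empty mode-0 slices */
  int *sids;     /* mode-0 index of each slice (length nslices) */
  int *sptr;     /* slice -> fiber range (length nslices+1) */
  int *fids;     /* mode-1 index of each fiber */
  int *fptr;     /* fiber -> nonzero range (length nfibers+1) */
  int *inds;     /* mode-2 index of each nonzero */
  double *vals;  /* nonzero values */
} csf_t;

/* Mode-0 MTTKRP with rank-r factor matrices stored row-major:
 * M0(i,:) += sum_j M1(j,:) .* sum_k X(i,j,k) * M2(k,:). */
void mttkrp_csf(csf_t const *X, double const *M1, double const *M2,
                double *M0, int r)
{
  double *accum = malloc(r * sizeof(*accum));
  for (int s = 0; s < X->nslices; ++s) {
    double *out = M0 + (size_t)X->sids[s] * r;
    for (int f = X->sptr[s]; f < X->sptr[s + 1]; ++f) {
      /* Accumulate the fiber's nonzeros first, so the multiply by the
       * mode-1 factor row happens once per fiber, not once per nonzero. */
      for (int c = 0; c < r; ++c) accum[c] = 0.0;
      for (int nz = X->fptr[f]; nz < X->fptr[f + 1]; ++nz) {
        double const *row2 = M2 + (size_t)X->inds[nz] * r;
        for (int c = 0; c < r; ++c) accum[c] += X->vals[nz] * row2[c];
      }
      double const *row1 = M1 + (size_t)X->fids[f] * r;
      for (int c = 0; c < r; ++c) out[c] += accum[c] * row1[c];
    }
  }
  free(accum);
}

Factoring the fiber loop this way is where the operation savings of compressed formats come from; per the abstract, CSF keeps that saving while storing one structure instead of one compressed tensor per mode. A parallel version could distribute slices across threads, each with a private accum buffer; since each slice owns a distinct row of M0, mode-0 updates would need no locking.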



    Published In

    IA3 '15: Proceedings of the 5th Workshop on Irregular Applications: Architectures and Algorithms
    November 2015
    79 pages
ISBN: 9781450340014
DOI: 10.1145/2833179
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

Conference: SC15

    Acceptance Rates

IA3 '15: 6 of 24 submissions accepted (25%)
Overall: 18 of 67 submissions accepted (27%)


    Cited By

• STCO: Enhancing Training Efficiency via Structured Sparse Tensor Compilation Optimization. ACM Transactions on Design Automation of Electronic Systems 30(1), 1-22 (2024). DOI: 10.1145/3701033
• Compiler Support for Sparse Tensor Convolutions. Proceedings of the ACM on Programming Languages 8(OOPSLA2), 275-303 (2024). DOI: 10.1145/3689721
• Efficient Low-Memory Implementation of Sparse CNNs Using Encoded Partitioned Hybrid Sparse Format. ACM Transactions on Embedded Computing Systems 23(6), 1-30 (2024). DOI: 10.1145/3687239
• Compilation of Modular and General Sparse Workspaces. Proceedings of the ACM on Programming Languages 8(PLDI), 1213-1238 (2024). DOI: 10.1145/3656426
• Minimum Cost Loop Nests for Contraction of a Sparse Tensor with a Tensor Network. Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures, 169-181 (2024). DOI: 10.1145/3626183.3659985
• Rubick: A Unified Infrastructure for Analyzing, Exploring, and Implementing Spatial Architectures via Dataflow Decomposition. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 43(4), 1177-1190 (2024). DOI: 10.1109/TCAD.2023.3337208
• The Art of Sparsity: Mastering High-Dimensional Tensor Storage. 2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), 439-446 (2024). DOI: 10.1109/IPDPSW63119.2024.00094
• TSTC: Enabling Efficient Training via Structured Sparse Tensor Compilation. 2024 29th Asia and South Pacific Design Automation Conference (ASP-DAC), 884-889 (2024). DOI: 10.1109/ASP-DAC58780.2024.10473981
• Sparse Tensors and Subdivision Methods for Finding the Zero Set of Polynomial Equations. Computer Algebra in Scientific Computing, 236-251 (2024). DOI: 10.1007/978-3-031-69070-9_14
• A Tensor Marshaling Unit for Sparse Tensor Algebra on General-Purpose Processors. Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 1332-1346 (2023). DOI: 10.1145/3613424.3614284
