short-paper

GAIL: the graph algorithm iron law

Authors:

Krste Asanović,

David PattersonAuthors Info & Claims

IA³ '15: Proceedings of the 5th Workshop on Irregular Applications: Architectures and Algorithms

Article No.: 13, Pages 1 - 4

https://doi.org/10.1145/2833179.2833187

Published: 15 November 2015 Publication History

Abstract

As new applications for graph algorithms emerge, there has been a great deal of research interest in improving graph processing. However, it is often difficult to understand how these new contributions improve performance. Execution time, the most commonly reported metric, distinguishes which alternative is the fastest but does not give any insight as to why. A new contribution may have an algorithmic innovation that allows it to examine fewer graph edges. It could also have an implementation optimization that reduces communication. It could even have optimizations that allow it to increase its memory bandwidth utilization. More interestingly, a new innovation may simultaneously affect all three of these factors (algorithmic work, communication volume, and memory bandwidth utilization). We present the Graph Algorithm Iron Law (GAIL) to quantify these tradeoffs to help understand graph algorithm performance.

References

[1]

V. Agarwal, F. Petrini, et al. Scalable graph exploration on multicore processors. SC, 2010.

Digital Library

[2]

S. Beamer, K. Asanović, and D. A. Patterson. The GAP benchmark suite. arXiv:1508.03619, 2015.

[3]

S. Beamer, K. Asanović, and D. A. Patterson. Direction-optimizing breadth-first search. SC, 2012.

Digital Library

[4]

S. Beamer, K. Asanović, and D. A. Patterson. Locality exists in graph processing: Workload characterization on an Ivy Bridge server. International Symposium on Workload Characterization (IISWC), 2015.

Digital Library

[5]

A. Buluç and J. Gilbert. The Combinatorial BLAS: Design, implementation, and applications. International Journal of High Performance Computing Applications (IJHPCA), 25(4):496--509, 2011.

Digital Library

[6]

J. W. Choi, D. Bedard, R. Fowler, and R. Vuduc. A roofline model of energy. In IPDPS, 2013.

Digital Library

[7]

T. Davis and Y. Hu. The University of Florida sparse matrix collection. ACM Transactions on Mathematical Software, 38:1:1--1:25, 2011.

Digital Library

[8]

9th DIMACS implementation challenge. http://www.dis.uniroma1.it/challenge9/, 2006.

[9]

J. Emer and D. Clark. A characterization of processor performance in the VAX-11/780. ISCA, 1984.

Digital Library

[10]

GAP benchmark suite reference code v0.6. https://github.com/sbeamer/gapbs.

[11]

Graph500 benchmark. www.graph500.org.

[12]

O. Green, M. Dukhan, and R. Vuduc. Branch-avoiding graph algorithms. SPAA, 2015.

Digital Library

[13]

Intel performance counter monitor. www.intel.com/software/pcm.

[14]

A. Kyrola, G. Blelloch, and C. Guestrin. GraphChi: Large-scale graph computation on just a PC. Symposium on Operating Systems Design and Implementation, 2012.

Digital Library

[15]

J. Leskovec, D. Chakrabarti, et al. Realistic, mathematically tractable graph generation and evolution, using Kronecker multiplication. European Conference on Principles and Practice of Knowledge Discovery in Databases, 2005.

Digital Library

[16]

D. A. Patterson and J. L. Hennessy. Computer Organization and Design: The Hardware/Software Interface. Morgan Kaufmann, 4th edition, 2008.

Digital Library

[17]

K. D. Underwood, M. Vance, J. Berry, and B. Hendrickson. Analyzing the scalability of graph algorithms on Eldorado. In IPDPS, 2007.

Cited By

Elmougy YHayashi ASarkar V(2024)Asynchronous Distributed Actor-Based Approach to Jaccard Similarity for Genome ComparisonsISC High Performance 2024 Research Paper Proceedings (39th International Conference)10.23919/ISC.2024.10528922(1-11)Online publication date: May-2024
https://doi.org/10.23919/ISC.2024.10528922
Fariborz MSamani MO'Neill TLowe-Power JYoo SAkella V(2022)A Model for Scalable and Balanced Accelerators for Graph ProcessingIEEE Computer Architecture Letters10.1109/LCA.2022.321548921:2(149-152)Online publication date: 1-Jul-2022
https://doi.org/10.1109/LCA.2022.3215489
Vandierendonck HAyguadé EHwu WBadia RHofstee H(2020)GraptorProceedings of the 34th ACM International Conference on Supercomputing10.1145/3392717.3392753(1-13)Online publication date: 29-Jun-2020
https://dl.acm.org/doi/10.1145/3392717.3392753
Show More Cited By

Recommendations

On the Multichromatic Number of s-Stable Kneser Graphs

For positive integers n and s, a subset Sï [n] is s-stable if sï |i-j|ï n-s for distinct i,j∈S . The s-stable r-uniform Kneser hypergraph KGrn,ks-stable is the r-uniform hypergraph that has the collection of all s-stable k-element subsets of [n] as ...
Adjacent vertex-distinguishing edge and total chromatic numbers of hypercubes

An adjacent vertex-distinguishing edge coloring of a simple graph G is a proper edge coloring of G such that incident edge sets of any two adjacent vertices are assigned different sets of colors. A total coloring of a graph G is a coloring of both the ...
Forbidden Subgraphs and Weak Locally Connected Graphs

A graph is called H-free if it has no induced subgraph isomorphic to H. A graph is called $$N^i$$Ni-locally connected if $$G[\{ x\in V(G): 1\le d_G(w, x)\le i\}]$$G[{x?V(G):1≤dG(w,x)≤i}] is connected and $$N_2$$N2-locally connected if $$G[\{uv: \{uw, vw\...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

IA³ '15: Proceedings of the 5th Workshop on Irregular Applications: Architectures and Algorithms

November 2015

79 pages

ISBN:9781450340014

DOI:10.1145/2833179

Conference Chairs:
Antonino Tumeo
Pacific Northwest National Laboratory
,
John Feo
Pacific Northwest National Laboratory
,
Oreste Villa
NVIDIA

Copyright © 2015 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGHPC: ACM Special Interest Group on High Performance Computing, Special Interest Group on High Performance Computing
SIGARCH: ACM Special Interest Group on Computer Architecture
IEEE-CS\DATC: IEEE Computer Society

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 November 2015

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Short-paper

Conference

SC15

Sponsor:

SIGHPC
SIGARCH
IEEE-CS\DATC

SC15: The International Conference for High Performance Computing, Networking, Storage and Analysis

November 15, 2015

Texas, Austin

Acceptance Rates

IA³ '15 Paper Acceptance Rate 6 of 24 submissions, 25%;

Overall Acceptance Rate 18 of 67 submissions, 27%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

5
Total Citations
View Citations
162
Total Downloads

Downloads (Last 12 months)9
Downloads (Last 6 weeks)0

Reflects downloads up to 16 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Elmougy YHayashi ASarkar V(2024)Asynchronous Distributed Actor-Based Approach to Jaccard Similarity for Genome ComparisonsISC High Performance 2024 Research Paper Proceedings (39th International Conference)10.23919/ISC.2024.10528922(1-11)Online publication date: May-2024
https://doi.org/10.23919/ISC.2024.10528922
Fariborz MSamani MO'Neill TLowe-Power JYoo SAkella V(2022)A Model for Scalable and Balanced Accelerators for Graph ProcessingIEEE Computer Architecture Letters10.1109/LCA.2022.321548921:2(149-152)Online publication date: 1-Jul-2022
https://doi.org/10.1109/LCA.2022.3215489
Vandierendonck HAyguadé EHwu WBadia RHofstee H(2020)GraptorProceedings of the 34th ACM International Conference on Supercomputing10.1145/3392717.3392753(1-13)Online publication date: 29-Jun-2020
https://dl.acm.org/doi/10.1145/3392717.3392753
Besta MPodstawski MGroner LSolomonik EHoefler THuang HWeissman JIamnitchi AIosup A(2017)To Push or To PullProceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing10.1145/3078597.3078616(93-104)Online publication date: 26-Jun-2017
https://dl.acm.org/doi/10.1145/3078597.3078616
Beamer SAsanovic KPatterson D(2017)Reducing Pagerank Communication via Propagation Blocking2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS.2017.112(820-831)Online publication date: May-2017
https://doi.org/10.1109/IPDPS.2017.112

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents