Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2833179.2833187acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
short-paper

GAIL: the graph algorithm iron law

Published: 15 November 2015 Publication History

Abstract

As new applications for graph algorithms emerge, there has been a great deal of research interest in improving graph processing. However, it is often difficult to understand how these new contributions improve performance. Execution time, the most commonly reported metric, distinguishes which alternative is the fastest but does not give any insight as to why. A new contribution may have an algorithmic innovation that allows it to examine fewer graph edges. It could also have an implementation optimization that reduces communication. It could even have optimizations that allow it to increase its memory bandwidth utilization. More interestingly, a new innovation may simultaneously affect all three of these factors (algorithmic work, communication volume, and memory bandwidth utilization). We present the Graph Algorithm Iron Law (GAIL) to quantify these tradeoffs to help understand graph algorithm performance.

References

[1]
V. Agarwal, F. Petrini, et al. Scalable graph exploration on multicore processors. SC, 2010.
[2]
S. Beamer, K. Asanović, and D. A. Patterson. The GAP benchmark suite. arXiv:1508.03619, 2015.
[3]
S. Beamer, K. Asanović, and D. A. Patterson. Direction-optimizing breadth-first search. SC, 2012.
[4]
S. Beamer, K. Asanović, and D. A. Patterson. Locality exists in graph processing: Workload characterization on an Ivy Bridge server. International Symposium on Workload Characterization (IISWC), 2015.
[5]
A. Buluç and J. Gilbert. The Combinatorial BLAS: Design, implementation, and applications. International Journal of High Performance Computing Applications (IJHPCA), 25(4):496--509, 2011.
[6]
J. W. Choi, D. Bedard, R. Fowler, and R. Vuduc. A roofline model of energy. In IPDPS, 2013.
[7]
T. Davis and Y. Hu. The University of Florida sparse matrix collection. ACM Transactions on Mathematical Software, 38:1:1--1:25, 2011.
[8]
9th DIMACS implementation challenge. http://www.dis.uniroma1.it/challenge9/, 2006.
[9]
J. Emer and D. Clark. A characterization of processor performance in the VAX-11/780. ISCA, 1984.
[10]
GAP benchmark suite reference code v0.6. https://github.com/sbeamer/gapbs.
[11]
Graph500 benchmark. www.graph500.org.
[12]
O. Green, M. Dukhan, and R. Vuduc. Branch-avoiding graph algorithms. SPAA, 2015.
[13]
Intel performance counter monitor. www.intel.com/software/pcm.
[14]
A. Kyrola, G. Blelloch, and C. Guestrin. GraphChi: Large-scale graph computation on just a PC. Symposium on Operating Systems Design and Implementation, 2012.
[15]
J. Leskovec, D. Chakrabarti, et al. Realistic, mathematically tractable graph generation and evolution, using Kronecker multiplication. European Conference on Principles and Practice of Knowledge Discovery in Databases, 2005.
[16]
D. A. Patterson and J. L. Hennessy. Computer Organization and Design: The Hardware/Software Interface. Morgan Kaufmann, 4th edition, 2008.
[17]
K. D. Underwood, M. Vance, J. Berry, and B. Hendrickson. Analyzing the scalability of graph algorithms on Eldorado. In IPDPS, 2007.

Cited By

View all
  • (2024)Asynchronous Distributed Actor-Based Approach to Jaccard Similarity for Genome ComparisonsISC High Performance 2024 Research Paper Proceedings (39th International Conference)10.23919/ISC.2024.10528922(1-11)Online publication date: May-2024
  • (2022)A Model for Scalable and Balanced Accelerators for Graph ProcessingIEEE Computer Architecture Letters10.1109/LCA.2022.321548921:2(149-152)Online publication date: 1-Jul-2022
  • (2020)GraptorProceedings of the 34th ACM International Conference on Supercomputing10.1145/3392717.3392753(1-13)Online publication date: 29-Jun-2020
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
IA3 '15: Proceedings of the 5th Workshop on Irregular Applications: Architectures and Algorithms
November 2015
79 pages
ISBN:9781450340014
DOI:10.1145/2833179
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 November 2015

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Short-paper

Conference

SC15
Sponsor:

Acceptance Rates

IA3 '15 Paper Acceptance Rate 6 of 24 submissions, 25%;
Overall Acceptance Rate 18 of 67 submissions, 27%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Asynchronous Distributed Actor-Based Approach to Jaccard Similarity for Genome ComparisonsISC High Performance 2024 Research Paper Proceedings (39th International Conference)10.23919/ISC.2024.10528922(1-11)Online publication date: May-2024
  • (2022)A Model for Scalable and Balanced Accelerators for Graph ProcessingIEEE Computer Architecture Letters10.1109/LCA.2022.321548921:2(149-152)Online publication date: 1-Jul-2022
  • (2020)GraptorProceedings of the 34th ACM International Conference on Supercomputing10.1145/3392717.3392753(1-13)Online publication date: 29-Jun-2020
  • (2017)To Push or To PullProceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing10.1145/3078597.3078616(93-104)Online publication date: 26-Jun-2017
  • (2017)Reducing Pagerank Communication via Propagation Blocking2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS.2017.112(820-831)Online publication date: May-2017

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media