Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1109/FCCM.2006.45guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

GraphStep: A System Architecture for Sparse-Graph Algorithms

Published: 24 April 2006 Publication History

Abstract

Many important applications are organized around long-lived, irregular sparse graphs (e.g., data and knowledge bases, CAD optimization, numerical problems, simulations). The graph structures are large, and the applications need regular access to a large, data-dependent portion of the graph for each operation (e.g., the algorithm may need to walk the graph, visiting all nodes, or propagate changes through many nodes in the graph). On conventional microprocessors, the graph structures exceed on-chip cache capacities, making main-memory bandwidth and latency the key performance limiters. To avoid this "memory wall," we introduce a concurrent system architecture for sparse graph algorithms that places graph nodes in small distributed memories paired with specialized graph processing nodes interconnected by a lightweight network. This gives us a scalable way to map these applications so that they can exploit the high-bandwidth and low-latency capabilities of embedded memories (e.g., FPGA Block RAMs). On typical spreadingactivation queries on the ConceptNet Knowledge Base, a sample application, this translates into an order of magnitude speedup per FPGA compared to a state-of-the-art Pentium processor.

Cited By

View all
  • (2024)Dynamic-ACTS - A Dynamic Graph Analytics Accelerator For HBM-Enabled FPGAsACM Transactions on Reconfigurable Technology and Systems10.1145/366200217:3(1-29)Online publication date: 30-Apr-2024
  • (2024)ScalaBFS2: A High-performance BFS Accelerator on an HBM-enhanced FPGA ChipACM Transactions on Reconfigurable Technology and Systems10.1145/365003717:2(1-39)Online publication date: 29-Feb-2024
  • (2023)ACTS: A Near-Memory FPGA Graph Processing FrameworkProceedings of the 2023 ACM/SIGDA International Symposium on Field Programmable Gate Arrays10.1145/3543622.3573180(79-89)Online publication date: 12-Feb-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
FCCM '06: Proceedings of the 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines
April 2006
351 pages
ISBN:0769526616

Publisher

IEEE Computer Society

United States

Publication History

Published: 24 April 2006

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Dynamic-ACTS - A Dynamic Graph Analytics Accelerator For HBM-Enabled FPGAsACM Transactions on Reconfigurable Technology and Systems10.1145/366200217:3(1-29)Online publication date: 30-Apr-2024
  • (2024)ScalaBFS2: A High-performance BFS Accelerator on an HBM-enhanced FPGA ChipACM Transactions on Reconfigurable Technology and Systems10.1145/365003717:2(1-39)Online publication date: 29-Feb-2024
  • (2023)ACTS: A Near-Memory FPGA Graph Processing FrameworkProceedings of the 2023 ACM/SIGDA International Symposium on Field Programmable Gate Arrays10.1145/3543622.3573180(79-89)Online publication date: 12-Feb-2023
  • (2019)Improving Performance of Graph Processing on FPGA-DRAM Platform by Two-level Vertex CachingProceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays10.1145/3289602.3293900(320-329)Online publication date: 20-Feb-2019
  • (2018)A Template-Based Design Methodology for Graph-Parallel Hardware AcceleratorsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2017.270656237:2(420-430)Online publication date: 1-Feb-2018
  • (2017)Towards Flexible Automatic Generation of Graph Processing GatewareProceedings of the 8th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies10.1145/3120895.3120896(1-6)Online publication date: 7-Jun-2017
  • (2017)FPGA-Accelerated Transactional Execution of Graph WorkloadsProceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays10.1145/3020078.3021743(227-236)Online publication date: 22-Feb-2017
  • (2017)A BSP model graph processing system on many coresCluster Computing10.1007/s10586-017-0829-020:2(1359-1377)Online publication date: 1-Jun-2017
  • (2016)Energy efficient architecture for graph analytics acceleratorsACM SIGARCH Computer Architecture News10.1145/3007787.300115544:3(166-177)Online publication date: 18-Jun-2016
  • (2016)GraphOpsProceedings of the 2016 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays10.1145/2847263.2847337(111-117)Online publication date: 21-Feb-2016
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media