Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1109/FCCM.2007.60guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Sparse Matrix-Vector Multiplication Design on FPGAs

Published: 23 April 2007 Publication History

Abstract

Creating a high throughput sparse matrix vector multiplication (SpMxV) implementation depends on a balanced system design. In this paper, we introduce the innovative SpMxV Solver designed for FPGAs (SSF). Besides high computational throughput, system performance is optimized by reducing initialization time and overheads, minimizing and overlapping I/O operations, and increasing scalability. SSF accepts any matrix size and can be easily adapted to different data formats. SSF minimizes the control logic by taking advantage of the data flow via an innovative accumulation circuit which uses pipelined floating point adders. Compared to optimized software codes on a Pentium 4 microprocessor, our design achieves up to 20x speedup.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
FCCM '07: Proceedings of the 15th Annual IEEE Symposium on Field-Programmable Custom Computing Machines
April 2007
354 pages
ISBN:0769529402

Publisher

IEEE Computer Society

United States

Publication History

Published: 23 April 2007

Author Tags

  1. FPGA
  2. Performance
  3. Sparse matrix multiplication

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 20 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Accelerator for Sparse Machine LearningIEEE Computer Architecture Letters10.1109/LCA.2017.271466717:1(21-24)Online publication date: 1-Jan-2018
  • (2017)A novel BRAM content accessing and processing method based on FPGA configuration bitstreamMicroprocessors & Microsystems10.1016/j.micpro.2017.01.00949:C(64-76)Online publication date: 1-Mar-2017
  • (2015)AHEADProceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition10.5555/2755753.2757032(942-947)Online publication date: 9-Mar-2015
  • (2014)A Multiple-FPGA parallel computing architecture for real-time simulation of soft-object deformationACM Transactions on Embedded Computing Systems10.1145/256003113:4(1-23)Online publication date: 10-Mar-2014
  • (2013)Compiled multithreaded data paths on FPGAs for dynamic workloadsProceedings of the 2013 International Conference on Compilers, Architectures and Synthesis for Embedded Systems10.5555/2555729.2555732(1-10)Online publication date: 29-Sep-2013
  • (2012)Cellular automata-based parallel random number generators using FPGAsInternational Journal of Reconfigurable Computing10.1155/2012/2190282012(4-4)Online publication date: 1-Jan-2012
  • (2010)Haptic rendering of deformable objects using a multiple FPGA parallel computing architectureProceedings of the 18th annual ACM/SIGDA international symposium on Field programmable gate arrays10.1145/1723112.1723147(189-198)Online publication date: 21-Feb-2010
  • (2010)Sparse Matrix-Vector Multiplication on a Reconfigurable Supercomputer with ApplicationACM Transactions on Reconfigurable Technology and Systems10.1145/1661438.16614403:1(1-31)Online publication date: 1-Jan-2010
  • (2009)An integrated reduction technique for a double precision accumulatorProceedings of the Third International Workshop on High-Performance Reconfigurable Computing Technology and Applications10.1145/1646461.1646463(11-18)Online publication date: 15-Nov-2009

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media