Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3287624.3287669acmconferencesArticle/Chapter ViewAbstractPublication PagesaspdacConference Proceedingsconference-collections
research-article

On-chip memory optimization for high-level synthesis of multi-dimensional data on FPGA

Published: 21 January 2019 Publication History

Abstract

It is very challenging to design an on-chip memory architecture for high-performance kernels with large amount of computation and data. The on-chip memory architecture must support efficient data access from both the computation part and the external memory part, which often have very different expectations about how data should be accessed and stored. Previous work provides only a limited set of optimizations. In this paper we show how to fundamentally restructure on-chip buffers, by decoupling logical array view from the physical buffer view, and providing general mapping schemes for the two. Our framework considers the entire data flow from the external memory to the computation part in order to minimize resource usage without creating performance bottleneck. Our experimental results demonstrate that our proposed technique can generate solutions that reduce memory usage significantly (2X over the conventional method), and successfully generate optimized on-chip buffer architectures without costly design iterations for highly optimized computation kernels.

References

[1]
N. K. Pham, A. K. Singh, A. Kumar, and M. M. A. Khin, "Exploiting loop-array dependencies to accelerate the design space exploration with high level synthesis," in DATE '15, ser. DATE '15. San Jose, CA, USA: EDA Consortium, 2015, pp. 157--162.
[2]
G. Zhong et al., "Design space exploration of multiple loops on fpgas using high level synthesis," in ICCD '14. IEEE, 2014, pp. 456--463.
[3]
G. Zhong, "Lin-analyzer: A high-level performance analysis tool for fpga-based accelerators," in DAC '16, ser. DAC '16. New York, NY, USA: ACM, 2016, pp. 136:1--136:6.
[4]
J. Cong, P. Wei, C. H. Yu, and P. Zhou, "Bandwidth optimization through on-chip memory restructuring for HLS," in DAC '17, 2017, pp. 43:1--43:6.
[5]
H.-Y. Liu and L. P. Carloni, "On learning-based methods for design-space exploration with high-level synthesis," in DAC '13. New York, NY, USA: ACM, 2013, pp. 50:1--50:7.
[6]
B. So, M. W. Hall, and P. C. Diniz, "A compiler approach to fast hardware design space exploration in fpga-based systems," SIGPLAN Not., vol. 37, pp. 165--176, May 2002.
[7]
A. Rahman, J. Lee, and K. Choi, "Efficient FPGA acceleration of convolutional neural networks using logical-3D compute array," in DATE '16. San Jose, CA, USA: EDA Consortium, 2016, pp. 1393--1398.
[8]
A. Rahman, S. Oh, J. Lee, and K. Choi, "Design space exploration of FPGA accelerators for convolutional neural networks," in Design, Automation and Test in Europe (DATE '17), Mar. 2017, pp. 1147--1152.
[9]
S. Lee, D. Kim, D. Nguyen, and J. Lee, "Double MAC on a DSP: Boosting the performance of convolutional neural networks on FPGAs," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2018.
[10]
Vivado HLS, Xilinx Inc., http://www.xilinx.com.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ASPDAC '19: Proceedings of the 24th Asia and South Pacific Design Automation Conference
January 2019
794 pages
ISBN:9781450360074
DOI:10.1145/3287624
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

In-Cooperation

  • IEICE ESS: Institute of Electronics, Information and Communication Engineers, Engineering Sciences Society
  • IEEE CAS
  • IEEE CEDA
  • IPSJ SIG-SLDM: Information Processing Society of Japan, SIG System LSI Design Methodology

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 January 2019

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Funding Sources

Conference

ASPDAC '19
Sponsor:

Acceptance Rates

Overall Acceptance Rate 466 of 1,454 submissions, 32%

Upcoming Conference

ASPDAC '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 170
    Total Downloads
  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media