Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3318265.3318285acmotherconferencesArticle/Chapter ViewAbstractPublication Pageshp3cConference Proceedingsconference-collections
research-article

An FPGA-based accelerator platform implements for convolutional neural network

Published: 08 March 2019 Publication History

Abstract

In recent years, convolutional neural network (CNN) has become widely universal in large number of applications including computer vision, natural language processing and automatic driving. However, the CNN-based methods are computational-intensive and resource-intensive, and thus are hard to integrate the neural network into embedded systems such as smart phones, automatic driving and robots. To address the limitation, various deep learning accelerators have been proposed to implement on the field programmable gate array (FPGA) platform, because of its flexibility and reconfigurability. In this paper, we design and implement an FPGA-based accelerator platform which integrated the NVIDIA deep learning accelerator (NVDLA). We illustrate the detail architecture of the accelerator, and give the software and hardware co-design approaches which can instruct the system designs of FPGA-based accelerator platform. As a case study, we implement the CNN accelerator on an XCZU9EG FPGA platform and our implement achieves a peak performance of 25.6 GOPS when computing the valid output of convolutional layers under 100 MHz working frequency.

References

[1]
Krizhevsky, Alex, Sutskever, Ilya, and Hinton, Geoffrey E. "Imageaent classification with deep convolutional neural networks." In NIPS, pp. 1097--1105, 2012. Bowman, M.,
[2]
Simonyan, Karen and Zisserman, Andrew. "Very deep convolutional networks for large-scale image recognition". arXiv preprint arXiv: 1409.1556, 2014.
[3]
K. He, X. Zhang, S. Ren, and J. Sun. "Deep residual learning for Image recognition." arXiv preprint arXiv: 1512.03385, 2015
[4]
T. Chen et al., "Diannao: A small-footprint high-throughput accelerator for ubiquitous machine-learning." In Proc. ACM Int. Conf. Archit. Support Program. Lang. Oper. Syst. (ASPLOS), 2014, pp. 269--284.
[5]
Y. Chen, T. Luo et al., "DaDianNao: A Machine-learning Supercomputer," in Proceedings of the 47<sup>th</sup> Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), pp. 609--622, 2015.
[6]
Norman P. Jouppi et al., TPU: In-Datacenter Performance Analysis of a Tensor Processing Unit. In ISCA. 2017, June 24-28, 2017
[7]
C. Zhang, P. Li, G. Sun, Y. Guan, B. Xiao, and J. Cong, "Optimizing FPGAbased accelerator design for deep convolutional neural networks," in FPGA, 2015.
[8]
J. Qiu, J. Wang, S. Yao, K. Guo, B. Li, E. Zhou, J. Yu, T. Tang, N. Xu, and S. Song, "Going deeper with embedded FPGA platform for convolutional neural network," in Proceedings of the 2016 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays. ACM, 2016, pp. 26--35.
[9]
Liqiang Lu, Yun Liang et al., "Evaluating Fast Algorithms for Convolutional Neural Networks on FPGAs"

Index Terms

  1. An FPGA-based accelerator platform implements for convolutional neural network

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    HP3C '19: Proceedings of the 3rd International Conference on High Performance Compilation, Computing and Communications
    March 2019
    201 pages
    ISBN:9781450366380
    DOI:10.1145/3318265
    • Conference Chair:
    • Steven Guan
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 March 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. FPGA platform
    2. NVDLA
    3. accelerator
    4. convloutional neuaral network

    Qualifiers

    • Research-article

    Conference

    HP3C '19

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 317
      Total Downloads
    • Downloads (Last 12 months)24
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 11 Feb 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media