Sudrajat et al., 2019 - Google Patents
GEMM-Based Quantized Neural Network FPGA Accelerator Design
- Document ID
- 1817960969226360832
- Authors
- Sudrajat M
- Adiono T
- Syafalni I
- Publication year
- 2019
- Publication venue
- 2019 International Symposium on Electronics and Smart Devices (ISESD)
Snippet
In this study, we will explore neural network FPGA acceleration based on accelerating General Matrix Multiplication (GEMM). GEMM acceleration allows a regularized and modular implementation of the accelerator design, as well as providing the benefits of …
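The GEMM formulation the snippet describes is easy to see in miniature: a quantized fully connected layer collapses into a single integer matrix multiply. The sketch below is illustrative only and not taken from the paper; it assumes a symmetric int8 quantization scheme, and the shapes, scales, and names (`quantize`, `w_scale`, `x_scale`) are hypothetical.

```python
# Illustrative sketch (not from the paper): how a quantized fully connected
# layer reduces to one integer GEMM, the operation such an accelerator
# is built around. Scales and shapes here are made up for the example.
import numpy as np

def quantize(x, scale):
    """Map float values to int8 with a simple symmetric scheme (assumed)."""
    return np.clip(np.round(x / scale), -128, 127).astype(np.int8)

rng = np.random.default_rng(0)
weights = rng.normal(size=(3, 4)).astype(np.float32)  # 3 neurons, 4 inputs
inputs = rng.normal(size=(4, 2)).astype(np.float32)   # batch of 2 columns

w_scale, x_scale = 0.05, 0.05
w_q = quantize(weights, w_scale)  # int8 weight matrix
x_q = quantize(inputs, x_scale)   # int8 activation matrix

# The core of a GEMM-based accelerator: one integer matrix multiply,
# accumulated in int32 to avoid overflow.
acc = w_q.astype(np.int32) @ x_q.astype(np.int32)

# Dequantize the int32 accumulator back to floats.
outputs = acc.astype(np.float32) * (w_scale * x_scale)
print(outputs)
```

Convolution layers map onto the same integer GEMM core once their inputs are unrolled (e.g., via im2col), which is what makes a single GEMM engine the regularized, modular building block the snippet refers to.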
Classifications
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
        - G06F7/38—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
          - G06F7/48—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
            - G06F7/52—Multiplying; Dividing
              - G06F7/523—Multiplying only
                - G06F7/53—Multiplying only in parallel-parallel fashion, i.e. both operands being entered in parallel
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
        - G06F17/10—Complex mathematical operations
          - G06F17/14—Fourier, Walsh or analogous domain transformations, e.g. Laplace, Hilbert, Karhunen-Loeve, transforms
            - G06F17/141—Discrete Fourier transforms
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
        - G06F17/50—Computer-aided design
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
        - G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F15/00—Digital computers in general; Data processing equipment in general
        - G06F15/76—Architectures of general purpose stored programme computers
          - G06F15/78—Architectures of general purpose stored programme computers comprising a single central processing unit
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
      - G06N3/00—Computer systems based on biological models
        - G06N3/02—Computer systems based on biological models using neural network models
- H—ELECTRICITY
  - H03—BASIC ELECTRONIC CIRCUITRY
    - H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
      - H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same information or similar information or a subset of information is represented by a different sequence or number of digits
        - H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
Similar Documents
Publication | Title
---|---
Jang et al. | Sparsity-aware and re-configurable NPU architecture for Samsung flagship mobile SoC
US20210374503A1 (en) | Network-centric architecture and algorithms to accelerate distributed training of neural networks
US20180197084A1 (en) | Convolutional neural network system having binary parameter and operation method thereof
WO2020057161A1 (en) | Split accumulator for convolutional neural network accelerator
CN110555516B (en) | Method for realizing low-delay hardware accelerator of YOLOv2-tiny neural network based on FPGA
CN110543939B (en) | Hardware acceleration realization device for convolutional neural network backward training based on FPGA
US11948069B2 (en) | Compression of neural network activation data
EP3637327B1 (en) | Computing device and method
CN110110852B (en) | Method for transplanting deep learning network to FPGA platform
Struharik et al. | CoNNA–Compressed CNN hardware accelerator
Piyasena et al. | Reducing dynamic power in streaming CNN hardware accelerators by exploiting computational redundancies
Li et al. | An efficient CNN accelerator using inter-frame data reuse of videos on FPGAs
Wu et al. | SkeletonGCN: A simple yet effective accelerator for GCN training
Niu et al. | SPEC2: Spectral sparse CNN accelerator on FPGAs
Zhan et al. | Field programmable gate array-based all-layer accelerator with quantization neural networks for sustainable cyber-physical systems
Sudrajat et al. | GEMM-Based Quantized Neural Network FPGA Accelerator Design
Xiao et al. | Research on FPGA-based convolutional neural network acceleration method
Zhou et al. | Design and implementation of YOLOv3-Tiny accelerator based on PYNQ-Z2 heterogeneous platform
Xiao et al. | A MobileNet accelerator with high processing-element efficiency on FPGA
Zhao et al. | HDSuper: High-quality and high computational utilization edge super-resolution accelerator with hardware-algorithm co-design techniques
Jo et al. | Bit-serial multiplier-based neural processing element with approximate adder tree
Li et al. | A 0.13 mJ/prediction CIFAR-100 raster-scan-based wired-logic processor using non-linear neural network
Sharma et al. | Hardware accelerator for object detection using tiny YOLO-v3
US20220121915A1 (en) | Configurable BNN ASIC using a network of programmable threshold logic standard cells
Huang et al. | A low-bit quantized and HLS-based neural network FPGA accelerator for object detection