TECS: Vol 22, No 6

Volume 22, Issue 6November 2023

Volume 22, Issue 6

November 2023

Editor:

Tulika Mitra
National University of Singapore, Singapore

Publisher:

Association for Computing Machinery
New York
NY
United States

ISSN:1539-9087

EISSN:1558-3465

Tags:

Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Bibliometrics

Issue Downloads

PDFfront matter (TOC, masthead, submission information)

Select All

Export Citations Save to Binder

SECTION: Special Issue on AI Acceleration on FPGAS

introduction

Free

Special Issue: “AI Acceleration on FPGAs”

Article No.: 89, Pages 1–3https://doi.org/10.1145/3626323

research-article

High-performance Reconfigurable DNN Accelerator on a Bandwidth-limited Embedded System

Article No.: 90, Pages 1–20https://doi.org/10.1145/3530818

Deep convolutional neural networks (DNNs) have been widely used in many applications, particularly in machine vision. It is challenging to accelerate DNNs on embedded systems because real-world machine vision applications should reserve a lot of external ...

research-article

FD-CNN: A Frequency-Domain FPGA Acceleration Scheme for CNN-Based Image-Processing Applications

Article No.: 91, Pages 1–30https://doi.org/10.1145/3559105

In the emerging edge-computing scenarios, FPGAs have been widely adopted to accelerate convolutional neural network (CNN)–based image-processing applications, such as image classification, object detection, and image segmentation, and so on. A standard ...

research-article

Open Access

An Intermediate-Centric Dataflow for Transposed Convolution Acceleration on FPGA

Article No.: 92, Pages 1–22https://doi.org/10.1145/3561053

Transposed convolution has been prevailing in convolutional neural networks (CNNs), playing an important role in multiple scenarios such as image segmentation and back-propagation process of training CNNs. This mainly benefits from the ability to up-...

research-article

Accelerating Attention Mechanism on FPGAs based on Efficient Reconfigurable Systolic Array

Article No.: 93, Pages 1–22https://doi.org/10.1145/3549937

Transformer model architectures have recently received great interest in natural language, machine translation, and computer vision, where attention mechanisms are their building blocks. However, the attention mechanism is expensive because of its ...

research-article

On the RTL Implementation of FINN Matrix Vector Unit

Article No.: 94, Pages 1–27https://doi.org/10.1145/3547141

Field-programmable gate array (FPGA)–based accelerators are becoming increasingly popular for deep neural network (DNN) inference due to their ability to scale performance with increasing degrees of specialization with dataflow architectures or custom ...

research-article

ACDSE: A Design Space Exploration Method for CNN Accelerator based on Adaptive Compression Mechanism

Article No.: 95, Pages 1–26https://doi.org/10.1145/3545177

Customized accelerators for Convolutional Neural Network (CNN) can achieve better energy efficiency than general computing platforms. However, the design of a high-performance accelerator should take into account a variety of parameters and physical ...

research-article

Open Access

TH-iSSD: Design and Implementation of a Generic and Reconfigurable Near-Data Processing Framework

Article No.: 96, Pages 1–23https://doi.org/10.1145/3563456

We present the design and implementation of TH-iSSD, a near-data processing framework to address the data movement problem. TH-iSSD does not pose any restriction to the hardware selection and is highly reconfigurable—its core components, such as the on-...

Subjects

Comments

Please enable JavaScript to view thecomments powered by Disqus.

ACM Transactions on Embedded Computing Systems

Sections

Issue Downloads

Special Issue: “AI Acceleration on FPGAs”

High-performance Reconfigurable DNN Accelerator on a Bandwidth-limited Embedded System

FD-CNN: A Frequency-Domain FPGA Acceleration Scheme for CNN-Based Image-Processing Applications

An Intermediate-Centric Dataflow for Transposed Convolution Acceleration on FPGA

Accelerating Attention Mechanism on FPGAs based on Efficient Reconfigurable Systolic Array

On the RTL Implementation of FINN Matrix Vector Unit

ACDSE: A Design Space Exploration Method for CNN Accelerator based on Adaptive Compression Mechanism

TH-iSSD: Design and Implementation of a Generic and Reconfigurable Near-Data Processing Framework

RegKey: A Register-based Implementation of ECC Signature Algorithms Against One-shot Memory Disclosure

SensiX++: Bringing MLOps and Multi-tenant Model Serving to Sensory Edge Devices

Scheduling Dynamic Software Updates in Mobile Robots

Online Distributed Schedule Randomization to Mitigate Timing Attacks in Industrial Control Systems

SG-Float: Achieving Memory Access and Computing Power Reduction Using Self-Gating Float in CNNs

Energy-Efficient Communications for Improving Timely Progress of Intermittent-Powered BLE Devices

A Comprehensive Model for Efficient Design Space Exploration of Imprecise Computational Blocks

Dynamic Thermal Management of 3D Memory through Rotating Low Power States and Partial Channel Closure

Enabling Binary Neural Network Training on the Edge

Design and Analysis of High Performance Heterogeneous Block-based Approximate Adders