Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3468044acmotherconferencesBook PagePublication PagesheartConference Proceedingsconference-collections
HEART '21: Proceedings of the 11th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies
ACM2021 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
HEART '21: International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies Online Germany June 21 - 23, 2021
ISBN:
978-1-4503-8549-7
Published:
21 June 2021
In-Cooperation:
German Research Foundation

Reflects downloads up to 30 Sep 2024Bibliometrics
Abstract

No abstract available.

Skip Table Of Content Section
SESSION: Invited Presentations
invited-talk
Large-scale combinatorial optimization in real-time systems by FPGA-based accelerators for simulated bifurcation
Article No.: 1, Pages 1–6https://doi.org/10.1145/3468044.3468045

Combinatorial optimization problems are economically valuable but computationally hard to solve. Many practical combinatorial optimizations can be converted to the ground-state search problems of Ising spin models. Simulated bifurcation (SB) is a ...

invited-talk
On the Inevitability of Integrated HPC Systems and How they will Change HPC System Operations
Article No.: 2, Pages 1–6https://doi.org/10.1145/3468044.3468046

High-Performance Computing (HPC) is at an inflection point in its evolution. General-purpose architectures approach limits in terms of speed and power/energy, requiring the development of specialized architectures to deliver accelerated performance. ...

SESSION: Full Paper Presentations
research-article
Software-like Compilation for Data Center FPGA Accelerators
Article No.: 3, Pages 1–6https://doi.org/10.1145/3468044.3468047

Compilation times for large Xilinx devices, such as the Amazon F1 instance, are on the order of several hours. However, today's data center designs often have many identical processing units (PUs), meaning that conventional design flows waste time ...

research-article
Automation of Domain-specific FPGA-IP Generation and Test
Article No.: 4, Pages 1–6https://doi.org/10.1145/3468044.3468048

Multi-access edge computing (MEC) devices that perform processing between the edge and cloud are becoming important in the Internet of Things infrastructure. MEC devices are designed to reduce the load on the edge devices, ensure real-time performance, ...

research-article
A programming environment for multi-FPGA systems based on CyberWorkBench: an integrated design tool
Article No.: 5, Pages 1–6https://doi.org/10.1145/3468044.3468049

This paper proposes a multi-FPGA programming environment based on NEC's integrated design tool CyberWorkBench (CWB) for a multi-FPGA system FiC (Flow-in-Cloud). Programmers describe their program in SystemC as small modules connected with FIFO channels, ...

research-article
Accelerating Matrix Processing for MIMO Systems
Article No.: 6, Pages 1–6https://doi.org/10.1145/3468044.3468050

Massive Multiple In and Multiple Out (MIMO) is being used in the fifth generation of wireless communication systems. As the number of antennas increases, the computational complexity grows dramatically, and this involves matrix calculations with complex ...

research-article
Kyokko: a vendor-independent high-speed serial communication controller
Article No.: 7, Pages 1–6https://doi.org/10.1145/3468044.3468051

With the advancement of HLS technology, FPGA is finally drawing attention as a power-efficient accelerator device. Unlike GPUs, the computation pipeline and FPGA-to-FPGA interconnection can be tightly coupled on FPGAs because they have high-speed serial ...

research-article
StreamBrain: An HPC Framework for Brain-like Neural Networks on CPUs, GPUs and FPGAs
Article No.: 8, Pages 1–6https://doi.org/10.1145/3468044.3468052

The modern deep learning method based on backpropagation has surged in popularity and has been used in multiple domains and application areas. At the same time, there are other - less-known - machine learning algorithms with a mature and solid ...

research-article
Benchmarking the Nvidia GPU Lineage: From Early K80 to Modern A100 with Asynchronous Memory Transfers
Article No.: 9, Pages 1–6https://doi.org/10.1145/3468044.3468053

For many, Graphics Processing Units (GPUs) provides a source of reliable computing power. Recently, Nvidia introduced its 9th generation HPC-grade GPUs, the Ampere 100 (A100), claiming significant performance improvements over previous generations, ...

research-article
Open Access
A Sorting Library for FPGA Implementation in OpenCL Programming
Article No.: 10, Pages 1–6https://doi.org/10.1145/3468044.3468054

In this study, we focus on data sorting, which is a basic arithmetic operation, and we present a sorting library that can be used with the OpenCL programming model for field-programmable gate arrays (FPGAs). Our sorting library is built by combining ...

SESSION: PhD Forum Presentations
extended-abstract
extended-abstract
extended-abstract
FPGA Acceleration of Short Read Alignment
Article No.: 13, Pages 1–2https://doi.org/10.1145/3468044.3468057

This work proposes a novel dataflow architecture for Smith-Waterman Matrix-fill and Traceback stages, which are at the heart of short-read alignment on NGS data. The FPGA accelerator is coupled with radical software restructuring to widely-used Bowtie2 ...

extended-abstract
Towards Performance Characterization of FPGAs in Context of HPC using OpenCL Benchmarks
Article No.: 14, Pages 1–2https://doi.org/10.1145/3468044.3468058

OpenCL-based HLS frameworks are known to reduce the development effort for FPGAs while offering good quality results. The upcoming trend to equip heterogeneous compute clusters with hybrid networks for inter-CPU and inter-FPGA communication allows ...

extended-abstract
CoopCL: A Framework for Cooperative Execution of Data-parallel Kernels on Multi-device Platforms
Article No.: 16, Pages 1–2https://doi.org/10.1145/3468044.3468061

This paper describes CoopCL framework that is specifically designed to reduce the multi-device programming complexity. CoopCL consists of a three core components: a C++ API, custom-compiler and a runtime that abstracts and unifies the cooperative ...

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Acceptance Rates

Overall Acceptance Rate 22 of 50 submissions, 44%
YearSubmittedAcceptedRate
HEART '22211048%
HEART '19291241%
Overall502244%