- Article, November 2024
BKDSNN: Enhancing the Performance of Learning-Based Spiking Neural Networks Training with Blurred Knowledge Distillation
Abstract: Spiking neural networks (SNNs), which mimic biological neural systems to convey information via discrete spikes, are well known as brain-inspired models with excellent computing efficiency. By utilizing the surrogate gradient estimation for ...
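The truncated abstract above refers to surrogate-gradient training, the standard workaround for the non-differentiable spike function in learning-based SNNs: the hard threshold is kept in the forward pass while a smooth proxy supplies gradients in the backward pass. The following is a minimal, generic sketch (not code from the BKDSNN paper); the sigmoid-derivative surrogate and the scale factor of 4.0 are illustrative assumptions.

```python
import torch

class SpikeFn(torch.autograd.Function):
    """Heaviside spike in the forward pass, smooth surrogate in the backward pass."""

    @staticmethod
    def forward(ctx, v_minus_thresh, alpha):
        ctx.save_for_backward(v_minus_thresh)
        ctx.alpha = alpha
        return (v_minus_thresh > 0).float()        # non-differentiable spike

    @staticmethod
    def backward(ctx, grad_out):
        (x,) = ctx.saved_tensors
        sig = torch.sigmoid(ctx.alpha * x)
        surrogate = ctx.alpha * sig * (1.0 - sig)  # smooth stand-in for the Dirac delta
        return grad_out * surrogate, None          # no gradient w.r.t. alpha

# Gradients flow through the surrogate even though the output is binary.
v = torch.randn(8, requires_grad=True)
spikes = SpikeFn.apply(v - 1.0, 4.0)               # threshold and alpha are illustrative
spikes.sum().backward()
print(spikes, v.grad)
```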
- Article, August 2024
- Research article, June 2023
VSPIM: SRAM Processing-in-Memory DNN Acceleration via Vector-Scalar Operations
- Chen Nie,
- Chenyu Tang,
- Jie Lin,
- Huan Hu,
- Chenyang Lv,
- Ting Cao,
- Weifeng Zhang,
- Li Jiang,
- Xiaoyao Liang,
- Weikang Qian,
- Yanan Sun,
- Zhezhi He
IEEE Transactions on Computers (ITCO), Volume 73, Issue 10, Pages 2378–2390. https://doi.org/10.1109/TC.2023.3285095
Abstract: Processing-in-Memory (PIM) has been widely explored for accelerating data-intensive machine learning computation that mainly consists of general-matrix-multiplication (GEMM), by mitigating the burden of data movements and exploiting the ultra-high memory ...
- Short paper, June 2023
XMG-GPPIC: Efficient and Robust General-Purpose Processing-in-Cache with XOR-Majority-Graph
GLSVLSI '23: Proceedings of the Great Lakes Symposium on VLSI 2023, Pages 183–187. https://doi.org/10.1145/3583781.3590288
Abstract: Recent advances in processing-in-cache (PIC) have enabled general-purpose, high-performance computation with bit-serial computing techniques. Its outstanding performance relies on efficient hardware design as well as the software stack (i.e., Logic ...
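The bit-serial computing mentioned in the abstract above evaluates multi-bit arithmetic one bit-plane at a time, so an in-cache engine only needs bitwise operations and population counts. The sketch below illustrates that decomposition for an unsigned dot product; it is a software analogy, not the paper's XOR-majority-graph flow, and the 8-bit operand widths are arbitrary.

```python
import numpy as np

def bit_serial_dot(a, w, abits=8, wbits=8):
    """Dot product computed one bit-plane at a time (unsigned operands).

    Each (i, j) pass only needs a bitwise AND plus a population count,
    which is the kind of primitive a bit-serial in-cache engine provides;
    the shifts re-weight the partial sums.
    """
    acc = 0
    for i in range(abits):
        a_plane = (a >> i) & 1                 # i-th bit of every activation
        for j in range(wbits):
            w_plane = (w >> j) & 1             # j-th bit of every weight
            acc += int(np.sum(a_plane & w_plane)) << (i + j)
    return acc

# sanity check against the ordinary dot product
rng = np.random.default_rng(0)
a = rng.integers(0, 256, size=64)
w = rng.integers(0, 256, size=64)
assert bit_serial_dot(a, w) == int(np.dot(a, w))
```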
- Research article, August 2022
EBSP: evolving bit sparsity patterns for hardware-friendly inference of quantized deep neural networks
DAC '22: Proceedings of the 59th ACM/IEEE Design Automation Conference, Pages 259–264. https://doi.org/10.1145/3489517.3530660
Abstract: Model compression has been extensively investigated for supporting efficient neural network inference on edge-computing platforms due to the huge model size and computation amount. Recent research embraces joint-way compression across multiple ...
- Research article, August 2022
SATO: spiking neural network acceleration via temporal-oriented dataflow and architecture
DAC '22: Proceedings of the 59th ACM/IEEE Design Automation Conference, Pages 1105–1110. https://doi.org/10.1145/3489517.3530592
Abstract: Event-driven spiking neural networks (SNNs) have shown great promise for being strikingly energy-efficient. SNN neurons integrate the spikes, accumulate the membrane potential, and fire an output spike when the potential exceeds a threshold. Existing SNN ...
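The abstract above summarizes the neuron dynamics that any SNN dataflow has to schedule: integrate weighted input spikes into a membrane potential over timesteps and fire once the potential crosses a threshold. Below is a minimal leaky integrate-and-fire sketch of exactly that loop; the leak factor, threshold, and hard reset are common modelling choices, not parameters taken from SATO.

```python
import numpy as np

def lif_neuron(spike_in, weights, v_thresh=1.0, v_leak=0.9):
    """Leaky integrate-and-fire dynamics over T timesteps.

    spike_in: (T, N) binary input spikes; weights: (N,) synaptic weights.
    At each step the neuron integrates the weighted spikes into its membrane
    potential, leaks, and fires (then resets) once the threshold is crossed.
    """
    v = 0.0
    out = np.zeros(spike_in.shape[0], dtype=np.int8)
    for t, s in enumerate(spike_in):
        v = v_leak * v + float(np.dot(weights, s))  # integrate incoming spikes
        if v >= v_thresh:                           # fire when threshold is exceeded
            out[t] = 1
            v = 0.0                                 # hard reset (one common choice)
    return out

rng = np.random.default_rng(1)
spikes = (rng.random((16, 32)) < 0.2).astype(np.int8)   # T=16 steps, 32 synapses
w = rng.normal(0.0, 0.2, size=32)
print(lif_neuron(spikes, w))
```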
- Research article, August 2022
PIM-DH: ReRAM-based processing-in-memory architecture for deep hashing acceleration
- Fangxin Liu,
- Wenbo Zhao,
- Yongbiao Chen,
- Zongwu Wang,
- Zhezhi He,
- Rui Yang,
- Qidong Tang,
- Tao Yang,
- Cheng Zhuo,
- Li Jiang
DAC '22: Proceedings of the 59th ACM/IEEE Design Automation Conference, Pages 1087–1092. https://doi.org/10.1145/3489517.3530575
Abstract: Deep hashing has gained growing momentum in large-scale image retrieval. However, deep hashing is computation- and memory-intensive, which demands hardware acceleration. The unique process of hash sequence computation in deep hashing is non-trivial to ...
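For context on the hash sequence computation the abstract above refers to: deep hashing typically binarizes a learned embedding into a compact code and ranks database items by Hamming distance. The sketch below shows only that retrieval step; the sign-threshold binarization and 64-bit code length are assumptions for illustration, not PIM-DH's architecture.

```python
import numpy as np

def hash_codes(features):
    """Binarize real-valued embeddings into {0, 1} hash codes with a sign threshold."""
    return (features > 0).astype(np.uint8)

def hamming_rank(query_code, db_codes):
    """Rank database items by Hamming distance to the query code."""
    dists = np.count_nonzero(db_codes != query_code, axis=1)
    return np.argsort(dists), dists

rng = np.random.default_rng(2)
db_feat = rng.normal(size=(1000, 64))      # e.g. 64-bit codes from a hashing head
q_feat = rng.normal(size=64)
order, dists = hamming_rank(hash_codes(q_feat), hash_codes(db_feat))
print(order[:5], dists[order[:5]])         # closest database items to the query
```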
- Research article, May 2022
Self-terminating write of multi-level cell ReRAM for efficient neuromorphic computing
- Zongwu Wang,
- Zhezhi He,
- Rui Yang,
- Shiquan Fan,
- Jie Lin,
- Fangxin Liu,
- Yueyang Jia,
- Chenxi Yuan,
- Qidong Tang,
- Li Jiang
DATE '22: Proceedings of the 2022 Conference & Exhibition on Design, Automation & Test in Europe, Pages 1251–1256
Abstract: The Resistive Random-Access-Memory (ReRAM) in crossbar structure has shown great potential in accelerating the vector-matrix multiplication, owing to the fascinating computing complexity reduction (from O(n²) to O(1)). Nevertheless, the ReRAM cells ...
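The O(n²)-to-O(1) claim in the abstract above rests on the crossbar performing a whole vector-matrix multiplication in one analog step: every bit line sums the currents I = G·V from all of its cells simultaneously. A minimal ideal-device model of that operation is sketched below; device non-idealities (including the multi-level-cell write behaviour the paper targets) are deliberately ignored.

```python
import numpy as np

def crossbar_vmm(g, v_in):
    """Ideal ReRAM-crossbar model of a vector-matrix multiply.

    g:    (rows, cols) cell conductances encoding the matrix
    v_in: (rows,) input voltages applied to the word lines
    Each bit line physically sums I = G * V from all of its cells at once,
    which is why the analog step is treated as O(1) regardless of row count.
    """
    return v_in @ g          # Kirchhoff's current law per column

rng = np.random.default_rng(3)
G = rng.uniform(0.0, 1.0, size=(128, 64))   # conductances (arbitrary units)
V = rng.uniform(0.0, 0.3, size=128)         # read voltages
assert np.allclose(crossbar_vmm(G, V), V @ G)
```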
- Research article, May 2022
DTQAtten: Leveraging Dynamic Token-based Quantization for Efficient Attention Architecture
DATE '22: Proceedings of the 2022 Conference & Exhibition on Design, Automation & Test in Europe, Pages 700–705
Abstract: Models based on the attention mechanism, i.e., transformers, have shown extraordinary performance in Natural Language Processing (NLP) tasks. However, their memory footprint, inference latency, and power consumption are still prohibitive for efficient ...
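As a rough illustration of the token-based quantization named in the title above: dynamic schemes re-derive a quantization scale per token at run time instead of using one static tensor-wide scale. The sketch below shows symmetric per-token INT8 quantization of attention activations; the bit-width and scaling rule are generic assumptions, not DTQAtten's exact scheme.

```python
import numpy as np

def quantize_per_token(x, bits=8):
    """Symmetric per-token quantization: each token row gets its own scale.

    x: (tokens, dim) activations. Re-deriving the scale per token at run time
    is what makes the scheme 'dynamic'; the bit-width here is a fixed example.
    """
    qmax = 2 ** (bits - 1) - 1
    scale = np.abs(x).max(axis=1, keepdims=True) / qmax   # one scale per token
    scale = np.where(scale == 0, 1.0, scale)
    q = np.clip(np.round(x / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale

rng = np.random.default_rng(4)
acts = rng.normal(size=(16, 64))            # 16 tokens, hidden size 64
q, s = quantize_per_token(acts)
print(np.abs(acts - q * s).max())           # per-token quantization error
```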
- Research article, February 2022
N3H-Core: Neuron-designed Neural Network Accelerator via FPGA-based Heterogeneous Computing Cores
FPGA '22: Proceedings of the 2022 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, Pages 112–122. https://doi.org/10.1145/3490422.3502367
Abstract: Accelerating neural network inference with FPGAs has emerged as a popular option, since the reconfigurability and high-performance computing capability of FPGAs intrinsically satisfy the computation demands of fast-evolving neural algorithms. ...
- Research article, January 2022
HAWIS: Hardware-Aware Automated WIdth Search for Accurate, Energy-Efficient and Robust Binary Neural Network on ReRAM Dot-Product Engine
ASPDAC '22: Proceedings of the 27th Asia and South Pacific Design Automation Conference, Pages 226–231. https://doi.org/10.1109/ASP-DAC52403.2022.9712542
Abstract: Binary Neural Networks (BNNs) have attracted tremendous attention in ReRAM-based Processing-in-Memory (PIM) systems, since they significantly simplify the hardware-expensive peripheral circuits and reduce the memory footprint. Meanwhile, BNNs are proven to have ...
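The peripheral-circuit simplification mentioned above comes from how binary networks compute dot products: with {-1, +1} operands stored as bits, a multiply-accumulate reduces to XNOR plus popcount. A small software sketch of that identity follows; it illustrates the arithmetic a ReRAM dot-product engine maps, not HAWIS's width-search method.

```python
import numpy as np

def binary_dot(a_bits, w_bits):
    """Dot product of {-1, +1} vectors stored as {0, 1} bits.

    XNOR marks positions where the signs agree; the popcount then gives
    dot = 2 * matches - n, which is the arithmetic a BNN dot-product
    engine implements with cheap peripheral circuitry.
    """
    n = a_bits.size
    matches = np.count_nonzero(~(a_bits ^ w_bits) & 1)   # XNOR + popcount
    return 2 * matches - n

rng = np.random.default_rng(5)
a = rng.integers(0, 2, size=256, dtype=np.uint8)
w = rng.integers(0, 2, size=256, dtype=np.uint8)
expected = int(np.dot(2 * a.astype(int) - 1, 2 * w.astype(int) - 1))
assert binary_dot(a, w) == expected
```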
- Research article, December 2021
PIMGCN: A ReRAM-Based PIM Design for Graph Convolutional Network Acceleration
2021 58th ACM/IEEE Design Automation Conference (DAC), Pages 583–588. https://doi.org/10.1109/DAC18074.2021.9586231
Abstract: Graph Convolutional Network (GCN) is a promising but computing- and memory-intensive learning model. Processing-in-memory (PIM) architecture based on the ReRAM crossbar is a natural fit for GCN inference. It can reduce the data movements and compute the ...
- Research article, November 2021
Bit-Transformer: Transforming Bit-level Sparsity into Higher Performance in ReRAM-based Accelerator
2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD), Pages 1–9. https://doi.org/10.1109/ICCAD51958.2021.9643569
Abstract: Resistive Random-Access-Memory (ReRAM) crossbar is one of the most promising neural network accelerators, thanks to its in-memory and in-situ analog computing abilities for Matrix Multiplication-and-Accumulations (MACs). Nevertheless, the number of rows ...
- Short paper, October 2021
AdaptiveGCN: Efficient GCN Through Adaptively Sparsifying Graphs
CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge Management, Pages 3206–3210. https://doi.org/10.1145/3459637.3482049
Abstract: Graph Convolutional Networks (GCNs) have become the prevailing approach to efficiently learn representations from graph-structured data. Current GCN models adopt a neighborhood aggregation mechanism based on two primary operations, aggregation and ...
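The two primary operations the abstract above starts to name are usually aggregation (gathering neighbour features through the normalized adjacency) and combination (a learned linear transform plus nonlinearity). A minimal dense-matrix sketch of one such GCN layer is given below; the symmetric normalization and ReLU are standard choices, and nothing here reflects AdaptiveGCN's sparsification strategy.

```python
import numpy as np

def gcn_layer(adj, x, w):
    """One GCN layer: aggregate neighbour features, then combine with a weight matrix.

    adj: (n, n) adjacency with self-loops; x: (n, d_in) node features; w: (d_in, d_out).
    """
    deg = adj.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(np.maximum(deg, 1e-12))
    a_norm = adj * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]   # D^-1/2 A D^-1/2
    aggregated = a_norm @ x                                    # aggregation step
    return np.maximum(aggregated @ w, 0.0)                     # combination + ReLU

rng = np.random.default_rng(6)
n, d_in, d_out = 5, 8, 4
A = rng.integers(0, 2, size=(n, n))
A = np.maximum(A, A.T)                   # undirected graph
np.fill_diagonal(A, 1)                   # add self-loops
X = rng.normal(size=(n, d_in))
W = rng.normal(size=(d_in, d_out))
print(gcn_layer(A.astype(float), X, W).shape)
```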
- Research article, September 2021
BISWSRBS: A Winograd-based CNN Accelerator with a Fine-grained Regular Sparsity Pattern and Mixed Precision Quantization
ACM Transactions on Reconfigurable Technology and Systems (TRETS), Volume 14, Issue 4, Article No. 18, Pages 1–28. https://doi.org/10.1145/3467476
Abstract: Field-Programmable Gate Arrays (FPGAs) are a high-performance computing platform for Convolutional Neural Network (CNN) inference. The Winograd algorithm, weight pruning, and quantization are widely adopted to reduce the storage and arithmetic overhead of CNNs ...
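For the Winograd algorithm cited above, the arithmetic saving comes from computing convolution outputs in a transform domain with fewer multiplications; the smallest case, F(2,3), produces two outputs of a 3-tap convolution with 4 multiplies instead of 6. The sketch below is that textbook 1-D case with the standard transform matrices, not the paper's sparsity-aware FPGA mapping.

```python
import numpy as np

# Winograd F(2,3): two outputs of a 3-tap 1-D convolution from a 4-sample tile,
# using 4 element-wise multiplies instead of 6.
Bt = np.array([[1,  0, -1,  0],
               [0,  1,  1,  0],
               [0, -1,  1,  0],
               [0,  1,  0, -1]], dtype=float)
G = np.array([[1.0,  0.0, 0.0],
              [0.5,  0.5, 0.5],
              [0.5, -0.5, 0.5],
              [0.0,  0.0, 1.0]])
At = np.array([[1, 1,  1,  0],
               [0, 1, -1, -1]], dtype=float)

def winograd_f23(d, g):
    """d: 4 input samples, g: 3 filter taps -> 2 outputs of the valid convolution."""
    m = (G @ g) * (Bt @ d)          # the 4 multiplies happen in the transform domain
    return At @ m

d = np.array([1.0, 2.0, 3.0, 4.0])
g = np.array([0.5, -1.0, 2.0])
direct = np.array([d[0]*g[0] + d[1]*g[1] + d[2]*g[2],
                   d[1]*g[0] + d[2]*g[1] + d[3]*g[2]])
assert np.allclose(winograd_f23(d, g), direct)
```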
- Research article, September 2021
Elf: accelerate high-resolution mobile deep vision with content-aware parallel offloading
- Wuyang Zhang,
- Zhezhi He,
- Luyang Liu,
- Zhenhua Jia,
- Yunxin Liu,
- Marco Gruteser,
- Dipankar Raychaudhuri,
- Yanyong Zhang
MobiCom '21: Proceedings of the 27th Annual International Conference on Mobile Computing and Networking, Pages 201–214. https://doi.org/10.1145/3447993.3448628
Abstract: As mobile devices continuously generate streams of images and videos, a new class of mobile deep vision applications is rapidly emerging, which usually involves running deep neural networks on this multimedia data in real time. To support such ...
- Research article, June 2021
Energy-Efficient Hybrid-RAM with Hybrid Bit-Serial based VMM Support
GLSVLSI '21: Proceedings of the 2021 Great Lakes Symposium on VLSI, Pages 347–352. https://doi.org/10.1145/3453688.3461528
Abstract: This work presents HRAM, an SRAM-based hybrid memory bit-cell for energy-efficient in-memory computing. The HRAM bit-cell consists of a conventional 6T-SRAM for static data storage, plus one extra access transistor and a capacitor for caching data ...
- Research article, June 2021
Re2PIM: A Reconfigurable ReRAM-Based PIM Design for Variable-Sized Vector-Matrix Multiplication
GLSVLSI '21: Proceedings of the 2021 Great Lakes Symposium on VLSI, Pages 15–20. https://doi.org/10.1145/3453688.3461494
Abstract: ReRAM-based deep neural network (DNN) accelerators show enormous potential because of ReRAM's high computational density and power efficiency. A typical feature of DNNs is that weight matrix size varies across diverse DNNs and DNN layers. However, ...