invited-talk

Open access

Opportunities for RTL and gate level simulation using GPUs

Authors:

Brucek KhailanyAuthors Info & Claims

ICCAD '20: Proceedings of the 39th International Conference on Computer-Aided Design

Article No.: 166, Pages 1 - 5

https://doi.org/10.1145/3400302.3415773

Published: 17 December 2020 Publication History

Abstract

This paper summarizes the opportunities in accelerating simulation on parallel processing hardware platforms such as GPUs. First, we give a summary of prior art. Then, we propose the idea that coding frameworks usually used for popular machine learning (ML) topics, such as PyTorch/DGL.ai, can also be used for exploring simulation purposes. We demo a crude oblivious two-value cycle gate-level simulator using the higher level ML framework APIs that exhibits >20X speedup, despite its simplistic construction. Next, we summarize recent advances in GPU features that may provide additional opportunities to further state-of-the-art results. Finally, we conclude and touch upon some potential areas for furthering research into the topic of GPU accelerated simulation.

References

[1]

NVIDIA A100 Tensor Core GPU Architecture. https://www.nvidia.com/content/dam/en-zz/Solutions/Data-Center/nvidia-ampere-architecture-whitepaper.pdf, 2020.

[2]

Stefan Holst, Michael E. Imhof, and Hans-Joachim Wunderlich. High-throughput logic timing simulation on gpgpus. ACM Trans. Des. Autom. Electron. Syst., 20(3), June 2015.

Digital Library

[3]

Yuhao Zhu, Bo Wang, and Yangdong Deng. Massively parallel logic simulation with gpus. ACM Trans. Des. Autom. Electron. Syst., 16(3), June 2011.

Digital Library

[4]

A. Sen, B. Aksanli, M. Bozkurt, and M. Mert. Parallel cycle based logic simulation using graphics processing units. In 2010 Ninth International Symposium on Parallel and Distributed Computing, pages 71--78, 2010.

Digital Library

[5]

Debapriya Chatterjee, Andrew Deorio, and Valeria Bertacco. Gate-level simulation with gpu computing. ACM Trans. Des. Autom. Electron. Syst., 16(3), June 2011.

Digital Library

[6]

H. Qian and Y. Deng. Accelerating rtl simulation with gpus. In 2011 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), pages 687--693, 2011.

Digital Library

[7]

S. Vinco, V. Bertacco, D. Chatterjee, and F. Fummi. Saga: Systemc acceleration on gpu architectures. In DAC Design Automation Conference 2012, pages 115--120, 2012.

Digital Library

[8]

B. Catanzaro, K. Keutzer, and Bor-Yiing Su. Parallelizing cad: A timely research agenda for eda. In 2008 45th ACM/IEEE Design Automation Conference, pages 12--17, 2008.

Digital Library

[9]

PyTorch. https://pytorch.org, 2020.

[10]

DGL.ai. https://www.dgl.ai, 2020.

[11]

CUDA Toolkit 11.0. https://developer.nvidia.com/cuda-downloads, 2020.

[12]

Yibo Lin, Shounak Dhar, Wuxi Li, Haoxing Ren, Brucek Khailany, and David Z. Pan. Dreamplace: Deep learning toolkit-enabled gpu acceleration for modern vlsi placement. In Proceedings of the 56th Annual Design Automation Conference 2019, DAC '19, New York, NY, USA, 2019. Association for Computing Machinery.

Digital Library

[13]

Thomas N. Kipf and Max Welling. Semi-supervised classification with graph convolutional networks, 2016.

[14]

Yanqing Zhang, Haoxing Ren, Ben Keller, and Brucek Khailany. Problem c: Gpu accelerated logic re-simulation. In 2020 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 2020.

[15]

NVIDIA Ampere Architecture In-Depth. https://developer.nvidia.com/blog/nvidia-ampere-architecture-in-depth, 2020.

[16]

NVIDIA Nsight Compute. https://developer.nvidia.com/nsight-compute, 2020.

Cited By

Tian XYue CPi YLi TQu W(2024)CPGPUSim: A Multi-dimensional Parallel Acceleration Framework for RTL Simulation2024 2nd International Symposium of Electronics Design Automation (ISEDA)10.1109/ISEDA62518.2024.10618075(272-277)Online publication date: 10-May-2024
https://doi.org/10.1109/ISEDA62518.2024.10618075
Lin DOgras UMiguel JHuang T(2024)TaroRTL: Accelerating RTL Simulation Using Coroutine-Based Heterogeneous Task Graph SchedulingEuro-Par 2024: Parallel Processing10.1007/978-3-031-69583-4_11(151-166)Online publication date: 26-Aug-2024
https://doi.org/10.1007/978-3-031-69583-4_11
Wang HBeamer SAamodt TJerger NSwift M(2023)RepCut: Superlinear Parallel RTL Simulation with Replication-Aided PartitioningProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 310.1145/3582016.3582034(572-585)Online publication date: 25-Mar-2023
https://dl.acm.org/doi/10.1145/3582016.3582034
Show More Cited By

Recommendations

From RTL to CUDA: A GPU Acceleration Flow for RTL Simulation with Batch Stimulus
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing

High-throughput RTL simulation is critical for verifying today’s highly complex SoCs. Recent research has explored accelerating RTL simulation by leveraging event-driven approaches or partitioning heuristics to speed up simulation on a single stimulus. ...
RepCut: Superlinear Parallel RTL Simulation with Replication-Aided Partitioning
ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3

Register transfer level (RTL) simulation is an invaluable tool for developing, debugging, verifying, and validating hardware designs. Despite the parallel nature of hardware, existing parallel RTL simulators yield speedups unattractive for practical ...
GATSPI: GPU accelerated gate-level simulation for power improvement
DAC '22: Proceedings of the 59th ACM/IEEE Design Automation Conference

In this paper, we present GATSPI, a novel GPU accelerated logic gate simulator that enables ultra-fast power estimation for industry-sized ASIC designs with millions of gates. GATSPI is written in PyTorch with custom CUDA kernels for ease of coding and ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

ICCAD '20: Proceedings of the 39th International Conference on Computer-Aided Design

November 2020

1396 pages

ISBN:9781450380263

DOI:10.1145/3400302

General Chair:
Yuan Xie
Univ. of California, Santa Barbara, CA

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGDA: ACM Special Interest Group on Design Automation

In-Cooperation

IEEE CAS
IEEE CEDA
IEEE CS

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 December 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Invited-talk

Conference

ICCAD '20

Sponsor:

SIGDA

ICCAD '20: IEEE/ACM International Conference on Computer-Aided Design

November 2 - 5, 2020

Virtual Event, USA

Acceptance Rates

Overall Acceptance Rate 457 of 1,762 submissions, 26%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

8
Total Citations
View Citations
820
Total Downloads

Downloads (Last 12 months)195
Downloads (Last 6 weeks)23

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Tian XYue CPi YLi TQu W(2024)CPGPUSim: A Multi-dimensional Parallel Acceleration Framework for RTL Simulation2024 2nd International Symposium of Electronics Design Automation (ISEDA)10.1109/ISEDA62518.2024.10618075(272-277)Online publication date: 10-May-2024
https://doi.org/10.1109/ISEDA62518.2024.10618075
Lin DOgras UMiguel JHuang T(2024)TaroRTL: Accelerating RTL Simulation Using Coroutine-Based Heterogeneous Task Graph SchedulingEuro-Par 2024: Parallel Processing10.1007/978-3-031-69583-4_11(151-166)Online publication date: 26-Aug-2024
https://doi.org/10.1007/978-3-031-69583-4_11
Wang HBeamer SAamodt TJerger NSwift M(2023)RepCut: Superlinear Parallel RTL Simulation with Replication-Aided PartitioningProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 310.1145/3582016.3582034(572-585)Online publication date: 25-Mar-2023
https://dl.acm.org/doi/10.1145/3582016.3582034
Dzaka ELin DHuang T(2023)Parallel And-Inverter Graph Simulation Using a Task-graph Computing System2023 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)10.1109/IPDPSW59300.2023.00150(923-929)Online publication date: May-2023
https://doi.org/10.1109/IPDPSW59300.2023.00150
Gavier IRussell JPatel DRietman ESiegelmann H(2023)Neural Network Compiler for Parallel High-Throughput Simulation of Digital Circuits2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS54959.2023.00067(613-623)Online publication date: May-2023
https://doi.org/10.1109/IPDPS54959.2023.00067
Guo ZZhang ZJiang XLi WLin YWang RHuang R(2023)General-Purpose Gate-Level Simulation with Partition-Agnostic Parallelism2023 60th ACM/IEEE Design Automation Conference (DAC)10.1109/DAC56929.2023.10247907(1-6)Online publication date: 9-Jul-2023
https://doi.org/10.1109/DAC56929.2023.10247907
Emil DHamdy MNagib G(2023)Development an efficient AXI-interconnect unit between set of customized peripheral devices and an implemented dual-core RISC-V processorThe Journal of Supercomputing10.1007/s11227-023-05304-179:15(17000-17019)Online publication date: 5-May-2023
https://doi.org/10.1007/s11227-023-05304-1
Chhabria VKeller BZhang YVollala SPratty SRen HKhailany BFranzon PKahng ALi HLi B(2022)XT-PRAGGMA: Crosstalk Pessimism Reduction Achieved with GPU Gate-level Simulations and Machine LearningProceedings of the 2022 ACM/IEEE Workshop on Machine Learning for CAD10.1145/3551901.3556483(63-69)Online publication date: 12-Sep-2022
https://dl.acm.org/doi/10.1145/3551901.3556483
Huang GHu JHe YLiu JMa MShen ZWu JXu YZhang HZhong KNing XMa YYang HYu BYang HWang Y(2021)Machine Learning for Electronic Design Automation: A SurveyACM Transactions on Design Automation of Electronic Systems10.1145/345117926:5(1-46)Online publication date: 5-Jun-2021
https://dl.acm.org/doi/10.1145/3451179

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Figures

Tables

Media

View Table of Conten