Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3400302.3415773acmconferencesArticle/Chapter ViewAbstractPublication PagesiccadConference Proceedingsconference-collections
invited-talk
Open access

Opportunities for RTL and gate level simulation using GPUs

Published: 17 December 2020 Publication History

Abstract

This paper summarizes the opportunities in accelerating simulation on parallel processing hardware platforms such as GPUs. First, we give a summary of prior art. Then, we propose the idea that coding frameworks usually used for popular machine learning (ML) topics, such as PyTorch/DGL.ai, can also be used for exploring simulation purposes. We demo a crude oblivious two-value cycle gate-level simulator using the higher level ML framework APIs that exhibits >20X speedup, despite its simplistic construction. Next, we summarize recent advances in GPU features that may provide additional opportunities to further state-of-the-art results. Finally, we conclude and touch upon some potential areas for furthering research into the topic of GPU accelerated simulation.

References

[1]
NVIDIA A100 Tensor Core GPU Architecture. https://www.nvidia.com/content/dam/en-zz/Solutions/Data-Center/nvidia-ampere-architecture-whitepaper.pdf, 2020.
[2]
Stefan Holst, Michael E. Imhof, and Hans-Joachim Wunderlich. High-throughput logic timing simulation on gpgpus. ACM Trans. Des. Autom. Electron. Syst., 20(3), June 2015.
[3]
Yuhao Zhu, Bo Wang, and Yangdong Deng. Massively parallel logic simulation with gpus. ACM Trans. Des. Autom. Electron. Syst., 16(3), June 2011.
[4]
A. Sen, B. Aksanli, M. Bozkurt, and M. Mert. Parallel cycle based logic simulation using graphics processing units. In 2010 Ninth International Symposium on Parallel and Distributed Computing, pages 71--78, 2010.
[5]
Debapriya Chatterjee, Andrew Deorio, and Valeria Bertacco. Gate-level simulation with gpu computing. ACM Trans. Des. Autom. Electron. Syst., 16(3), June 2011.
[6]
H. Qian and Y. Deng. Accelerating rtl simulation with gpus. In 2011 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), pages 687--693, 2011.
[7]
S. Vinco, V. Bertacco, D. Chatterjee, and F. Fummi. Saga: Systemc acceleration on gpu architectures. In DAC Design Automation Conference 2012, pages 115--120, 2012.
[8]
B. Catanzaro, K. Keutzer, and Bor-Yiing Su. Parallelizing cad: A timely research agenda for eda. In 2008 45th ACM/IEEE Design Automation Conference, pages 12--17, 2008.
[9]
PyTorch. https://pytorch.org, 2020.
[10]
DGL.ai. https://www.dgl.ai, 2020.
[11]
CUDA Toolkit 11.0. https://developer.nvidia.com/cuda-downloads, 2020.
[12]
Yibo Lin, Shounak Dhar, Wuxi Li, Haoxing Ren, Brucek Khailany, and David Z. Pan. Dreamplace: Deep learning toolkit-enabled gpu acceleration for modern vlsi placement. In Proceedings of the 56th Annual Design Automation Conference 2019, DAC '19, New York, NY, USA, 2019. Association for Computing Machinery.
[13]
Thomas N. Kipf and Max Welling. Semi-supervised classification with graph convolutional networks, 2016.
[14]
Yanqing Zhang, Haoxing Ren, Ben Keller, and Brucek Khailany. Problem c: Gpu accelerated logic re-simulation. In 2020 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 2020.
[15]
NVIDIA Ampere Architecture In-Depth. https://developer.nvidia.com/blog/nvidia-ampere-architecture-in-depth, 2020.
[16]
NVIDIA Nsight Compute. https://developer.nvidia.com/nsight-compute, 2020.

Cited By

View all
  • (2024)CPGPUSim: A Multi-dimensional Parallel Acceleration Framework for RTL Simulation2024 2nd International Symposium of Electronics Design Automation (ISEDA)10.1109/ISEDA62518.2024.10618075(272-277)Online publication date: 10-May-2024
  • (2024)TaroRTL: Accelerating RTL Simulation Using Coroutine-Based Heterogeneous Task Graph SchedulingEuro-Par 2024: Parallel Processing10.1007/978-3-031-69583-4_11(151-166)Online publication date: 26-Aug-2024
  • (2023)RepCut: Superlinear Parallel RTL Simulation with Replication-Aided PartitioningProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 310.1145/3582016.3582034(572-585)Online publication date: 25-Mar-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ICCAD '20: Proceedings of the 39th International Conference on Computer-Aided Design
November 2020
1396 pages
ISBN:9781450380263
DOI:10.1145/3400302
  • General Chair:
  • Yuan Xie
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

In-Cooperation

  • IEEE CAS
  • IEEE CEDA
  • IEEE CS

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 December 2020

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Invited-talk

Conference

ICCAD '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 457 of 1,762 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)189
  • Downloads (Last 6 weeks)17
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)CPGPUSim: A Multi-dimensional Parallel Acceleration Framework for RTL Simulation2024 2nd International Symposium of Electronics Design Automation (ISEDA)10.1109/ISEDA62518.2024.10618075(272-277)Online publication date: 10-May-2024
  • (2024)TaroRTL: Accelerating RTL Simulation Using Coroutine-Based Heterogeneous Task Graph SchedulingEuro-Par 2024: Parallel Processing10.1007/978-3-031-69583-4_11(151-166)Online publication date: 26-Aug-2024
  • (2023)RepCut: Superlinear Parallel RTL Simulation with Replication-Aided PartitioningProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 310.1145/3582016.3582034(572-585)Online publication date: 25-Mar-2023
  • (2023)Parallel And-Inverter Graph Simulation Using a Task-graph Computing System2023 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)10.1109/IPDPSW59300.2023.00150(923-929)Online publication date: May-2023
  • (2023)Neural Network Compiler for Parallel High-Throughput Simulation of Digital Circuits2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS54959.2023.00067(613-623)Online publication date: May-2023
  • (2023)General-Purpose Gate-Level Simulation with Partition-Agnostic Parallelism2023 60th ACM/IEEE Design Automation Conference (DAC)10.1109/DAC56929.2023.10247907(1-6)Online publication date: 9-Jul-2023
  • (2023)Development an efficient AXI-interconnect unit between set of customized peripheral devices and an implemented dual-core RISC-V processorThe Journal of Supercomputing10.1007/s11227-023-05304-179:15(17000-17019)Online publication date: 5-May-2023
  • (2022)XT-PRAGGMA: Crosstalk Pessimism Reduction Achieved with GPU Gate-level Simulations and Machine LearningProceedings of the 2022 ACM/IEEE Workshop on Machine Learning for CAD10.1145/3551901.3556483(63-69)Online publication date: 12-Sep-2022
  • (2021)Machine Learning for Electronic Design Automation: A SurveyACM Transactions on Design Automation of Electronic Systems10.1145/345117926:5(1-46)Online publication date: 5-Jun-2021

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media