DOI: 10.1109/CCGrid.2011.44
Article

Assertion Based Parallel Debugging

Published: 23 May 2011

Abstract

Programming languages have advanced tremendously over the years, but program debuggers have hardly changed. Sequential debuggers do little more than allow a user to control the flow of a program and examine its state. Parallel debuggers support the same operations across multiple processes, which is adequate for a small number of processors but becomes unwieldy and ineffective on very large machines. Typical scientific codes have enormous multi-dimensional data structures, and it is impractical to expect a user to view the data using traditional display techniques. In this paper we discuss the use of debug-time assertions and show that they can be used to debug parallel programs. The techniques reduce debugging complexity because they reason about the state of large arrays without requiring the user to know the expected value of every element. Assertions can be expensive to evaluate, but their performance can be improved by running them in parallel. We demonstrate the system with a case study that finds errors in a parallel version of the Shallow Water Equations, and we evaluate the performance of the tool on a 4,096-core Cray XE6.
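The idea of asserting a property of a whole array, and evaluating that assertion in parallel, can be illustrated with a minimal sketch. This is not the paper's tool or its assertion syntax: the function names (`split`, `chunk_stats`, `parallel_assert`) are hypothetical, and a thread pool stands in for the MPI processes a real parallel debugger would use, so it shows only the structure of the computation, not the speedup. Each worker reduces its own chunk to a few summary values, and the assertion is checked on the combined summary.

```python
from concurrent.futures import ThreadPoolExecutor
import math


def split(data, parts):
    """Partition a non-empty list into contiguous chunks (block decomposition)."""
    if not data:
        raise ValueError("data must be non-empty")
    size = math.ceil(len(data) / parts)
    return [data[i:i + size] for i in range(0, len(data), size)]


def chunk_stats(chunk):
    """Local reduction: (min, max, sum) of one chunk."""
    return min(chunk), max(chunk), math.fsum(chunk)


def parallel_assert(data, predicate, workers=4):
    """Evaluate predicate(min, max, total) over the whole array.

    Each worker summarises its own chunk; only three numbers per chunk
    are combined, so no one has to inspect individual elements.
    Hypothetical helper for illustration, not part of any real debugger.
    """
    chunks = split(data, workers)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial = list(pool.map(chunk_stats, chunks))
    lo = min(p[0] for p in partial)
    hi = max(p[1] for p in partial)
    total = math.fsum(p[2] for p in partial)
    return predicate(lo, hi, total)


# Example: a made-up "water height" field that should stay positive
# and keep a constant total (a conservation-style check).
height = [1.0] * 1000
print(parallel_assert(height, lambda lo, hi, total: lo > 0.0 and abs(total - 1000.0) < 1e-9))

height[537] = -2.0  # inject a fault in one element
print(parallel_assert(height, lambda lo, hi, total: lo > 0.0))
```

The user states a property of the array as a whole (positivity, a conserved total) rather than the expected value of each element, which is the kind of reasoning the abstract describes.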

Published In

CCGRID '11: Proceedings of the 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing
May 2011
638 pages
ISBN: 9780769543956

Publisher

IEEE Computer Society

United States

Author Tags

  1. Guard
  2. MPI
  3. assertion
  4. parallel debugger

Qualifiers

  • Article

Cited By

  • (2015) Relative debugging for a highly parallel hybrid computer system. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1-12. DOI: 10.1145/2807591.2807605. Online publication date: 15-Nov-2015
  • (2014) Accurate application progress analysis for large-scale parallel debugging. ACM SIGPLAN Notices 49:6, pp. 193-203. DOI: 10.1145/2666356.2594336. Online publication date: 9-Jun-2014
  • (2014) Accurate application progress analysis for large-scale parallel debugging. Proceedings of the 35th ACM SIGPLAN Conference on Programming Language Design and Implementation, pp. 193-203. DOI: 10.1145/2594291.2594336. Online publication date: 9-Jun-2014
  • (2012) Probabilistic diagnosis of performance faults in large-scale parallel applications. Proceedings of the 21st International Conference on Parallel Architectures and Compilation Techniques, pp. 213-222. DOI: 10.1145/2370816.2370848. Online publication date: 19-Sep-2012
  • (2012) Scalable parallel debugging with statistical assertions. ACM SIGPLAN Notices 47:8, pp. 311-312. DOI: 10.1145/2370036.2145870. Online publication date: 25-Feb-2012
  • (2012) Scalable parallel debugging with statistical assertions. Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp. 311-312. DOI: 10.1145/2145816.2145870. Online publication date: 25-Feb-2012
  • (2012) A Scalable Parallel Debugging Library with Pluggable Communication Protocols. Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2012), pp. 252-259. DOI: 10.1109/CCGrid.2012.13. Online publication date: 13-May-2012
