Abstract
Workload characterization is an important technique that helps us understand the performance of parallel applications and the demands they place on the system. It can be used to describe performance effects due to application parameters, compiler options, and platform configurations. In this paper, workload characterization features in the TAU parallel performance system are demonstrated for elucidating the performance of the MPI library based on the sizes of messages. Such characterization partitions the time spent in the MPI routines used by an application based on the type of MPI operation and the message size involved. It requires a two-level mapping of performance data, a unique feature implemented in TAU. Results from the NPB LU benchmark are presented. We also discuss the use of mapping for memory consumption characterization.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Shende, S., Malony, A.D.: The TAU Parallel Performance System. International Journal of High Performance Computing Applications 20(2), 287–331 (2006)
Ong, H., Subramaniyan, R., Leangsuksun, C., Studham, S.: OpenWLC: A Scalable Workload Characterization System. In: High Availability and Performance Workshop, in conjunction with Sixth LACSI Symposium (October 11-13, 2005), http://xcr.cenit.latech.edu/wlc/index.php?title=PUBLICATIONS
Borrill, J., Carter, J., Oliker, L., Skinner, D., Biswas, R.: Integrated Performance Monitoring of a Cosmology Application on Leading HEC Platforms. In: Proc. of International Conference on Parallel Processing (ICPP 2005), pp. 119–128. IEEE, Los Alamitos (2005)
Kufrin, R.: PerfSuite: An Accessible, Open Source Performance Analysis Environment for Linux. In: Proceedings of the 6th International Conference on Linux Clusters: The HPC Revolution 2005 (LCI-05) (2005)
Browne, S., Dongarra, J., Garner, N., Ho, G., Mucci, P.: A Portable Programming Interface for Performance Evaluation on Modern Processors. International Journal of High Performance Computing Applications 14(3), 189–204 (2000)
Shende, S.: The Role of Instrumentation and Mapping in Performance Measurement. Ph.D. Dissertation, University of Oregon (August 2001)
Malony, A.D., Shende, S., Morris, A.: Phase-Based Parallel Performance Profiling. In: Proceedings of the PARCO 2005 conference (2005)
Huck, K.A., Malony, A.D., Bell, R., Morris, A.: Design and Implementation of a Parallel Performance Data Management Framework. In: Proceedings of International Conference on Parallel Processing (ICPP 2005), IEEE Computer Society, Los Alamitos (2005)
Huck, K.A., Malony, A.D.: PerfExplorer: A Performance Data Mining Framework for Large-Scale Parallel Computing. In: SC 2005, ACM, New York (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shende, S., Malony, A.D., Morris, A. (2007). Workload Characterization Using the TAU Performance System. In: Kågström, B., Elmroth, E., Dongarra, J., Waśniewski, J. (eds) Applied Parallel Computing. State of the Art in Scientific Computing. PARA 2006. Lecture Notes in Computer Science, vol 4699. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75755-9_35
Download citation
DOI: https://doi.org/10.1007/978-3-540-75755-9_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75754-2
Online ISBN: 978-3-540-75755-9
eBook Packages: Computer ScienceComputer Science (R0)