24/7 characterization of petascale I/O workloads

P Carns, R Latham, R Ross, K Iskra… - … on Cluster Computing …, 2009 - ieeexplore.ieee.org
2009 IEEE International Conference on Cluster Computing and Workshops, 2009ieeexplore.ieee.org
Developing and tuning computational science applications to run on extreme scale systems
are increasingly complicated processes. Challenges such as managing memory access and
tuning message-passing behavior are made easier by tools designed specifically to aid in
these processes. Tools that can help users better understand the behavior of their
application with respect to I/O have not yet reached the level of utility necessary to play a
central role in application development and tuning. This deficiency in the tool set means that …
Developing and tuning computational science applications to run on extreme scale systems are increasingly complicated processes. Challenges such as managing memory access and tuning message-passing behavior are made easier by tools designed specifically to aid in these processes. Tools that can help users better understand the behavior of their application with respect to I/O have not yet reached the level of utility necessary to play a central role in application development and tuning. This deficiency in the tool set means that we have a poor understanding of how specific applications interact with storage. Worse, the community has little knowledge of what sorts of access patterns are common in today's applications, leading to confusion in the storage research community as to the pressing needs of the computational science community. This paper describes the Darshan I/O characterization tool. Darshan is designed to capture an accurate picture of application I/O behavior, including properties such as patterns of access within files, with the minimum possible overhead. This characterization can shed important light on the I/O behavior of applications at extreme scale. Darshan also can enable researchers to gain greater insight into the overall patterns of access exhibited by such applications, helping the storage community to understand how to best serve current computational science applications and better predict the needs of future applications. In this work we demonstrate Darshan's ability to characterize the I/O behavior of four scientific applications and show that it induces negligible overhead for I/O intensive jobs with as many as 65,536 processes.
ieeexplore.ieee.org