User profiles for Dorian C. Arnold
Dorian ArnoldEmory University Verified email at emory.edu Cited by 2396 |
MRNet: A software-based multicast/reduction network for scalable tools
We present MRNet, a software-based multicast/reduction network for building scalable
performance and system administration tools. MRNet supports multiple simultaneous, …
performance and system administration tools. MRNet supports multiple simultaneous, …
Stack trace analysis for large scale debugging
We present the Stack Trace Analysis Tool (STAT) to aid in debugging extreme-scale applications.
STAT can reduce problem exploration spaces from thousands of processes to a few by …
STAT can reduce problem exploration spaces from thousands of processes to a few by …
Lessons learned at 208k: towards debugging millions of cores
Petascale systems will present several new challenges to performance and correctness
tools. Such machines may contain millions of cores, requiring that tools use scalable data …
tools. Such machines may contain millions of cores, requiring that tools use scalable data …
Innovations of the NetSolve grid computing system
DC Arnold, H Casanova… - … and computation: practice …, 2002 - Wiley Online Library
The NetSolve Grid Computing System was first developed in the mid 1990s to provide users
with seamless access to remote computational hardware and software resources. Since …
with seamless access to remote computational hardware and software resources. Since …
Request sequencing: Optimizing communication for the Grid
DC Arnold, D Bachmann, J Dongarra - European Conference on Parallel …, 2000 - Springer
As we research to make the use of Computational Grids seamless, the allocation of resources
in these dynamic environments is proving to be very unwieldy. In this paper, we introduce, …
in these dynamic environments is proving to be very unwieldy. In this paper, we introduce, …
A framework for scalable, parallel performance monitoring
Performance monitoring of HPC applications offers opportunities for adaptive optimization
based on the dynamic performance behavior, unavailable in purely post‐mortem performance …
based on the dynamic performance behavior, unavailable in purely post‐mortem performance …
(SAI) Stalled, Active and Idle: Characterizing Power and Performance of Large-Scale Dragonfly Networks
…, M Levenhagen, DC Arnold - 2016 IEEE …, 2016 - ieeexplore.ieee.org
Exascale networks are expected to comprise a significant part of the total monetary cost and
10-20% of the power budget allocated to exascale systems. Yet, our understanding of …
10-20% of the power budget allocated to exascale systems. Yet, our understanding of …
Scalable failure recovery for high-performance data aggregation
Many high-performance tools, applications and infrastructures, such as Paradyn, STAT, TAU,
Ganglia, SuperMon, Astrolabe, Borealis, and MRNet, use data aggregation to synthesize …
Ganglia, SuperMon, Astrolabe, Borealis, and MRNet, use data aggregation to synthesize …
[PDF][PDF] Reliable, scalable tree-based overlay networks
DC Arnold - 2008 - ftp1.cs.wisc.edu
Ultimately, our Creator is responsible for all. I was fortunate to have an excellent and strong
support system throughout this process. With great pleasure and extreme gratitude, I thank …
support system throughout this process. With great pleasure and extreme gratitude, I thank …
Overcoming scalability challenges for tool daemon launching
Many tools that target parallel and distributed environments must co-locate a set of daemons
with the distributed processes of the target application. However, efficient and portable …
with the distributed processes of the target application. However, efficient and portable …