- Article, September 2010
Measuring execution times of collective communications in an empirical optimization framework
An essential part of an empirical optimization library is the set of timing procedures with which the performance of different codelets is determined. In this paper, we present four different timing methods to optimize collective MPI communications and ...
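As background for the timing procedures the abstract mentions, here is a minimal Python sketch, not taken from the paper: the function name `time_codelet` and the minimum-of-samples policy are illustrative assumptions. It runs a codelet repeatedly and reports the smallest sample, a common way to suppress system noise.

```python
import time

def time_codelet(codelet, reps=10):
    """Time a communication codelet (illustrative sketch).

    Runs `codelet` `reps` times and returns the minimum per-call wall
    time, since the minimum is least affected by transient system noise.
    """
    samples = []
    for _ in range(reps):
        t0 = time.perf_counter()
        codelet()
        samples.append(time.perf_counter() - t0)
    return min(samples)
```

In a real MPI setting, each repetition would typically be preceded by a barrier so that all ranks enter the collective together before the clock starts.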
- Article, September 2010
Adaptive MPI multirail tuning for non-uniform input/output access
Multicore processors have not only reintroduced Non-Uniform Memory Access (NUMA) architectures in today's parallel computers, but they are also responsible for non-uniform access times with respect to Input/Output devices (NUIOA). In clusters of ...
- Article, September 2010
Load balancing for regular meshes on SMPs with MPI
Domain decomposition for regular meshes on parallel computers has traditionally been performed by attempting to exactly partition the work among the available processors (now cores). However, these strategies often do not consider the inherent system ...
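As a point of reference for the traditional approach this abstract mentions, here is a minimal Python sketch of an exact block partition of mesh rows among cores. The function name and return convention are illustrative assumptions, not from the paper.

```python
def block_partition(n_rows, n_ranks):
    """Split n_rows of a regular mesh as evenly as possible among n_ranks.

    Returns a list of (start, end) half-open row ranges, one per rank.
    The first (n_rows % n_ranks) ranks each get one extra row.
    """
    base, extra = divmod(n_rows, n_ranks)
    ranges = []
    start = 0
    for rank in range(n_ranks):
        size = base + (1 if rank < extra else 0)
        ranges.append((start, start + size))
        start += size
    return ranges

# 10 rows over 4 ranks: block sizes 3, 3, 2, 2
print(block_partition(10, 4))  # [(0, 3), (3, 6), (6, 8), (8, 10)]
```

Such an exact split equalizes cell counts but, as the abstract notes, ignores system effects (OS noise, shared caches) that can still leave cores unevenly loaded.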
- Article, September 2010
Transparent redundant computing with MPI
Extreme-scale parallel systems will require alternative methods for applications to maintain current levels of uninterrupted execution. Redundant computation is one approach to consider, if the benefits of increased resiliency outweigh the cost of ...
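The core idea of redundant computation can be sketched in a few lines of Python; this is a toy model under stated assumptions (sequential "replicas" standing in for redundant MPI processes, exact equality as the agreement check), not the paper's implementation.

```python
def run_redundant(task, replicas=2):
    """Run `task` on `replicas` independent workers and cross-check.

    If all replicas agree, return the common result; otherwise raise,
    signalling that at least one replica failed silently.
    """
    results = [task() for _ in range(replicas)]
    if all(r == results[0] for r in results):
        return results[0]
    raise RuntimeError("replica results diverge: %r" % (results,))

# A deterministic task passes the cross-check.
print(run_redundant(lambda: sum(range(100))))  # 4950
```

The resiliency benefit is bought with `replicas` times the compute cost, which is exactly the trade-off the abstract weighs.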
- Article, September 2010
Communication target selection for replicated MPI processes
VolpexMPI is an MPI library designed for volunteer computing environments. In order to cope with the fundamental unreliability of these environments, VolpexMPI deploys two or more replicas of each MPI process. A receiver-driven communication scheme is ...
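One way a receiver-driven scheme can pick among replicas is by preferring the most responsive one. The sketch below is a hypothetical Python illustration of that general idea, not VolpexMPI's actual selection policy; the function name and latency-map input are assumptions.

```python
def select_target(replica_latencies):
    """Pick a communication target among replicas of one logical process.

    `replica_latencies` maps replica id -> most recently observed
    response latency in seconds, or None if the replica is unresponsive.
    Returns the id of the fastest live replica.
    """
    live = {rid: lat for rid, lat in replica_latencies.items()
            if lat is not None}
    if not live:
        raise RuntimeError("no live replica for this logical process")
    return min(live, key=live.get)

print(select_target({"A0": 0.004, "A1": None, "A2": 0.0015}))  # A2
```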
- Article, September 2010
Dodging the cost of unavoidable memory copies in message logging protocols
With the number of computing elements spiraling to hundreds of thousands in modern HPC systems, failures are common events. Few applications are nevertheless fault tolerant; most are in need of a seamless recovery framework. Among the automatic fault ...
- Article, September 2010
Characteristics of the unexpected message queue of MPI applications
High Performance Computing systems are used on a regular basis to run a myriad of application codes, yet a surprising dearth of information exists with respect to communications characteristics. Even less information is available on the low-level ...
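For readers unfamiliar with the mechanism this paper measures: a message that arrives before a matching receive has been posted is buffered in the unexpected message queue (UMQ), and each later receive searches that queue before waiting. A toy Python model of this matching logic (class and method names are illustrative, not any MPI library's API):

```python
from collections import deque

class UnexpectedQueue:
    """Toy model of an MPI unexpected message queue (UMQ).

    Messages arriving before a matching receive is posted are buffered
    here; each new receive searches the queue in arrival order.
    """

    def __init__(self):
        self._queue = deque()

    def deliver(self, source, tag, payload):
        """Buffer an arriving message with no matching posted receive."""
        self._queue.append((source, tag, payload))

    def post_receive(self, source, tag):
        """Return the payload of the first queued message matching
        (source, tag), or None if the receive must wait for arrival."""
        for i, (s, t, payload) in enumerate(self._queue):
            if s == source and t == tag:
                del self._queue[i]
                return payload
        return None

umq = UnexpectedQueue()
umq.deliver(source=1, tag=7, payload="early")
print(umq.post_receive(source=1, tag=7))  # early
```

Because every receive may scan the whole queue, long UMQs make matching expensive, which is one reason the queue's characteristics matter for performance.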
- Article, September 2010
Implementing MPI on Windows: comparison with common approaches on Unix
Commercial HPC applications are often run on clusters that use the Microsoft Windows operating system and need an MPI implementation that runs efficiently in the Windows environment. The MPI developer community, however, is more familiar with the issues ...
- Article, September 2010
Network offloaded hierarchical collectives using ConnectX-2's CORE-Direct capabilities
As the scale of High Performance Computing (HPC) systems continues to increase, demanding that we extract even more parallelism from applications, the need to move communication management away from the Central Processing Unit (CPU) becomes even ...
- Article, September 2010
Design of kernel-level asynchronous collective communication
Overlapping computation and communication, not only point-to-point but also collective communications, is an important technique to improve the performance of parallel programs. Since the current non-blocking collective communications have been mostly ...
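The overlap pattern this abstract targets (post a non-blocking collective, do local work, then wait for completion) can be sketched in Python, with a thread standing in for the kernel-level progress engine; the function names are illustrative assumptions, not the paper's interface.

```python
from concurrent.futures import ThreadPoolExecutor

def overlapped(compute, communicate):
    """Overlap local computation with a (simulated) non-blocking collective.

    Posts `communicate` asynchronously (like MPI_Iallreduce: post and
    return), runs `compute` while the transfer proceeds, then waits for
    the communication result (like MPI_Wait).
    """
    with ThreadPoolExecutor(max_workers=1) as pool:
        fut = pool.submit(communicate)  # post the collective
        local = compute()               # useful work during the transfer
        remote = fut.result()           # complete the collective
    return local, remote

print(overlapped(lambda: sum(range(10)), lambda: "reduced"))  # (45, 'reduced')
```

The paper's point is where the progress happens: if the collective only advances when the application re-enters the MPI library, the overlap is illusory, which motivates driving it asynchronously at kernel level.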