research-article

Optimizing Communications of Dynamic Data Redistribution on Symmetrical Matrices in Parallelizing Compilers

Authors:

Ching-Hsien Hsu,

Ming-Hao Chen,

Chao-Tung Yang,

Kuan-Ching LiAuthors Info & Claims

IEEE Transactions on Parallel and Distributed Systems, Volume 17, Issue 11

Pages 1226 - 1241

https://doi.org/10.1109/TPDS.2006.162

Published: 01 November 2006 Publication History

Publisher Site

Abstract

Dynamic data redistribution is used to enhance data locality and algorithm performance by reducing interprocessor communication in many parallel scientific applications on distributed memory multicomputers. Since the redistribution is performed at runtime, there is a performance tradeoff between the efficiency of the new data decomposition for a subsequent phase of an algorithm and the cost of redistributing data among processors. In this paper, we present a processor replacement scheme to minimize the cost of interprocessor data exchange during runtime. The main idea of the proposed technique is to develop a replacement function for reordering logical processors in the destination phase. Based on the replacement function, a realigned sequence of destination processors can be derived and is then used to perform data decomposition in the receiving phase. Together with local matrix and compressed CRS vectors transposition schemes, the interprocessor communication can be eliminated during runtime. A significant improvement of this approach is that the realignment of data can be performed without interprocessor communication for special cases. The second contribution of the present technique is that the complicated communication sets generation could be simplified by applying local matrix transposition. Consequently, the indexing cost could be reduced significantly. The proposed techniques can be applied in both dense and sparse applications. A generalized symmetric redistribution algorithm is also presented in this work. To analyze the efficiency of the proposed technique, the theoretical analysis proves that up to (p-1)/p data transmission cost can be saved. For general cases, the symmetric redistribution algorithm saves 1/p communication overheads compared with the traditional method. Experimental results also show that the proposed techniques provide superior performance in most data redistribution instances.

References

[1]

B. Chapman, P. Mehrotra, H. Moritsch, and H. Zima, “Dynamic Data Distribution in Vienna Fortran,” Proc. Supercomputing Conf. '93, pp. 284-293, Nov. 1993.

Abstract

References

Cited By

Index Terms

Recommendations

A Basic-Cycle Calculation Technique for Efficient Dynamic Data Redistribution

A Compressed Diagonals Remapping Technique for Dynamic Data Redistribution on Banded Sparse Matrix

Processor Mapping Techniques Toward Efficient Data Redistribution

Comments

Information

Published In

Publisher

Publication History

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

Share

Share this Publication link

Share on social media

Affiliations