Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Analysis of Multigrid Algorithms on Massively Parallel Computers

Published: 25 February 1996 Publication History

Abstract

We study the potential performance of multigrid algorithms running on massively parallel computers with the intent of discovering whether currently envisioned machines will provide an efficient platform for such algorithms. These algorithms substantially improve the performance of iterative methods of solving partial differential equations. We consider the domain parallel version of the standard V-cycle multigrid algorithm on model problems, discretized using finite difference techniques in two and three dimensions on block-structured grids of size 106and 109, respectively. We develop a set of models of parallel computation which reflect the computing characteristics of the current generation of massively parallel multicomputers. These models are based on an interconnection network of 256 to 16,384 message passing, “workstation size” processors executing in a SPMD mode. The models, based on the computing characteristics of an architectural class, provide metrics which balance abstraction with machine specificity. With the medium grain parallelism of the current generation and the high fixed cost of an interprocessor communication, our analysis suggests that an efficient implementation for practical problem sizes requires the machine to support the efficient transmission of long messages (up to 1000 words); otherwise the high initiation cost of a communication must be significantly reduced through an alternative optimization technique. The analysis also suggests that low diameter multistage networks provide little or no advantage over a simple single stage communications network. Finally, the analysis suggests that fine grain parallelism and low fixed communication costs may provide more efficiency than medium grain parallelism with low variable communications costs.

References

[1]
Alverson, R., Callahan, D., Cumming, D., Koblenz, B., Porterfield, A., and Smith, B. The TERA computer system. Proc. ACM Symposium on Supercomputing, 1990, pp. 1-7.
[2]
Briggs, W. A Multigrid Tutuorial. SIAM, Philadelphia, 1987.
[3]
Bozkus, Z., Ranka, S., and Fox, G. Benchmarking the CM-5 multicomputer. Proc. of the Fourth Symposium on the Frontiers of Massively Parallel Computation (H. Siegel, Ed.), IEEE Computer Society Press, 1992, pp. 100-107.
[4]
Chan, T., and Tuminaro, R. Analysis of a parallel multigrid algorithm. Proc. of the Fourth Copper Mountain Conference on Multigrid Methods (S. McCormick, Ed.). 1989, pp. 66-86.
[5]
The Connection Machine Series, Technical Summary. Thinking Machines, Inc., Cambridge, MA., 1992.
[6]
CM5 Technical Summary. Thinking Machines Corporation, Cambridge, MA, 1991.
[7]
Culler, D., Karp, R., Patterson, D., Sahay, A., Schauser, K., Santos, E., Subramonian, R., and von Eiken, T. LogP: Towards a realistic model of parallel computation. SIGPLAN Not. 28 (July 1993), 1-12.
[8]
Dally, W. Express cubes: Improving performance of k-ary n-cube interconnection networks. IEEE Trans. Compu. 40 (1991), 1016-1023.
[9]
Dorward, S., Matheson, L., and Tarjan, R. Unstructured multigrid strategies on massively parallel computers: A case for integrated design. Proc. of the Twenty-Seventh Annual Hawaii International Conference on System Sciences. IEEE Computer Society Press, 1994, pp. 169-179.
[10]
Forsythe, G., and Wasow, W. Finite Difference Methods for Partial Differential Equations. Wiley, New York, 1960.
[11]
Gannon, D., and Van Rosendale, J. On the structure of parallelism in a highly concurrent PDE solver. J. Parallel Distrib. Comput. 3, (1986), 106-135.
[12]
Greenbaum, A. A multigrid method for multiprocessors. Appl. Math. Comput. 19 (1986), 75-88.
[13]
Hockney, R. The communications challenge for MPP: The intel paragon and the Meiko CS-2. Parallel Comput. 20 (Mar. 1994), 389-398.
[14]
Jesperson, D. Multigrid methods for partial differential equations. In Golub, G. (Ed.). Studies in Numerical Analysis. Math. Soc. of America, Washington, DC, 1984, pp. 270-318.
[15]
Lenoski, D., Laudon, J., Joe, T., Nakahira, D., Stevens, L., Gupta, A., and Hennessy, J. The Stanford DASH prototype: Logic overhead and performance. IEEE Trans. Parallel Distrib. Systems 4 (Jan. 1993), 41-61.
[16]
Martinelli, L. Private communication. Department of Aerospace and Mechanical Engineering, Princeton University, 1994.
[17]
Matheson, L. Multigrid algorithms on massively parallel computers. Ph.D. dissertation, Department of Computer Science, Princeton University, 1994.
[18]
Matheson, L., and Tarjan, R. A critical analysis of multigrid methods on massively parallel computers. Contributions to Multigrid: A Selection of Contributions to the Fourth European Multigrid Conference. CWI, Amsterdam, to appear.
[19]
Ponnusamy, R., Thakur, R., Choudhary, A., Velamakanni, K., Bozkus, Z., and Fox, G. Communications on the CM-5: An experimental performance evaluation. J. Parallel Distrib. Comput. 19 (Nov. 1993), 192-202.
[20]
Tomayo, P. Private communications. Applications Analyst, Thinking Machines, Inc., Cambridge, MA, 1992.
[21]
Seitz, C. Private communication. California Institute of Technology, 1992.
[22]
Spertus, E., Goldstein, S., Schauser, K., von Eiken, T., Culler D., and Dally, W. Evaluation of mechanisms for fine-grained parallel programs in the J-machine and the CM-5. Proc. of the 20th Annual International Symposium on Computer Architecture, May 1993, pp. 302-313.
[23]
T3D System Architecture Overview. #HP-04033, Cray Research, Inc., Egan, MN, 1993.
[24]
Valiant, L. A bridging model for parallel computation. Comm. ACM 33 (Aug. 1990), 103-111.
[25]
Varga, R. Matrix iterative analysis. Prentice-Hall, Englewood Cliffs, NJ, 1962.
[26]
Von Eicken, T., Culler, D., Goldstein, S. Active messages: A mechanism for integrated communication and computation. Comput. Architecture News 20 (May 1992), 256-266.

Cited By

View all
  • (2023)An Efficient Parallel Adaptive GMG Solver for Large-Scale Stokes ProblemsEuro-Par 2023: Parallel Processing10.1007/978-3-031-39698-4_47(694-709)Online publication date: 28-Aug-2023
  • (2010)High-order accurate solution of the incompressible Navier-Stokes equations on massively parallel computersJournal of Computational Physics10.1016/j.jcp.2010.01.015229:10(3543-3572)Online publication date: 1-May-2010

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Journal of Parallel and Distributed Computing
Journal of Parallel and Distributed Computing  Volume 33, Issue 1
Feb. 25, 1996
106 pages
ISSN:0743-7315
Issue’s Table of Contents

Publisher

Academic Press, Inc.

United States

Publication History

Published: 25 February 1996

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)An Efficient Parallel Adaptive GMG Solver for Large-Scale Stokes ProblemsEuro-Par 2023: Parallel Processing10.1007/978-3-031-39698-4_47(694-709)Online publication date: 28-Aug-2023
  • (2010)High-order accurate solution of the incompressible Navier-Stokes equations on massively parallel computersJournal of Computational Physics10.1016/j.jcp.2010.01.015229:10(3543-3572)Online publication date: 1-May-2010

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media