Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/181014.181081acmconferencesArticle/Chapter ViewAbstractPublication PagesspaaConference Proceedingsconference-collections
Article
Free access

Experiences with parallel N-body simulation

Published: 01 August 1994 Publication History

Abstract

This paper describes our experiences developing high-performance code for astrophysical N-body simulations. Recent N-body methods are based on an adaptive tree structure. The tree must be built and maintained across physically distributed memory; moreover, the communication requirements are irregular and adaptive. Together with the need to balance the computational work-load among processors, these issues pose interesting challenges and tradeoffs for high-performance implementation.
Our implementation was guided by the need to keep solutions simple and general. We use a technique for implicitly representing a dynamic global tree across multiple processors which substantially reduces the programming complexity as well as the performance overheads of distributed memory architectures. The contributions include methods to vectorize the computation and minimize communication time which are theoretically and experimentally justified.
The code has been tested by varying the number and distribution of bodies on different configurations of the Connection Machine CM-5. The overall performance on instances with 10 million bodies is typically over 30% of the peak machine rate. Preliminary timings compare favorably with other approaches.

References

[1]
Cmmd reference manual. Technical report, Thinking Machine Corporation, 1993.
[2]
S.J. Aarseth, M. Henon, and R. Wielen. Astronomy and Astrophysics, 37, 1974.
[3]
R. Anderson. personal communication. 1993.
[4]
A.W. Appel. An efficient program for many-body simulation. SIAM J. Sci. Star. Comput., 6, 1985.
[5]
J. Barnes. A modified tree code: Don't laugh; it runs. Journal o/ Computational Physics, 87:161- 170, 1990.
[6]
J. Barnes and P. Hut. A hierarchical O(Nlog N) force-calculation algorithm. Nature, 324:446-449, 1986.
[7]
S. Bhatt, M. Chen, C-Y Lin, and P. Liu. Abstractions for parallel N-body simulations. In Proceedings o# SHPCC 9P.
[8]
L. Greengard and V. Rokhlin. A fast algorithm for particle simulations. J. Comput. Physics, 73:325, 1987.
[9]
L. Johnsson and Y. Hu. personal communication. 1993.
[10]
J. F. Leathrum Jr. and J. Board Jr. The parallel fast multipole algorithm in three dimensions.
[11]
P. Liu, W. Aiello, and S. Bhatt. An atomic model for message passing. In 5th Annual A CM Symposium on Parallel Algorithms and Architecture, 1993.
[12]
L. Nyland, J. Prins, and J. Reif. A data-parallel implementation of the adaptive fast multipole algorithm. In DAGS/PC Symposium, 1993.
[13]
J. Salmon. Parallel Hierarchical N-body Methods. PhD thesis, Caltech, 1990.
[14]
J. Singh. Parallel Hierarchical N-body Methods and their Implications for Multiprocessors. PhD thesis, Stanford University, 1993.
[15]
J. Singh, C. Holt, T. Totsuka, A. Gupta, and J. Hennessy. Load balancing and data locality in hierarchical n-body methods. Technical Report CSL- TR-92-505, Stanford University, 1992.
[16]
S. Sundaram. Fast Algorithms/or N-body Simulations. PhD thesis, Cornell University, 1993.
[17]
M. Warren and J. Salmon. Astrophysical n-body simulations using hierarchical tree data structures. In Proceedings of Supercomputing, 1992.
[18]
M. Warren and J. Salmon. A parallel hashed octtree n-body algorithm, 1993.
[19]
F. Zhao and S.L. Johnsson. The parallel multipole method on the connection machine. Technical Report DCS/TR-749, Yale University, 1989.

Cited By

View all
  • (2012) Improving the scalability of parallel N -body applications with an event-driven constraint-based execution model The International Journal of High Performance Computing Applications10.1177/109434201244058526:3(319-332)Online publication date: 11-Apr-2012
  • (2005)Massively parallel implementation of a fast multipole method for distributed memory machinesJournal of Parallel and Distributed Computing10.1016/j.jpdc.2005.02.00165:7(870-881)Online publication date: 1-Jul-2005
  • (2005)Parallel progressive radiosity with adaptive meshingParallel Algorithms for Irregularly Structured Problems10.1007/BFb0030106(159-170)Online publication date: 16-Jun-2005
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SPAA '94: Proceedings of the sixth annual ACM symposium on Parallel algorithms and architectures
August 1994
374 pages
ISBN:0897916719
DOI:10.1145/181014
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 August 1994

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

6SPAA94
Sponsor:

Acceptance Rates

Overall Acceptance Rate 447 of 1,461 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)140
  • Downloads (Last 6 weeks)12
Reflects downloads up to 02 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2012) Improving the scalability of parallel N -body applications with an event-driven constraint-based execution model The International Journal of High Performance Computing Applications10.1177/109434201244058526:3(319-332)Online publication date: 11-Apr-2012
  • (2005)Massively parallel implementation of a fast multipole method for distributed memory machinesJournal of Parallel and Distributed Computing10.1016/j.jpdc.2005.02.00165:7(870-881)Online publication date: 1-Jul-2005
  • (2005)Parallel progressive radiosity with adaptive meshingParallel Algorithms for Irregularly Structured Problems10.1007/BFb0030106(159-170)Online publication date: 16-Jun-2005
  • (2005)Bandwidth efficient parallel computationAutomata, Languages and Programming10.1007/3-540-61440-0_114(4-23)Online publication date: 2-Jun-2005
  • (2004)Parallel simulation of group behaviorsProceedings of the 36th conference on Winter simulation10.5555/1161734.1161806(364-370)Online publication date: 5-Dec-2004
  • (2004)Parallel Simulation of Group BehaviorsProceedings of the 2004 Winter Simulation Conference, 2004.10.1109/WSC.2004.1371337(356-362)Online publication date: 2004
  • (2004)Radio-wave propagation prediction using ray-tracing techniques on a network of workstations (NOW)Journal of Parallel and Distributed Computing10.1016/j.jpdc.2004.07.00464:10(1127-1156)Online publication date: 1-Oct-2004
  • (2002)A Comparison of Three Programming Models for Adaptive Applications on the Origin2000Journal of Parallel and Distributed Computing10.1006/jpdc.2001.177762:2(241-266)Online publication date: 1-Feb-2002
  • (2000)A comparison of three programming models for adaptive applications on the Origin2000Proceedings of the 2000 ACM/IEEE conference on Supercomputing10.5555/370049.370070(11-es)Online publication date: 1-Nov-2000
  • (2000)A Comparison of Three Programming Models for Adaptive Applications on the Origin2000ACM/IEEE SC 2000 Conference (SC'00)10.1109/SC.2000.10004(11-11)Online publication date: 2000
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media