Article

Free access

Transparent adaptive parallelism on NOWs using OpenMP

Authors:

Willy ZwaenepoelAuthors Info & Claims

PPoPP '99: Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming

Pages 96 - 106

https://doi.org/10.1145/301104.301113

Published: 01 May 1999 Publication History

Abstract

We present a system that allows OpenMP programs to execute on a network of workstations with a variable number of nodes. The ability to adapt to a variable number of nodes allows a program to take advantage of additional nodes that become available after it starts execution, or to gracefully scale down when the number of available nodes is reduced. We demonstrate that the cost of adaptation is modest; the system allows a program to adapt at a moderate rate without much performance loss.Two ideas underlie the efficiency of our design. First, we recognize that OpenMP programs exhibit convenient adaptation points during their execution, points at which the cost of adaptation can be much reduced. Second, by allowing a process a certain grace period before it must leave a node, we insure that most adaptations can occur at these adaptation points, and thus at low cost. Migration of a process, a much more expensive method for providing adaptivity, is used only as a back-up solution, when the process cannot reach an adaptation point within the grace period.Our implementation consists of an OpenMP pre-processor that generates TreadMarks distributed shared memory (DSM) programs, and a version of TreadMarks modified to adapt to a variable number of nodes. Using a DSM as the underlying substrate facilitates the data (re-)distribution necessary after an adaptation.

References

[1]

S. E Amarasinghe, J. M. Anderson, M. S. Lam, and C. W. Tseng. An Overview of the SUIF Compiler for Scalable Parallel Machines. In Proceedings of the 7th SIAM Conference on Parallel Processing for Scientific Computing, pages 662-667, San Francisco, February 1995.]]

[2]

C. Amza, A.L. Cox, S. Dwarkadas, E Keleher, H. Lu, R. Rajamony, W. Yu, and W. Zwaenepoel. Treadmarks: Shared Memory Computing on Networks of Workstations. IEEE Computer, 29(2): 18-28, February 1996.]]

Digital Library

[3]

T.E. Anderson, D.E. Culler, and D.A. Patterson. A Case for NOW (Networks of Workstations). IEEE Micro, 15(2):54-64, February 1995.]]

Digital Library

[4]

R.D. Blumofe and P.A. Lisiecki. Adaptive and Reliable Parallel Computing on Network of Workstations. In Proceedings of the USENIX 1997 Annual Technical Symposium, pages 133-147, January 1997.]]

Digital Library

[5]

N. Carriero, E. Freeman, D. Gelernter, and D. Kaminsky. Adaptive Parallelism and Piranha. IEEE Computer, 28(1):40-49, January 1995.]]

Digital Library

[6]

K.M. Chandy and L. Lamport. Distributed Snapshots: Determining Global States of Distributed Systems. ACM Transactions on Computer Systems, 3(1):63-75, February 1985.]]

Digital Library

[7]

E Douglis and J. Ousterhout. Process Migration in the Sprite Operating System. In Proceedings of the 5th International Conference on Distributed Computing Systems, pages 18-25, September 1987.]]

[8]

G. Edjlali, G. Agrawal, A. Sussman, J. Humphries, and J. Saltz. Compiler and Runtime Support for Programming in Adaptive Parallel Environments. Scientific Programming, 6(2):215-227, Jan 1997.]]

[9]

P. J. Hatcher and M. J. Quinn. Data-parallel Programming on MIMD Computers. The MIT Press, Cambridge MA, 1991.]]

Digital Library

[10]

S. Ioannidis and S. Dwarkadas. Compiler and Run- Time Support for Adaptive Load Balancing in Software Distributed Memory Systems. In Languages, Compilers, and Run-Time Systems for Scalable Computers (Proc. 4th Intl. Workshop LCR'98), pages 107-122, Pittsburgh, PA, May 1998. Springer Verlag.]]

Digital Library

[11]

A. Itzkovitz, A. Schuster, and L. Shalev. Thread Migration and its Applications in Distributed Shared Memory Systems. Journal of Systems and Software, 42(1):71- 87, 1997.]]

Digital Library

[12]

D.B. Johnson and W. Zwaenepoel. Recovery in Distributed Systems Using Optimistic Message Logging and Checkpointing. Journal of Algorithms, 11 (3):462- 491, September 1990.]]

Digital Library

[13]

C. Koelbel, D. Loveman, G. Steele, and M. Zosel. The High Performance Fortran Handbook. The MIT Press, Cambridge, MA, 1994.]]

Digital Library

[14]

R. Konuru, S. Otto, and J. Walpole. A Migratable User- Level Process Package for PVM. Journal of Parallel and Distributed Computing, 40(1): 81-102, Jan 1997.]]

Digital Library

[15]

K. Li and P. Hudak. Memory Coherence in Shared Virtual Memory Systems. AcM Transactions on Computer Systems, 7(4):321-359, November 1989.]]

Digital Library

[16]

M. Litzkow, M. Livny, and M. Mutka. Condor # a Hunter of Idle Workstations. In Proc. 8th Intl. Conf. Distributed Computing Systems, pages 104-111, June 1988.]]

[17]

H. Lu, Y. C. Hu, and W. Zwaenepoel. OpenMP on Networks of Workstations. In Proc. Supercomputing '98, Orlando, FL, November 1998. ACM/IEEE.]]

Digital Library

[18]

N. Nedeljkovic and M.J. Quinn. Data-parallel Programming on a Network of Heterogeneous Workstations. Concurrency: Practice & Experience, 5(4):257- 268, June 1993.]]

[19]

D.A. Nichols. Using Idle Workstations in a Shared Computing Environment. In Proc. lOth ACM Symp. Operating Systems Principles, pages 5-12, November 1987.]]

Digital Library

[20]

OpenMP Group. http://www.openmp.org, 1997.]]

[21]

J.S. Plank, M. Beck, G. Kingsley, and K. Li. Libckpt: Transparent Checkpointing under Unix. In Proceedings of the 1995 Winter Usenix Conference, pages 213- 223, January 1995.]]

Digital Library

[22]

L. Wang, J. Stichnoth, and S. Chatterjee. Runtime Performance of Parallel Array Assignment: An Empirical Study. In Proc. Supercomputing '96, Pittsburgh, PA, November 1996. ACMflEEE.]]

Digital Library

Cited By

Rabenseifner RWellein G(2016)Communication and Optimization Aspects of Parallel Programming Models on Hybrid ArchitecturesThe International Journal of High Performance Computing Applications10.1177/109434200301700100517:1(49-62)Online publication date: 26-Jul-2016
https://doi.org/10.1177/1094342003017001005
Sosonkina MWatson LRadcliffe NHaftka RTrosset M(2013)Adjusting process count on demand for petascale global optimizationParallel Computing10.1016/j.parco.2012.11.00139:1(21-35)Online publication date: 1-Jan-2013
https://dl.acm.org/doi/10.1016/j.parco.2012.11.001
Rabenseifner R(2009)Communication Bandwidth of Parallel Programming Models on Hybrid ArchitecturesProceedings of the 4th International Symposium on High Performance Computing10.1007/3-540-47847-7_37(401-412)Online publication date: 18-May-2009
https://dl.acm.org/doi/10.1007/3-540-47847-7_37
Show More Cited By

Index Terms

Transparent adaptive parallelism on NOWs using OpenMP
1. Computing methodologies
  1. Concurrent computing methodologies
    1. Concurrent programming languages
2. Software and its engineering
  1. Software notations and tools
    1. General programming languages
      1. Language types
        Concurrent programming languages
  2. Software organization and properties
    1. Contextual software domains
      1. Operating systems
        Communications management

Recommendations

Transparent adaptive parallelism on NOWs using OpenMP

We present a system that allows OpenMP programs to execute on a network of workstations with a variable number of nodes. The ability to adapt to a variable number of nodes allows a program to take advantage of additional nodes that become available ...
Adaptive Parallelism for OpenMP Task Parallel Programs
LCR '00: Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers

We present a system that allows task parallel OpenMP programs to execute on a network of workstations (NOW) with a variable number of nodes. Such adaptivity, generally called adaptive parallelism, is important in a multi-user NOW environment, enabling ...
OpenMP for Networks of SMPs

In this paper, we present the first system that implements OpenMP on a network of shared-memory multiprocessors. This system enables the programmer to rely on a single, standard, shared-memory API for parallelization within a multiprocessor and between ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

PPoPP '99: Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming

May 1999

192 pages

ISBN:1581131003

DOI:10.1145/301104

Chairmen:
Marc Snir
IBM T. J. Watson Research Center, Yorktown Heights, NY
,
Andrew A. Chien
Univ. of California, San Diego, San Diego

ACM SIGPLAN Notices Volume 34, Issue 8
Aug. 1999
192 pages
ISSN:0362-1340
EISSN:1558-1160
DOI:10.1145/329366
Editor:
A. Michael Burman
Rowan Univ., Glassboro, NJ
Issue’s Table of Contents

Copyright © 1999 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGPLAN: ACM Special Interest Group on Programming Languages

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 May 1999

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Article

Conference

PPoPP99

Sponsor:

SIGPLAN

PPoPP99: Symposium on Principles & Practice of Parallel Programming

May 4 - 6, 1999

Georgia, Atlanta, USA

Acceptance Rates

PPoPP '99 Paper Acceptance Rate 17 of 79 submissions, 22%;

Overall Acceptance Rate 230 of 1,014 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

20
Total Citations
View Citations
461
Total Downloads

Downloads (Last 12 months)71
Downloads (Last 6 weeks)10

Reflects downloads up to 19 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Rabenseifner RWellein G(2016)Communication and Optimization Aspects of Parallel Programming Models on Hybrid ArchitecturesThe International Journal of High Performance Computing Applications10.1177/109434200301700100517:1(49-62)Online publication date: 26-Jul-2016
https://doi.org/10.1177/1094342003017001005
Sosonkina MWatson LRadcliffe NHaftka RTrosset M(2013)Adjusting process count on demand for petascale global optimizationParallel Computing10.1016/j.parco.2012.11.00139:1(21-35)Online publication date: 1-Jan-2013
https://dl.acm.org/doi/10.1016/j.parco.2012.11.001
Rabenseifner R(2009)Communication Bandwidth of Parallel Programming Models on Hybrid ArchitecturesProceedings of the 4th International Symposium on High Performance Computing10.1007/3-540-47847-7_37(401-412)Online publication date: 18-May-2009
https://dl.acm.org/doi/10.1007/3-540-47847-7_37
Vanneschi MVeraldi L(2007)Dynamicity in distributed applicationsParallel Computing10.1016/j.parco.2007.08.00133:12(822-845)Online publication date: 1-Dec-2007
https://dl.acm.org/doi/10.1016/j.parco.2007.08.001
Rabenseifner RWellein G(2005)Comparison of Parallel Programming Models on Clusters of SMP NodesModeling, Simulation and Optimization of Complex Processes10.1007/3-540-27170-8_31(409-425)Online publication date: 2005
https://doi.org/10.1007/3-540-27170-8_31
Aldinucci MPetrocelli APistoletti ETorquati MVanneschi MVeraldi LZoccolo C(2005)Dynamic reconfiguration of grid-aware applications in ASSISTProceedings of the 11th international Euro-Par conference on Parallel Processing10.1007/11549468_84(771-781)Online publication date: 30-Aug-2005
https://dl.acm.org/doi/10.1007/11549468_84
Weatherly DLowenthal DNakazawa MLowenthal FMcGraw J(2003)Dyn-MPIProceedings of the 2003 ACM/IEEE conference on Supercomputing10.1145/1048935.1050156Online publication date: 15-Nov-2003
https://dl.acm.org/doi/10.1145/1048935.1050156
Löf HRadović ZHagersten E(2003)THROOM – Supporting POSIX Multithreaded Binaries on a ClusterEuro-Par 2003 Parallel Processing10.1007/978-3-540-45209-6_105(760-769)Online publication date: 2003
https://doi.org/10.1007/978-3-540-45209-6_105
Jiang HChaudhary V(2002)Compile/Run-Time Support for Thread MigrationProceedings of the 16th International Parallel and Distributed Processing Symposium10.5555/645610.661541Online publication date: 15-Apr-2002
https://dl.acm.org/doi/10.5555/645610.661541
Jamieson PBilas A(2002)CableS : thread control and memory management extensions for shared virtual memory clustersProceedings Eighth International Symposium on High Performance Computer Architecture10.1109/HPCA.2002.995716(263-274)Online publication date: 2002
https://doi.org/10.1109/HPCA.2002.995716
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents