Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/525592.823109guideproceedingsArticle/Chapter ViewAbstractPublication PageshpdcConference Proceedingsconference-collections
Article

Scheduling From the Perspective of the Application

Published: 06 August 1996 Publication History

Abstract

Metacomputing is the aggregation of distributed and high-performance resources on coordinated networks. With careful scheduling, resource-intensive applications can be implemented efficiently on metacomputing systems at the sizes of interest to developers and users. In this paper, we focus on the problem of scheduling applications on metacomputing systems. We introduce the concept of application-centric scheduling in which every- thing about the system is evaluated in terms of its impact on the application. Application-centric scheduling is used by virtually all metacomputer programmers to achieve performance on metacomputing systems. We describe two successful metacomputing applications to illustrate this approach, and describe AppLeS scheduling agents which generalize the application-centric scheduling approach. Finally, we show preliminary results which compare AppLeS-derived schedules with conventional strip and blocked schedules for a two- dimensional Jacobi code. * This paper can be located on the World Wide Web at http://www-cse.ucsd.edu/users/berman/apples.html

References

[1]
AppLES. http://www-cse.ucsd.edu/users/berman/ apples.html.
[2]
BERMAN, F., WOLSKI, R., FIGUEIRA, S., SCHOPF, J., AND SHAO, G. Application-level scheduling on distributed heterogeneous networks. Submitted to Supercomputing '96.
[3]
BREWER, E.A. High-level oprimization via automated statistical modeling. In Proceedings of Principles and Practice of Parallel Programming, PPoPP'95 (1995), pp. 80-91.
[4]
CASAVANT, T., AND KUHL, J. A taxonomy of scheduling in general-purpose distributed computing systems. IEEE Transactions on Software Engineering 14, 2 (February 1988).
[5]
CLEO. http://w4.lns.cornell.edu/public/ public.html.
[6]
FEITELSON, D. A survey of scheduling in multiprogrammed parallel systems. Tech. Rep. RC 19790 (87657), IBM Research Division, October 1994.
[7]
FIGUEIRA, S.M., AND BERMAN, F. Modeling the effects of contention on the performance of heterogeneous applications. Proceedings of the High Performance Distributed Computing Conference (1996).
[8]
FINK, S.J., BADEN, S.B., AND KOHN, S.R. Flexible communication mechanisms for dynamic structured applications. Proceedings of IRREGULAR'96.
[9]
FREUND, R., Ed. Proceedings of the 1996 IPPS Workshop on Heterogeneous Computing.
[10]
GEHRINF, J., AND REINFELD, A. Mars - a framework for minimizing the job execution time in a metacomputing environment. Proceedings of Future General Computer Systems (1996).
[11]
GLOBUS. http://www.mes.anl.goy/globus.
[12]
GUSTAFSON, J. The consequences of fixed time performance measurement. Proceedings of the 25th Hawaii International Conference on System Sciences (Jan 1992), 113-124.
[13]
HENSGEN, D.A., MOORE, L., KIDD, T., FREUND, R., KEITH, E., KUSSOW, M., LIMA, J., AND CAMPBELL, M. Adding rescheduling to and integrating condor with smartnet. Proceedings of the Heterogeneous workshop (1995).
[14]
LEGION. http://www.es.virginia.edu/~mentat/ legion/legion.html.
[15]
LEVI, B.G. The geometric phase Shows up in chemical reactions. Physics Today 46, 3 (March 1993), 17-19.
[16]
LOWECAMP, B., AND BEGUELlN, A. ECO: Efficient collective operations for communication on heterogeneous networks. Proceedings of IPPS (April 1996).
[17]
MARZULLO, K., OGG, M., RICCIARDI, A., AMOROSO, A., CALKINS, F., AND ROTHFUS, E. Nile: Wide-area computing for high energy physics. Proceedings of the 1996 SIGOPS Conference.
[18]
NILE. http://www.nile.utexas.edu/.
[19]
PRUYNE, J., AND LIVNY, M. Parallel processing on dynamic resources with carmi. In Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing, IPPS'95 (April 1995).
[20]
PVM. http://www.epm.ornl.gov:80/pvm/.
[21]
RUDOLPH, L., AND FEITELSON, D., Eds. Proceedings of the 1996 IPPS Workshop on Job Scheduling Strategies for Parallel Processing.
[22]
SARKAR, V. Automatic partitioning of a program dependence graph into parallel tasks. IBM Journal of Research and Devlopment 35, 5/6 (Sept/Nov 1991).
[23]
SHIRAZI, B., HURSON, A., AND KAVI, K. Scheduling and Load Balancing in Parallel and Distributed Systems. IEEE Computer Society Press, 1995.
[24]
SIEGEL, H., ANTONIO, J., METZGER, R., TAN, M., AND LI, Y.A. Heterogeneous computing. Tech. rep., Purdue University EE Technical Report TR-EE94-37.
[25]
TANNENBAUM, T., AND LITZKOW, M. The condor distributed processing system. Dr. Dobbs Journal (February 1995).
[26]
WAN, M., MOORE, R., KREMENEK, G., AND STEUBE, K. A batch scheduler for the intel paragon MPP system with a non-contiguous node allocation. Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing (1996).
[27]
WEISSMAN, J. The interference paradigm for network job scheduling. Proceedings of the IPPS Workshop on Heterogeneous Computing (1996).
[28]
Wu, M., AND KUPPERMANN, A. Casa quantum chemical reaction dynamics. In CASA Gigabit Network Testbed Annual Report (1994).
[29]
Wu, Y.-S.M., AND KUPPERMANN, A. Prediction of the effect of the geometric phase on product rotational state distributions and integral crosssections. Chemical Physics Letters 201 (January 1993), 178-86.
[30]
YANG, T., AND GERASOULS, A. DSC: Scheduling parallel tasks on an unbounded number of processors. IEEE Transactions on Parallal and Distributed Systems 5, 9 (1994), 951-967.
[31]
ZHANG, X., AND YAN, Y. A framework of performance prediction of parallel computing nondedicated heterogeneous NOW. In Proceedings of the 1995 International Conference on Parallel Processing (1995), pp. 163-7.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
HPDC '96: Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
August 1996
ISBN:0818675829

Publisher

IEEE Computer Society

United States

Publication History

Published: 06 August 1996

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 166 of 966 submissions, 17%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 17 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2010)Load Balancing Content-Based Publish/Subscribe SystemsACM Transactions on Computer Systems10.1145/1880018.188002028:4(1-55)Online publication date: 1-Dec-2010
  • (2010)Grid broker selection strategies using aggregated resource informationFuture Generation Computer Systems10.1016/j.future.2009.07.00926:1(72-86)Online publication date: 1-Jan-2010
  • (2009)Tuning parallel applications in parallelParallel Computing10.1016/j.parco.2009.07.00135:8-9(475-492)Online publication date: 1-Aug-2009
  • (2008)The XtreemOS jSchedulerFirst USENIX Workshop on Large-Scale Computing10.5555/1411725.1411728(1-10)Online publication date: 22-Jun-2008
  • (2008)Inter-operating grids through Delegated MatchMakingScientific Programming10.1155/2008/30496516:2-3(233-253)Online publication date: 1-Apr-2008
  • (2007)A job self-scheduling policy for HPC infrastructuresProceedings of the 13th international conference on Job scheduling strategies for parallel processing10.5555/1791551.1791555(51-75)Online publication date: 17-Jun-2007
  • (2007)A distributed system for genetic linkage analysisProceedings of the 2006 international conference on Distributed, high-performance and grid computing in computational biology10.5555/1765588.1765600(110-123)Online publication date: 21-Jan-2007
  • (2007)Orchestration of Network-Wide Active Measurements for Supporting Distributed Computing ApplicationsIEEE Transactions on Computers10.1109/TC.2007.7074556:12(1629-1642)Online publication date: 1-Dec-2007
  • (2006)Dynamic load balancing in distributed content-based publish/subscribeProceedings of the ACM/IFIP/USENIX 2006 International Conference on Middleware10.5555/1515984.1515996(141-161)Online publication date: 1-Nov-2006
  • (2006)MPI performance analysis tools on Blue Gene/LProceedings of the 2006 ACM/IEEE conference on Supercomputing10.1145/1188455.1188583(123-es)Online publication date: 11-Nov-2006
  • Show More Cited By

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media