Nothing Special   »   [go: up one dir, main page]

skip to main content
article

A reliable-adaptive scheduler for computational grids with failure recovery and rescheduling mechanisms

Published: 01 May 2011 Publication History

Abstract

Computational grids (CGs) have become an attractive research area as they suggest a suitable environment for developing large scale parallel applications. CGs integrate a large amount of distributed heterogeneous resources into a single powerful platform. However, to make good use of CGs, grid resources should be scheduled efficiently. Various scheduling strategies have been introduced, including static and dynamic behaviours. The former maps tasks to resources at submission time, while the latter operates at schedule time. While static scheduling is unsuitable for the dynamic grid environment, scheduling in CGs is still more complex than the proposed dynamic ones. This paper introduces a decentralised adaptive grid scheduler (AGS) based on a novel rescheduling mechanism. AGS has several salient properties as it is: hybrid, adaptive, decentralised, and efficient. AGS is also robust as it has the ability to: 1) detect resource failures; 2) continue its functionality in spite of the failure existence; 3) recover back as soon as possible. Moreover, it integrates both static and dynamic scheduling behaviours. An initial static scheduling map is proposed for an input directed acyclic graph (DAG). However, DAG tasks may be rescheduled if the hosting resources' performance changes in a way that affects the tasks' response time. AGS tries to overcome drawbacks of traditional schedulers by utilising the mobile agent unique features to enhance the resource discovery and monitoring processes. Experimental results have shown that AGS outperforms traditional grid schedulers as it introduces a better scheduling efficiency.

References

[1]
Armstrong, R., Hensgen, D. and Kidd, T. (1998) 'The relative performance of various mapping algorithms is independent of sizable variances in run-time predictions', in 7th IEEE Heterogeneous Computing Workshop, pp.79-87.
[2]
Baskiyar, S. and Dickinson, C. (2005) 'Scheduling directed acyclic task graphs on a bounded set of heterogeneous processors using task duplication', J. Parallel Distrib. Comput., Vol. 65, No. 8, pp.911-921.
[3]
Bharadwaj, V., Ghose, D., Mani, V. and Robertazzi, T. (1996) Scheduling Divisible Loads in Parallel and Distributed Systems, IEEE Computer Society Press.
[4]
Braun, T., Siegel, H. and Beck, N. (2001) 'A comparison of eleven static heuristics for mapping a class of independent tasks onto heterogeneous distributed computing systems', Journal of Parallel and Distributed Computing, Vol. 61, pp.810-837.
[5]
Ghanea, R., Collis, J. and Ndumu, D. (1999) 'Co-operating mobile agents for distributed parallel processing', Third International Conference on Autonomous Agents, ACM Press.
[6]
Kontothanassis, L. and Goddeau, D. (2005) 'Profile driven scheduling for a heterogeneous server cluster', in Proceedings of the 34th International Conference on Parallel Processing Workshops, pp.336-345.
[7]
Kotz, D. and Gary, R. (2002) Future Directions for Mobile-Agent Research, IEEE Distributed System online, Vol. 3, No. 8.
[8]
Kwok, Y. and Ahmad, I. (1999) 'Benchmarking and comparison of the task graph scheduling algorithms', Journal of Parallel and Distributed Computing, Vol. 59, No. 3, pp.381-422.
[9]
Lang, D. and Oshima, M. (1999) 'Seven good reasons for mobile agents', Communications of the ACM, Vol. 42, No. 8.
[10]
Mandal, A., Kennedy, K., Koelbel, C., Marin, G. and Johnsson, L. (2005) 'Scheduling strategies for mapping application workflows onto the grid', in 14th IEEE Symposium on High Performance Distributed Computing (HPDC14), pp.125-134.
[11]
Overiender, B., Wijngaards, N., Van Steen, M. and Brazier, F. (2002) 'Multi-agent support for internet scale grid management', AISB'02 Symposium on AI and Grid Computing, pp.18-22.
[12]
Topcuoglu, H. and Hariri, S. (2002) 'Performance effective and low-complexity task scheduling for heterogeneous computing', IEEE Transactions on Parallel and Distributed Systems, Vol. 2, No. 13, pp.260-274.
[13]
White, J. (1997) Mobile Agents, MIT Press, Software Agents.
[14]
Yu, Z. and Shi, W. (2007) 'An adaptive rescheduling strategy for grid workflow applications', in Parallel and Distributed Processing Symposium. IPDPS 2007. IEEE International.
[15]
Zhang, Y., Koelbel, C. and Kennedy, K. (2007) 'Relative performance of scheduling algorithms in grid environments', Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid.
[16]
Zhao, H. and Sakellariou, R. (2003) 'An experimental investigation into the rank function of the heterogeneous earliest finish time scheduling algorithm, Proc. of Euro-Par 2003, LNCS, August, Vol. 2790, pp.189-194, Springer-Verlag, Klagenfurt, Austria.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image International Journal of Grid and Utility Computing
International Journal of Grid and Utility Computing  Volume 2, Issue 1
May 2011
75 pages
ISSN:1741-847X
EISSN:1741-8488
Issue’s Table of Contents

Publisher

Inderscience Publishers

Geneva 15, Switzerland

Publication History

Published: 01 May 2011

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Dec 2024

Other Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media