Abstract
The grid provides an integrated computer platform composed of differentiated and distributed systems. These resources are dynamic and heterogeneous. In this paper, a novel fault-tolerant grid-scheduling model is presented based on Stochastic Petri Nets (SPN) to assure the heterogeneity and dynamism of the grid system. Also, a new grid-scheduling strategy, the dependable strategy for the shortest expected accomplishing time (DSEAT), is put forward, in which the dependability factor is introduced in the task-dispatching strategy. In the end, the performance of the scheduling strategy based on the fault-tolerant grid-scheduling model is analyzed by an software package, named SPNP. The numerical results show that dynamic resources will increase the response time for all classes of tasks in differing degrees. Compared with shortest expected accomplishing time (SEAT) strategy, the DSEAT strategy can reduce the negative effects of dynamic and autonomic resources to some extent so as to guarantee a high quality of service (QoS).
Similar content being viewed by others
References
Foster I, Kesselman C. The Grid Blueprint for a New Computing Infrastructure. Morgan Kaufmann Publishers, 1998
Baker M A, Buyya R, Laforenza D. The grid: international efforts in global computing. International Journal of software Practice and Experience, 2002, 32(15): 1437–1466
Foster I, Kesselman C, Tuecke S. The anatomy of the grid: enabling scalable virtual organizations. International J. Supercomputer Applications, 2001, 15(3): 200–222
Algirdas A, Laprie J C, Randell B, et al. Basic concepts and taxonomy of dependable and secure computing. IEEE Transactions on Dependable and Secure Computing, 2004, 1(1): 11–33
Gong L G, Sun X H. Performance modeling and prediction of nondedicated network computing. IEEE Trans on Computers, 2002, 51(9): 1041–1055
David M, William H, Kishor S. Model-based evaluation: from dependability to security. IEEE Transactions on Dependable and Secure Computing, 2004, 1(1): 48–65
Ramamurthy S. Scheduling periodic hard real-Time tasks with arbitrary deadlines on multiprocessors real-time systems symposium. 23rd IEEE RTSS 2002, 59–68
Jansen P G, Hanssen F, Lijding M E. Scheduling of early quantum tasks. In: Proceedings of the 15th Euromicro Conference on Real-Time Systems, 2003, 203–210
Palencia J C, Harbour G M. Offset-based response time analysis of distributed systems scheduled under EDF. In: Proceedings of 15th Euromicro Conference on Real-Time Systems, 2003, 3–12
Liu C, Yang L, Foster I, et al. Design and evaluation of a resource selection framework for grid applications. In: Proceedings of the 11th IEEE Int’l Symp. on High-Performance Distributed Computing. Washington: IEEE Computer Society, 2002, 63–72
Kartik S, Murthy S R. Task allocation algorithms for maximizing reliability of distributed computing systems. IEEE Trans. On Computer, 1997, 46(6): 719–724
Qin X, Jiang H, Swanson DR. An efficient fault-tolerant scheduling algorithm for real-time tasks with precedence constraints in heterogeneous systems. In: Proceedings of the 31st Int’l Conf. on Parallel Processing (ICPP 2002), 2002, 360–368
Dutot P. Complexity of master-slave tasking on heterogeneous trees. European Journal on Operational Research, 2005, 164(3): 690–695
Zhang Y Y, Squillante M S, Sivasubramaniam A, Sahoo R K. Performance implications of failures in large-scale cluster scheduling. Job Scheduling Strategies for Parallel Processing, 2005, 233–252
Schroeder B, Gibson G. A large-scale study of failures in high-performance computing systems. In: Proceedings of the International Conference on Dependable Systems and Networks (DSN2006), 2006, 249–258
Sahoo R K, Sivasubramaniam A, Squillante M S, et al. Failure data analysis of a large-scale heterogeneous server environment. In: Proceedings of DSN’04, 2004, 772–781
Shan Z G, Lin C, Ren F Y, et al. Modeling and performance analysis of a multiserver multiqueue system on the grid, distributed computing systems. In: Proceedings of the Ninth IEEE Workshop on Future Trends, 2003, 337–343
Krauter K, Buyya R, Maheswaran M. A taxonomy and survey of Grid resource management systems for distributed computing. Software: Practice and Experience, 2002, 32(2): 135–164
Geunmo K, Hyunsoo Y. On submesh allocation for mesh multicomputers: a best fit allocation and a virtual submesh allocation for faulty meshes. IEEE Trans on Parallel and Distributed Systems, 1998, 9(2): 175–185
Murata T. Petri Nets: properties, analysis and applications. In: Proceedings of the IEEE, 1989, 77(4): 541–580
Ciardo G, Trivedi K S. A decomposition approach for stochastic reward net models. Performance Evaluation, 1993, (18): 37–59
Ciaodo G, Muppala J, Trivedi K S. SPNP: Stochastic Petri Net Package. In: Proceedings of Petri Nets and Performance Models, 1989, 142–151
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Wang, Y., Lin, C., Yang, Y. et al. Performance analysis of a dependable scheduling strategy based on a fault-tolerant grid model. Front. Comput. Sc. China 1, 329–337 (2007). https://doi.org/10.1007/s11704-007-0032-1
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/s11704-007-0032-1