Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1109/GRID.2007.4354113guideproceedingsArticle/Chapter ViewAbstractPublication PagesgridConference Proceedingsconference-collections
Article
Free access

High-available grid services through the use of virtualized clustering

Published: 19 September 2007 Publication History

Abstract

Grid applications comprise several components and web-services that make them highly prone to the occurrence of transient software failures and aging problems. This type of failures often incur in undesired performance levels and unexpected partial crashes. In this paper we present a technique that offers high-availability for Grid services based on concepts like virtualization, clustering and software rejuvenation. To show the effectiveness of our approach, we have conducted some experiments with OGSA-DAI middleware. One of the implementations of OGSA-DAI makes use of Apache Axis V1.2.1, a SOAP implementation that suffers from severe memory leaks. Without changing any bit of the middleware layer we have been able to anticipate most of the problems caused by those leaks and to increase the overall availability of the OGSA-DAI Application Server. Although these results are tightly related with this middleware it should be noted that our technique is neutral and can be applied to any other Grid service that is supposed to be high-available.

References

[1]
Oppenheimer, D., Archana Ganapathi, and David A. Patterson. "Why do Internet Services fail, and What can be done about it?" 4th USENIX Symposium on Internet Technologies and Systems (USITS'03), March. 2003.
[2]
A. Avritzer, E. Weyuker, Monitoring Smoothly Degrading Systems for increased Dependability, Empirical Software Eng. Journal, Vol 2, No 1, pp. 59-77, 1997.
[3]
L. Li, K. Vaidyanathan, K. Trivedi. "An Approach for Estimation of Software Aging in a Web-Server", Proc. of the 2002 International Symposium on Empirical Software Engineering (ISESE'02).
[4]
V. Castelli, R. Harper, P. Heidelberg, S. Hunter, K. Trivedi, K. Vaidyanathan, W. Zeggert, "Proactive Management of Software Aging" IBM Journal Research & Development, Vol. 45, No. 2, Mar. 2001.
[5]
K. Cassidy, K. Gross, A. Malekpour. "Advanced Pattern: Recognition for detection of Complex software Aging phenomena in Online Transaction Processing Servers", Proc. of the 2002 Int. Conf. on Dependable Systems and Networks, DSN-2002.
[6]
A. Tai, S. Chau, L. Alkalaj, H. Hecht. "On-Board Preventive Maintenance: Analysis of Effectiveness an Optimal Duty Period", Proc. 3rd Workshop on Object-Oriented Real-Time Dependable Systems, 1997.
[7]
E. Marshall. "Fatal Error: How Patriot Overlooked a Scud". Science, p. 1347, Mar. 1992.
[8]
R. Arpaci-Dusseau, A. Arpaci-Dusseau. "Fail-stutter Fault Tolerance". Proc. 8th Workshop on Hot Topics in Operating Systems. (HOTOSVIII), 2001.
[9]
D. Menascé. "QoS Issues in Web Services". IEEE Internal Computing, Nov-Dec 2002.
[10]
L. Silva, H. Madeira and J.G. Silva. "Software Aging and Rejuvenation in a SOAP-Based Server". IEEE-NCA: Network Computing and Applications, Cambridge USA, July 2006.
[11]
Renato J. Figuereido, Peter A. Dinda, José A. B. Fortes. "A Case For Grid Computing on Virtual Machines". Proc. of the 23rd Int. Conf. on Distributed Computing Systems, p. 550, May 19-22, 2003.
[12]
R. Figuereido, P. Dinda, J. Fortes, "Resource Virtualization Renaissance" IEEE Computer, 38(5), pp. 28-69, May 2005.
[13]
(2007) Xen Source website {Online}. http://www.xensource.com/
[14]
(2007) VMWare website. {Online}. http://www.vmware.com/
[15]
(2007) Virtuoso website. {Online}. http://www.virtuoso.com/
[16]
(2007) LVS website. {Online}. http://www.linuxvirtualserver.org/
[17]
(2007) Ldirectord website. {Online}. http://www.vergenet.net/linux/ldirectord/
[18]
(2007) OGSA-DAI website. {Online}. http://www.ogsadai.org.uk/
[19]
(2007) Apache Axis. {Online}. http://ws.apache.org/axis
[20]
(2007) OGSA-DAI Projects, {Online}. http://www.ogsadai.org.uk/about/projects.php
[21]
W. Hoarau, S. Tixeuil, N. Rodrigues, D. Sousa and L. Silva. "Benchmarking the OGSA-DAI Middleware". CoreGrid Technical Report, No. TR-0060. October 5, 2006. http://www.coregrid.net
[22]
S. Tixeuil, W. Hoarau, L.M. Silva, "An Overview of Existing tools for Fault-Injection and Dependability Benchmarking in Grids", CoreGrid Technical Report TR-0041, http://www.coregrid.net

Cited By

View all
  • (2018)Frequent failure monitoring and reporting in virtualisation environment using backing algorithm techniqueInternational Journal of Information and Communication Technology10.5555/3193269.319327712:1-2(131-142)Online publication date: 1-Jan-2018
  • (2015)Future research directions in design of reliable communication systemsTelecommunications Systems10.1007/s11235-015-9987-760:4(423-450)Online publication date: 1-Dec-2015
  • (2014)Software rejuvenation scheduling using accelerated life testingACM Journal on Emerging Technologies in Computing Systems10.1145/253911810:1(1-23)Online publication date: 13-Jan-2014
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
GRID '07: Proceedings of the 8th IEEE/ACM International Conference on Grid Computing
September 2007
339 pages
ISBN:9781424415595

Publisher

IEEE Computer Society

United States

Publication History

Published: 19 September 2007

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)39
  • Downloads (Last 6 weeks)28
Reflects downloads up to 19 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Frequent failure monitoring and reporting in virtualisation environment using backing algorithm techniqueInternational Journal of Information and Communication Technology10.5555/3193269.319327712:1-2(131-142)Online publication date: 1-Jan-2018
  • (2015)Future research directions in design of reliable communication systemsTelecommunications Systems10.1007/s11235-015-9987-760:4(423-450)Online publication date: 1-Dec-2015
  • (2014)Software rejuvenation scheduling using accelerated life testingACM Journal on Emerging Technologies in Computing Systems10.1145/253911810:1(1-23)Online publication date: 13-Jan-2014
  • (2013)A comparative experimental study of software rejuvenation overheadPerformance Evaluation10.1016/j.peva.2012.09.00270:3(231-250)Online publication date: 1-Mar-2013
  • (2009)Availability analysis of application servers using software rejuvenation and virtualizationJournal of Computer Science and Technology10.1007/s11390-009-9228-124:2(339-346)Online publication date: 1-Mar-2009

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media