Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Optimizing the Performance of Virtual Machine Synchronization for Fault Tolerance

Published: 01 December 2011 Publication History

Abstract

Hypervisor-based fault tolerance (HBFT), which synchronizes the state between the primary VM and the backup VM at a high frequency of tens to hundreds of milliseconds, is an emerging approach to sustaining mission-critical applications. Based on virtualization technology, HBFT provides an economic and transparent fault tolerant solution. However, the advantages currently come at the cost of substantial performance overhead during failure-free, especially for memory intensive applications. This paper presents an in-depth examination of HBFT and options to improve its performance. Based on the behavior of memory accesses among checkpointing epochs, we introduce two optimizations, read-fault reduction and write-fault prediction, for the memory tracking mechanism. These two optimizations improve the performance by 31 percent and 21 percent, respectively, for some applications. Then, we present software superpage which efficiently maps large memory regions between virtual machines (VM). Our optimization improves the performance of HBFT by a factor of 1.4 to 2.2 and achieves about 60 percent of that of the native VM.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Computers
IEEE Transactions on Computers  Volume 60, Issue 12
December 2011
158 pages

Publisher

IEEE Computer Society

United States

Publication History

Published: 01 December 2011

Author Tags

  1. Virtualization
  2. checkpoint
  3. fault tolerance.
  4. hypervisor
  5. recovery

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2019)PhantasyIEEE Transactions on Computers10.1109/TC.2018.286594368:2(225-238)Online publication date: 1-Feb-2019
  • (2017)Live migration of virtual machines with their local persistent storage in a data intensive cloudInternational Journal of High Performance Computing and Networking10.5555/3070823.307083710:1-2(134-147)Online publication date: 1-Jan-2017
  • (2017)An Efficient Survivable Design With Bandwidth Guarantees for Multi-Tenant Cloud NetworksIEEE Transactions on Network and Service Management10.1109/TNSM.2017.270442714:2(357-372)Online publication date: 1-Jun-2017
  • (2016)SamsaraProceedings of the 2016 USENIX Conference on Usenix Annual Technical Conference10.5555/3026959.3027010(551-564)Online publication date: 22-Jun-2016
  • (2015)TardigradeProceedings of the 12th USENIX Conference on Networked Systems Design and Implementation10.5555/2789770.2789810(575-588)Online publication date: 4-May-2015
  • (2015)SamsaraProceedings of the 6th Asia-Pacific Workshop on Systems10.1145/2797022.2797028(1-7)Online publication date: 27-Jul-2015
  • (2015)Hotplug or Ballooning: A Comparative Study on Dynamic Memory Management Techniques for Virtual MachinesIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2014.232091526:5(1350-1363)Online publication date: 7-Apr-2015
  • (2013)Cost-Benefit Analysis of Virtualizing Batch SystemsProceedings of the 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing10.5555/2588611.2588703(264-268)Online publication date: 9-Dec-2013
  • (2012)A medical image file accessing system with virtualization fault tolerance on cloudProceedings of the 7th international conference on Advances in Grid and Pervasive Computing10.1007/978-3-642-30767-6_29(338-349)Online publication date: 11-May-2012
  • (2011)Safe side effects commitment for OS-level virtualizationProceedings of the 8th ACM international conference on Autonomic computing10.1145/1998582.1998601(111-120)Online publication date: 14-Jun-2011

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media