Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/1251522.1251525guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

MON: on-demand overlays for distributed system management

Published: 13 December 2005 Publication History

Abstract

This paper presents the management overlay network (MON) system that we are building and running on the PlanetLab testbed. MON is a distributed system designed to facilitate the management of large distributed applications. Toward this goal, MON builds on-demand overlay structures that allow users to execute instant management commands, such as query the current status of the application, or push software updates to all the nodes. The on-demand approach enables MON to be light-weight, requiring minimum amount of resources when no commands are executed. It also frees MON from complex failure repair mechanisms, since no overlay structure is maintained for a prolonged time. MON is currently running on more than 300 nodes on the Planet-Lab. Our initial experiments on the PlanetLab show that MON has good performance, both in terms of command response time and achieved bandwidth for software push.

References

[1]
{1} CoMon. http://comon.cs.princeton.edu/.
[2]
{2} CoTop. http://codeen.cs.princeton.edu/cotop/.
[3]
{3} MON. http://cairo.cs.uiuc.edu/mon/.
[4]
{4} Planetlab application manager. http://appmanager.berkeley.intel-research.net/.
[5]
{5} PSSH. http://www.theether.org/pssh/.
[6]
{6} Stork. http://www.cs.arizona.edu/stork/.
[7]
{7} vxargs. http://dharma.cis.upenn.edu/planetlab/vxargs/.
[8]
{8} R. Adams. Distributed system management: Planetlab incidents and management tools. PlanetLab Design Notes PDN-03-015.
[9]
{9} M. Castro, P. Druschel, A.-M. Kermarrec, A. Nandi, A. Rowstron, and A. Singh. Split-Stream: High-bandwidth content distribution in a cooperative environment. In SOSP'03, 2003.
[10]
{10} B. Chun, J. Hellerstein, R. Huebsch, P. Maniatis, and T. Roscoe. Design considerations for information planes. In WORLDS'04, December 2004.
[11]
{11} R. Huebsch, J. Hellerstein, N. Lanham, B. T. Loo, S. Shenker, and I. Stoica. Querying the internet with pier. In VLDB, 2003.
[12]
{12} A.-M. Kermarrec, L. Massoulie, and A. J. Ganesh. Probabilistic reliable dissemination in large-scale systems. IEEE Transaction on Parallel and Distributed Systems, 14(2), February 2003.
[13]
{13} D. Kostic, A. Rodriguez, J. Albrecht, and A. Vahdat. Bullet: High bandwidth data dissemination using an overlay mesh. In SOSP'03, October 2003.
[14]
{14} D. Kostoulas, D. Psaltoulis, I. Gupta, K. Birman, and A. Demers. Decentralized schemes for size estimation in large and dynamic groups. In IEEE Symp. Network Computing and Applications, 2005.
[15]
{15} M. L. Massie, B. N. Chun, and D. E. Culler. The ganglia distributed monitoring system: Design, implementation, and experience. Parallel Computing, 30, July 2004.
[16]
{16} D. Oppenheimer, J. Albrecht, D. Patterson, and A. Vahdat. Distributed resource discovery on planetlab with sword. In WORLDS'04, December 2004.
[17]
{17} K. Park and V. S. Pai. Deploying large file transfer on an http content distribution network. In WORLDS'04, December 2004.
[18]
{18} L. Peterson, T. Anderson, D. Culler, and T. Roscoe. A blueprint for introducing disruptive technology into the internet. In HotNets-I, 2002.
[19]
{19} X. Zhang, J. Liu, B. Li, and T.-S. P. Yum. DONet: A data-driven overlay network for efficient live media streaming. In IEEE INFOCOM'05, Miami, FL, 2005.

Cited By

View all
  • (2010)Evaluation of QoS-compliant overlays under denial of service attacksProceedings of the 2010 Spring Simulation Multiconference10.1145/1878537.1878644(1-8)Online publication date: 11-Apr-2010
  • (2010)A survey on the design, applications, and enhancements of application-layer overlay networksACM Computing Surveys10.1145/1824795.182480043:1(1-34)Online publication date: 3-Dec-2010
  • (2010)MonalyticsProceedings of the 7th international conference on Autonomic computing10.1145/1809049.1809073(141-150)Online publication date: 7-Jun-2010
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
WORLDS'05: Proceedings of the 2nd conference on Real, Large Distributed Systems - Volume 2
December 2005
71 pages

Publisher

USENIX Association

United States

Publication History

Published: 13 December 2005

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2010)Evaluation of QoS-compliant overlays under denial of service attacksProceedings of the 2010 Spring Simulation Multiconference10.1145/1878537.1878644(1-8)Online publication date: 11-Apr-2010
  • (2010)A survey on the design, applications, and enhancements of application-layer overlay networksACM Computing Surveys10.1145/1824795.182480043:1(1-34)Online publication date: 3-Dec-2010
  • (2010)MonalyticsProceedings of the 7th international conference on Autonomic computing10.1145/1809049.1809073(141-150)Online publication date: 7-Jun-2010
  • (2009)CensusProceedings of the 2009 conference on USENIX Annual technical conference10.5555/1855807.1855819(12-12)Online publication date: 14-Jun-2009
  • (2009)RhizomaProceedings of the ACM/IFIP/USENIX 10th international conference on Middleware10.5555/1813355.1813369(184-204)Online publication date: 30-Nov-2009
  • (2009)RhizomaProceedings of the 10th ACM/IFIP/USENIX International Conference on Middleware10.5555/1656980.1656994(1-20)Online publication date: 30-Nov-2009
  • (2008)MoaraProceedings of the 9th ACM/IFIP/USENIX International Conference on Middleware10.5555/1496950.1496975(408-428)Online publication date: 2-Dec-2008
  • (2008)Efficient on-demand operations in dynamic distributed infrastructuresProceedings of the 2nd Workshop on Large-Scale Distributed Systems and Middleware10.1145/1529974.1529986(1-5)Online publication date: 15-Sep-2008
  • (2007)Reliable on-demand management operations for large-scale distributed applicationsACM SIGOPS Operating Systems Review10.1145/1317379.131739241:5(82-88)Online publication date: 1-Oct-2007
  • (2007)Cloud control with distributed rate limitingACM SIGCOMM Computer Communication Review10.1145/1282427.128241937:4(337-348)Online publication date: 27-Aug-2007
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media