Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/504390.504396acmconferencesArticle/Chapter ViewAbstractPublication PagesewConference Proceedingsconference-collections
Article
Free access

On group communication in large-scale distributed systems

Published: 12 September 1994 Publication History

Abstract

An increasing number of applications with reliability requirements are being deployed in distributed systems that span large geographic distances or manage large numbers of objects. We consider the process group mechanism as an appropriate application structuring paradigm in such large-scale distributed systems. We give a formal characterization for the attribute "large scale" as applied to distributed systems and examine the technical problems that need to be solved in making group technology scalable. Our design advocates multiple roles for group membership over a minimal set of abstractions and primitives. The design is currently being implemented on top of "off-the-shelf" technologies for both communication and computation.

References

[1]
Y. Amir, D. Dolev, S. Kramer and D. Malki. Transis: A Communication Sub-System for High Availability. In Proc. 22nd Annual International Symposium on Fault-Tolerant Computing, pages 76-84, July 1992.
[2]
O. Babaoglu, M.G. Baker, R. Davoli, and L.A. Giachini. RELACS: A Communications Infrastructure for Constructing Reliable Applications in Large-Scale Distributed Systems. Technical Report UBLCS-94-15, Laboratory for Computer Science, University of Bologna, Italy, June 1994.
[3]
K. Birman. The Process Group Approach to Reliable Distributed Computing, Communication of the ACM, 9(12):36-53, December 1993.
[4]
K. Birman and R. Cooper. The ISiS Project: Real Experience with a Fault-Tolerant Programming System. ACM SIGOPS Operating Systems Review, 25(2);103-107, April 1991.
[5]
K. Birman, A. Schiper and P. Stephenson. Lightweight Causal and Atomic Multicast. ACM Trans. Computing Systems, 9(3):272-314, August 1991.
[6]
T.D. Chandra and S. Toueg. Unreliable Failure Detectors for Asynchronous Systems. In Proceedings lOth ACM Symposium on Principles of Distributed Computing, pages 325-340. ACM, August 1991.
[7]
D.R. Cheriton and W. Zwaenepoel. Distributed Process Groups in the V Kernel. ACM Trans. Comput. Syst. 3(2):77-107, May 1985.
[8]
P. Felber, C. Malloth, A, Schiper and U. Wilhelm. Phoenix: A Group-Oriented Infrastructure for Large- Scale Distributed Systems. Technical Report, EPFL-LSE, Lausanne, Switzerland. In preparation.
[9]
Michael J. Fischer, Nancy A. Lynch, and Michael S. Paterson. Impossibility of Distributed Consensus with One Faulty Process. Journal of ACM, 32(2):374-382, April 1985.
[10]
L. Liang, S.T. Chanson and G.W. Neufeld Process Groups and Group Communications: Classifications and Requirements. IEEE Computer, 23(2):56-66, February 1990.
[11]
L.L. Peterson, N.C. Bucholz, and R.D. Schlichting. Preserving and using context information in interprocess communication. ACM Transactions on Computer Systems, 7(3):217-246, August 1989.
[12]
A. Ricciardi, A. Schiper and K. Birman, Understanding Partitions and the "No Partition" Assumption. In Proc. 4th IEEE Workshop on Future Trends of Distributed Systems, Lisboa, September 1993.
[13]
R. van Renesse, K. Birman, R. Cooper, B. Glade and P. Stephenson. The Horus System. In Reliable Distributed Computing with the Isis Toolkit, K.P. Birman, R. van Renesse (Ed.), IEEE Computer Society Press, Los Alamitos, CA, pages 133-147, 1993.
[14]
A. Schiper and A. Sandoz. Uniform Reliable Multicast in a Virtually Synchronous Environment. In Proc. 13th Int. Conference on Distributed Computing Systems, pages 501-568, May 1993.
[15]
A. Schiper and A. Ricciardi. Virtually-Synchronous Communication Based on a Weak Failure Suspector. In Proc. 23rd Int. Syrup. on Fault-Tolerant Computing, Toulouse, pages 534-543, June 1993.

Cited By

View all
  • (2007)Jgroup/ARM: a distributed object group platform with autonomous replication managementSoftware: Practice and Experience10.1002/spe.85338:9(885-923)Online publication date: 3-Oct-2007
  • (2005)Client Group Membership as an Architectural Approach for Dependability in Large Scale SystemsProceedings of the Proceedings of the 38th Annual Hawaii International Conference on System Sciences - Volume 0910.1109/HICSS.2005.140Online publication date: 3-Jan-2005
  • (2005)A novel group communication protocol using the RingNet hierarchy in mobile InternetInternational Journal of Parallel, Emergent and Distributed Systems10.1080/1744576050014235720:3-4(253-280)Online publication date: Sep-2005
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
EW 6: Proceedings of the 6th workshop on ACM SIGOPS European workshop: Matching operating systems to application needs
September 1994
201 pages
ISBN:9781450373388
DOI:10.1145/504390
  • General Chair:
  • Jurgen Nehmer,
  • Program Chair:
  • Marc Shapiro
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 September 1994

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

EW94
Sponsor:
EW94: 1994 ACM SIGOPS European Workshop
September 12 - 14, 1994
Wadern, Germany

Acceptance Rates

Overall Acceptance Rate 37 of 37 submissions, 100%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)50
  • Downloads (Last 6 weeks)6
Reflects downloads up to 27 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2007)Jgroup/ARM: a distributed object group platform with autonomous replication managementSoftware: Practice and Experience10.1002/spe.85338:9(885-923)Online publication date: 3-Oct-2007
  • (2005)Client Group Membership as an Architectural Approach for Dependability in Large Scale SystemsProceedings of the Proceedings of the 38th Annual Hawaii International Conference on System Sciences - Volume 0910.1109/HICSS.2005.140Online publication date: 3-Jan-2005
  • (2005)A novel group communication protocol using the RingNet hierarchy in mobile InternetInternational Journal of Parallel, Emergent and Distributed Systems10.1080/1744576050014235720:3-4(253-280)Online publication date: Sep-2005
  • (2004)RGB: a scalable and reliable group membership protocol in mobile InternetInternational Conference on Parallel Processing, 2004. ICPP 2004.10.1109/ICPP.2004.1327939(326-333 vol.1)Online publication date: 2004
  • (2004)Extended membership problem for open groupsProceedings of the 6th international conference on High Performance Computing for Computational Science10.1007/11403937_23(288-301)Online publication date: 28-Jun-2004
  • (2000)A scalable membership service for group communications in WANsProceedings. 2000 Pacific Rim International Symposium on Dependable Computing10.1109/PRDC.2000.897285(59-66)Online publication date: 2000
  • (1996)Structured virtual synchronyProceedings of the 7th workshop on ACM SIGOPS European workshop: Systems support for worldwide applications10.1145/504450.504488(213-217)Online publication date: 9-Sep-1996
  • (1996)A stake in cyberspaceProceedings of the 7th workshop on ACM SIGOPS European workshop: Systems support for worldwide applications10.1145/504450.504466(83-88)Online publication date: 9-Sep-1996
  • (1996)On the impossibility of group membershipProceedings of the fifteenth annual ACM symposium on Principles of distributed computing10.1145/248052.248120(322-330)Online publication date: 1-May-1996
  • (1995)Configurable highly available distributed servicesProceedings. 14th Symposium on Reliable Distributed Systems10.1109/RELDIS.1995.526219(118-127)Online publication date: 1995
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media