Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1416729.1416777acmconferencesArticle/Chapter ViewAbstractPublication PagesnotereConference Proceedingsconference-collections
research-article

Agreement and consistency without knowing the number of processes

Published: 23 June 2008 Publication History

Abstract

We study in this paper three classical problems of fault tolerance in a system where the set of processes is unknown. These three problems are: the consensus, the implementation of atomics registers and the eventual leader election.
For this, we consider different models. In the first one, the communication and the processes are asynchronous. In this model, these three problems could not be solved, but we define the weakest failure detectors needed to solve them.
We consider then a model where the processes and the communication are synchronous, which permit to realize synchronous rounds. In this case, the processes are created dynamically and may have crash failures. We prove that, if for all rounds at least one process is alive in two consecutive rounds, the consensus and the implementation of registers could be solved. The eventual leader election, which is in this case less interesting, can be solved also.
Between these two extremities, we focus on the case where the communications are asynchronous. Concerning processes, we assume that, onetime a process is created, it remains alive forever. In this case, if the leader election is easy, the consensus and the implementation of registers are impossible. If we augment the system with the failure detector (Σ) which permits to realize a quorum, consensus and implementation of atomic register can be solved.
At the end, we consider a partially synchronous model and we prove that the consensus and the implementation of atomic register could be solved if there exists a process that can communicate synchronously with the other processes.

References

[1]
Marcos K. Aguilera. A pleasant stroll through the land of infinitely many creatures. Technical report, ACM SIGACT News Distributed Computing Column, 2004.
[2]
Marcos K. Aguilera, Carole Delporte-Gallet, Hugues Fauconnier, and Sam Toueg. On implementing omega with weak reliability and synchrony assumptions. In Proceedings of the 22nd ACM Symposium on Principles of Distributed Computing, pages 306--314, Boston, Massachusetts, USA, July 2003.
[3]
Hagit Attiya, Amotz Bar-Noy, and Danny Dolev. Sharing memory robustly in message-passing systems. J. ACM, 42(1):124--142, 1995.
[4]
Roberto Baldoni, Marin Bertier, Michel Raynal, and Sara Tucci Piergiovanni. Looking for a definition of dynamic distributed systems. In Victor E. Malyshkin, editor, PaCT, volume 4671 of Lecture Notes in Computer Science, pages 1--14. Springer, 2007.
[5]
Michael Ben-Or. Another advantage of free choice: Completely asynchronous agreement protocols. In Proceedings of the 2nd ACM Symposium on Principles of Distributed Computing, pages 27--30, August 1983.
[6]
David Cavin, Yoav Sasson, and André Schiper. Consensus with unknown participants or fundamental self-organization. In Ioanis Nikolaidis, Michel Barbeau, and Evangelos Kranakis, editors, ADHOC-NOW, volume 3158 of Lecture Notes in Computer Science, pages 135--148. Springer, 2004.
[7]
Tushar Deepak Chandra, Vassos Hadzilacos, and Sam Toueg. The weakest failure detector for solving consensus. Journal of the ACM, 43(4):685--722, July 1996.
[8]
Tushar Deepak Chandra and Sam Toueg. Unreliable failure detectors for reliable distributed systems. Journal of the ACM, 43(2):225--267, March 1996.
[9]
Carole Delporte-Gallet, Hugues Fauconnier, and Rachid Guerraoui. Shared memory vs. message passing. Technical Report IC/2003/77, EPFL, December 2003. Availabe at http://icwww.epfl.ch/publications/.
[10]
Carole Delporte-Gallet, Hugues Fauconnier, and Rachid Guerraoui. (almost) all objects are universal in message passing systems. In Pierre Fraigniaud, editor, DISC, volume 3724 of Lecture Notes in Computer Science, pages 184--198. Springer, 2005.
[11]
Carole Delporte-Gallet, Hugues Fauconnier, Rachid Guerraoui, Vassos Hadzilacos, Petr Kouznetsov, and Sam Toueg. The weakest failure detectors to solve certain fundamental problems in distributed computing. In Soma Chaudhuri and Shay Kutten, editors, PODC, pages 338--346. ACM, 2004.
[12]
Antonio Fernández, Ernesto Jiménez, and Michel Raynal. Eventual leader election with weak assumptions on initial knowledge, communication reliability, and synchrony. In DSN, pages 166--178. IEEE Computer Society, 2006.
[13]
Michael J. Fischer, Nancy A. Lynch, and Michael S. Paterson. Impossibility of distributed consensus with one faulty process. Journal of the ACM, 32(2):374--382, April 1985.
[14]
Fabíola Greve and Sébastien Tixeuil. Knowledge connectivity vs. synchrony requirements for fault-tolerant agreement in unknown networks. In DSN, pages 82--91. IEEE Computer Society, 2007.
[15]
Vassos Hadzilacos and Sam Toueg. Fault-tolerant broadcasts and related problems. In Sape J. Mullender, editor, Distributed Systems, chapter 5, pages 97--145. Addison-Wesley, 1993.
[16]
Maurice Herlihy and Jeannette M. Wing. Linearizability: a correctness condition for concurrent objects. ACM Transactions on Programming Languages and Systems, 12(3):463--492, June 1990.
[17]
A. Israeli and M. Li. Bounded time-stamps. Distributed Computing, 6(4):205--209, July 1993.
[18]
Wai-Kau Lo and Vassos Hadzilacos. Using failure detectors to solve consensus in asynchronous shared memory systems. In G. Tel and P. Vitányi, editors, Proceedings of the eighth International Workshop on Distributed Algorithms, volume 857 of LNCS, pages 280--295. Springer-Verlag, September 1994.
[19]
Nancy A. Lynch. Distributed Algorithms. Morgan Kaufmann, 1996.
[20]
Achour Mostéfaoui and Michel Raynal. Solving consensus using Chandra-Toueg's unreliable failure detectors: A general quorum based approach. In Proceedings of the 13th International Symposium on Distributed Computing, LNCS 1693, pages 49--63. Springer-Verlag, September 1999.
[21]
P. Vitanyi and B. Awerbuch. Atomic shared register access by asynchronous hardware. In Proceedings of the 27th Symposium on Foundations of Computer Science, pages 233--246, 1986.

Recommendations

Reviews

Bruce E. Litow

This paper investigates three classical problems in distributed computation-leader election, atomic registers, and consensus-under carefully defined assumptions about synchrony/asynchrony and knowledge about the number of processes. Before proceeding with the review of this interesting paper, I should point out that apart from its abstract and algorithms that are in English, the text of the paper is in French. The paper considers the possibility of each of these three classical distributed computations, when the number of processors is unknown but each process has a unique ID. The authors consider three things: (1)Synchronous systems, where a central source that synchronizes process clocks and communication is synchronous (carried out in rounds with an upper bound on message delay). (2)Partially synchronous systems, where the central clock exists but communication is asynchronous. (3)Asynchronous systems, where no assumption other than ultimate delivery of a message is made; this assumption can be changed if faults are admitted. The authors derive possibility/impossibility results for all cases, also allowing or disallowing faults. The results are summarized in Table 1. A notion of reducibility is given. The idea of a detector is introduced, where a detector has as output a set of process IDs. A detector Y is weaker than a detector ω if every output of ω can be transformed into an output of Y . The authors do not specify details of the transformation, but it appears to amount to simple reinterpretation. This notion of reducibility is applied to two detectors, O and S . O has an output that is the ID of a single process. This output is invariable and is known to all processes. In effect, O elects a leader. S is fault detection by quorum, where two quorums, at distinct times, have a nonempty intersection and, at some time, the corresponding quorum contains the ID of a faulty process. It is shown that for the asynchronous case, S is the weakest detector, enabling realization of atomic registers, and O×S (ordered pair of IDs as output) is the weakest detector, enabling realization of consensus. Online Computing Reviews Service

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
NOTERE '08: Proceedings of the 8th international conference on New technologies in distributed systems
June 2008
399 pages
ISBN:9781595939371
DOI:10.1145/1416729
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • Lyon 1 University
  • SIGAPP: ACM Special Interest Group on Applied Computing
  • Mairie de Villeurbanne
  • Conseil Général du Rhône
  • INSA Lyon: Institut National des Sciences Appliquées de Lyon
  • Conseil Régional Rhône-Alpes
  • Mutuelle d'assurance MAIF
  • I.U.T.A LYON 1: Institute of Technology Lyon 1
  • Ministère de l'Enseignement Supérieur et de la Recherche
  • Lyon 2 University
  • ISTASE: High-Level Engineering School in Telecommunication
  • France Telecom
  • LIRIS: Lyon Research Center for Images and Intelligent Information Systems

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 June 2008

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. distributed algorithms
  2. dynamic systems
  3. fault tolerance

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 83
    Total Downloads
  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media