Nagawiecki et al., 2021 - Google Patents

Layered Paxos: A Hierarchical Approach to Consensus

Document ID: 14765502707719280466
Author: Nagawiecki A; Patterson S
Publication year: 2021
Publication venue: Rensselaer Polytechnic Institute

Snippet

We present a new consensus algorithm for use in distributed systems. The algorithm, Layered Paxos, is designed for hierarchical systems where processes can be grouped into disjoint components. The underlying communication network is assumed to be two-fold with …
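The snippet's one structural assumption, that processes can be grouped into disjoint components, can be illustrated with a minimal sketch. It is not the Layered Paxos algorithm, whose details are elided above. The component names, process names, and both helper functions are hypothetical, and the per-component majority quorum is the rule from classic Paxos, assumed here only for illustration.

```python
from typing import Dict, Set


def check_disjoint_components(components: Dict[str, Set[str]]) -> bool:
    """Return True if no process appears in more than one component."""
    seen: Set[str] = set()
    for members in components.values():
        if seen & members:  # a process was already assigned elsewhere
            return False
        seen |= members
    return True


def majority(n: int) -> int:
    """Size of a simple majority quorum among n processes (classic Paxos rule)."""
    return n // 2 + 1


# Hypothetical layout: three components with invented process names.
components = {
    "A": {"a1", "a2", "a3"},
    "B": {"b1", "b2", "b3", "b4", "b5"},
    "C": {"c1", "c2", "c3"},
}

# Assumed for illustration: each component uses its own majority quorum.
quorums = {name: majority(len(members)) for name, members in components.items()}
```

Under these assumptions, a grouping in which one process belongs to two components is rejected by `check_disjoint_components`, which is the property the snippet requires of the hierarchy.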

Continue reading at nsl.cs.rpi.edu (PDF) (other versions)

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
- G06F11/202—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
- G06F11/2023—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant details of failing over
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
- G06F11/2097—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements maintaining the standby controller/processing unit updated
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
- G06F11/202—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
- G06F11/2035—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant without idle spare hardware
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/1658—Data re-synchronization of a redundant component, or initial sync of replacement, additional or spare unit
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/14—Error detection or correction of the data by redundancy in operation
- G06F11/1402—Saving, restoring, recovering or retrying
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/0703—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
- G06F11/0751—Error or fault detection not based on redundancy
- G06F11/0754—Error or fault detection not based on redundancy by exceeding limits
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/52—Programme synchronisation; Mutual exclusion, e.g. by means of semaphores; Contention for resources among tasks
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2209/00—Indexing scheme relating to G06F9/00
- G06F2209/50—Indexing scheme relating to G06F9/50
- G06F2209/505—Clust
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes

Similar Documents

Publication	Publication Date	Title
Kogias et al.	2020	HovercRaft: Achieving scalability and fault-tolerance for microsecond-scale datacenter services
US7814373B2 (en)	2010-10-12	Scalable method of continuous monitoring the remotely accessible resources against node failures for very large clusters
Charapko et al.	2021	Pigpaxos: Devouring the communication bottlenecks in distributed consensus
US20150161016A1 (en)	2015-06-11	Method and system of self-managing nodes of a distributed database cluster with a consensus algorithm
US12111817B2 (en)	2024-10-08	Log execution method and apparatus, computer device and storage medium
Charapko et al.	2019	Linearizable quorum reads in Paxos
Benz et al.	2014	Building global and scalable systems with atomic multicast
Yu et al.	2005	Consistent and automatic replica regeneration
US6873987B1 (en)	2005-03-29	Method, system and program products for recovering from failures within a shared nothing distributed computing environment
Geng et al.	2022	Nezha: Deployable and high-performance consensus using synchronized clocks
Ricciardi et al.	1993	Process membership in asynchronous environments
Nagawiecki et al.	2021	Layered Paxos: A Hierarchical Approach to Consensus
van Renesse et al.	2010	Replication techniques for availability
Ng	1990	The design and implementation of a reliable distributed operating system-ROSE
Liu et al.	2016	D-Paxos: building hierarchical replicated state machine for cloud environments
Sun et al.	2017	Adaptive trade‐off between consistency and performance in data replication
Ling et al.	2003	A self-tuning, self-protecting, self-healing session state management layer
Xia et al.	2011	A Novel Failure Detection Algorithm for Reliable Distributed Systems
Charapko et al.	2020	Scaling Strongly Consistent Replication
Birman et al.	2012	Overcoming failures in a distributed system
Gupta et al.	2021	Failure Detection and Fault-Tolerance for Key-Value store in Distributed Systems
Kazhamiaka	2019	Sift: Achieving Resource-Efficient Consensus with RDMA
Gonçalves	2024	Driftwood: decentralized Raft consensus
Ding et al.	2019	Testing Raft-Replicated Database Systems
Lugano et al.	2006	A pragmatic protocol for database replication in interconnected clusters