Yang et al., 2017 - Google Patents
Exploring the Challenges and Opportunities of Cloud Stacks in Dynamic Resource EnvironmentsYang et al., 2017
- Document ID
- 12474657177346640626
- Author
- Yang F
- Gunawi H
- Chien A
- Publication year
- Publication venue
- 2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC)
External Links
Snippet
Traditional cloud stacks are designed to tolerate server or rack-level failures, that are unpredictable and uncorrelated. Such stacks successfully deliver highly-available cloud services at global scale. The increasing criticality of cloud services to the overall world …
- 230000002596 correlated 0 abstract description 26
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
- G06F11/2097—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements maintaining the standby controller/processing unit updated
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
- G06F11/202—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
- G06F11/2023—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant details of failing over
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
- G06F11/14—Error detection or correction of the data by redundancy in operation
- G06F11/1402—Saving, restoring, recovering or retrying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/1658—Data re-synchronization of a redundant component, or initial sync of replacement, additional or spare unit
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30575—Replication, distribution or synchronisation of data between databases or within a distributed database; Distributed database system architectures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
- G06F11/14—Error detection or correction of the data by redundancy in operation
- G06F11/1479—Generic software techniques for error detection or fault masking
- G06F11/1482—Generic software techniques for error detection or fault masking by means of middleware or OS functionality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3065—Monitoring arrangements determined by the means or processing involved in reporting the monitored data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network-specific arrangements or communication protocols supporting networked applications
- H04L67/10—Network-specific arrangements or communication protocols supporting networked applications in which an application is distributed across nodes in the network
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Netto et al. | State machine replication in containers managed by Kubernetes | |
Bravo et al. | Saturn: A distributed metadata service for causal consistency | |
Almeida et al. | ChainReaction: a causal+ consistent datastore based on chain replication | |
He et al. | Hog: Distributed hadoop mapreduce on the grid | |
CN106993064A (en) | A kind of system and its construction method and application that the storage of mass data scalability is realized based on Openstack cloud platforms | |
Aldwyan et al. | Latency-aware failover strategies for containerized web applications in distributed clouds | |
Costa et al. | Medusa: An efficient cloud fault-tolerant mapreduce | |
Cidon et al. | MinCopysets: Derandomizing replication in cloud storage | |
Wang et al. | Exploring the design tradeoffs for extreme-scale high-performance computing system software | |
Kumar T et al. | Intelligent Fault‐Tolerant Mechanism for Data Centers of Cloud Infrastructure | |
Du et al. | Cost-effective strong consistency on scalable geo-diverse data replicas | |
Jiang et al. | A novel clustered MongoDB-based storage system for unstructured data with high availability | |
Liu et al. | Replication in distributed storage systems: State of the art, possible directions, and open issues | |
Kolbeck et al. | Flease-lease coordination without a lock server | |
Zhou et al. | FTCloudSim: support for cloud service reliability enhancement simulation | |
Costa et al. | Chrysaor: Fine-grained, fault-tolerant cloud-of-clouds mapreduce | |
Chen et al. | Replication-based highly available metadata management for cluster file systems | |
Denzler et al. | Comparing different persistent storage approaches for containerized stateful applications | |
Yang et al. | Exploring the Challenges and Opportunities of Cloud Stacks in Dynamic Resource Environments | |
Wan et al. | Dual-JT: Toward the high availability of JobTracker in Hadoop | |
Jiang et al. | MyStore: A high available distributed storage system for unstructured data | |
Beineke et al. | Fast parallel recovery of many small in-memory objects | |
Yang | Resilient Distributed Data Management Protocols in Dynamic Resource Environments | |
Xiao et al. | AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes | |
Shen | Distributed storage system model design in internet of things based on hash distribution |