Exploiting deferred destruction

Exploiting deferred destruction: an analysis of read-copy-update techniques in operating system kernels

January 2004

Author:
Paul E. Mckenney,
Supervisor:
Jonathan Walpole

Publisher:

Oregon Health & Science University

Order Number:AAI3139819

Pages:

358

Purchase on ProQuest

Bibliometrics

Abstract

The Moore's-Law-driven performance of simple instructions has improved by orders of magnitude over the past two decades, but shared-memory multiprocessor (SMMP) synchronization operations have not kept pace. SMMP software uses synchronization operations heavily, thus suffering degraded performance and scalability. As a result, many traditional SMMP algorithms are now obsolete.

This dissertation presents read-copy update (RCU), a reader-writer synchronization mechanism in which read-side critical sections incur virtually zero synchronization overhead, thereby achieving near-ideal performance for read-mostly workloads. Write-side critical sections incur substantial synchronization overhead, deferring destruction and maintaining multiple versions of data structures in order to accommodate the synchronization-free read-side critical sections. In addition, writers use some mechanism, such as locking, to ensure orderly updates.

Readers provide a signal enabling writers to determine when it is safe to complete destructive operations, but this signal may be deferred, permitting a single signal operation to serve multiple read-side RCU critical sections.

These read-side signals are observed by a specialized garbage collector, which carries out destructive operations once it is safe to do so. Garbage collectors are typically implemented in a manner similar to a barrier computation. Production-quality garbage collectors batch destructive operations, amortizing signal-observation overhead over many updates.

Although RCU is not itself new, its use has been quite specialized. This dissertation rectifies this situation by showing how RCU can be implemented efficiently in operating system kernels, by demonstrating its system-level performance and complexity benefits, and by providing a set of design patterns that make RCU more generally applicable.

This dissertation compares RCU to traditional synchronization mechanisms, including locking and non-blocking synchronization, using both analytic and empirical methods. The empirical methods include both informal micro-benchmarks and formal system-level benchmarks. These benchmarks show performance benefits ranging from tens of percent to an order of magnitude and little or no increase in code complexity.

Finally, this dissertation demonstrates that RCU has practical value by (1) outlining its use in several production systems, two of which have seen extensive datacenter use, one of which this author designed and implemented, and (2) documenting its widespread use in the Linux 2.6 kernel.

Cited By

Contributors

Paul E McKenney
Facebook, Inc.
- Publication Years1981 - 2020
- Publication counts39
- Citation count678
- Available for Download25
- Downloads (cumulative)71,866
- Downloads (12 months)4,995
- Downloads (6 weeks)556
- Average Downloads per Article2,875
- Average Citation per Article17
View Full Profile
Jonathan Walpole
Portland State University
- Publication Years1992 - 2012
- Publication counts70
- Citation count1,674
- Available for Download32
- Downloads (cumulative)21,621
- Downloads (12 months)979
- Downloads (6 weeks)178
- Average Downloads per Article676
- Average Citation per Article24
View Full Profile

Index Terms

Exploiting deferred destruction: an analysis of read-copy-update techniques in operating system kernels
1. Software and its engineering
  1. Software organization and properties
    1. Contextual software domains
      1. Operating systems
        Memory management
        Garbage collection
        Process management
        Multiprocessing / multiprogramming / multitasking
        Process synchronization

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Safety of Deferred Update in Transactional Memory
ICDCS '13: Proceedings of the 2013 IEEE 33rd International Conference on Distributed Computing Systems

Transactional memory allows the user to declare sequences of instructions as speculative transactions that can either commit or abort. If a transaction commits, it appears to be executed sequentially, so that the committed transactions constitute a ...
Deferred Runtime Pipelining for contentious multicore software transactions
EuroSys '19: Proceedings of the Fourteenth EuroSys Conference 2019

DRP is a new concurrency control protocol for software transactional memory that achieves high throughput, even for skewed workloads that exhibit high contention. DRP builds on prior works that chop transactions into pieces to expose more concurrency ...
Speculative client execution in deferred update replication
MW4NG '14: Proceedings of the 9th Workshop on Middleware for Next Generation Internet Computing

Deferred Update Replication (DUR) is a powerful replication technique that allows parallelism of clients' execution while a global certification phase checks the validity of the transactional execution against workloads running on remote nodes. The well-...

Browse Theses

Sections

Cited By

Index Terms

Safety of Deferred Update in Transactional Memory

Deferred Runtime Pipelining for contentious multicore software transactions

Speculative client execution in deferred update replication

Sections

Cited By

Save to Binder

Index Terms

Recommendations

Safety of Deferred Update in Transactional Memory

Deferred Runtime Pipelining for contentious multicore software transactions

Speculative client execution in deferred update replication