Title | Practical Hardening of Crash-Tolerant Systems |
Publication Type | Conference Paper |
Year of Publication | 2012 |
Authors | Correia, M, Ferro DG, Junqueira F, Serafini M |
Conference Name | Proc. of the 2012 USENIX Annual Technical Conference |
Date Published | June/2012 |
Publisher | USENIX |
Abstract | Recent failures of production systems have highlighted the importance of tolerating faults beyond crashes. The industry has so far addressed this problem by hardening crash-tolerant systems with ad hoc error detection checks, potentially overlooking critical fault scenarios. We propose a generic and principled hardening technique for Arbitrary State Corruption (ASC) faults, which specifically model the effects of realistic data corruptions on distributed processes. Hardening does not require the use of trusted components or the replication of the process over multiple physical servers. We implemented a wrapper library to transparently harden distributed processes. To exercise our library and evaluate our technique, we obtained ASC-tolerant versions of Paxos, of a subset of the ZooKeeper API, and of an eventually consistent storage by implementing crash-tolerant protocols and automatically hardening them using our library. Our evaluation shows that the throughput of our ASC-hardened state machine replication outperforms its Byzantine-tolerant counterpart by up to 70%. |
URL | https://www.usenix.org/conference/usenixfederatedconferencesweek/practical-hardening-crash-tolerant-systems |
Practical Hardening of Crash-Tolerant Systems
- Login to post comments