TY - GEN
T1 - Self-stabilizing byzantine-tolerant distributed replicated state machine
AU - Binun, Alexander
AU - Coupaye, Thierry
AU - Dolev, Shlomi
AU - Kassi-Lahlou, Mohammed
AU - Lacoste, Marc
AU - Palesandro, Alex
AU - Yagel, Reuven
AU - Yankulin, Leonid
N1 - Funding Information:
The research was partially supported by the Rita Altura Trust Chair in Computer Sciences, Orange Labs under external research contract number 0050012310-C04021, grant of the Ministry of Science, Technology and Space, Israel, and the National Science Council (NSC) of Taiwan, and a grant of the Ministry of Science, Technology and Space, Israel, the Ministry of Foreign Affairs, Italy.
Publisher Copyright:
© Springer International Publishing AG 2016.
PY - 2016/1/1
Y1 - 2016/1/1
N2 - Replicated state machine is a fundamental concept used for obtaining fault tolerant distributed computation. Legacy distributed computational architectures (such as Hadoop or Zookeeper) are designed to tolerate crashes of individual machines. Later, Byzantine fault-tolerant Paxos as well as self-stabilizing Paxos were introduced. Here we present for the first time the self-stabilizing Byzantine fault-tolerant version of a distributed replicated machine. It can cope with any adversarial takeover on less than one third of the participating replicas. It also ensures automatic recovery following any transient violation of the system state, in particular after periods in which more than one third of the participants are Byzantine. A prototype of self-stabilizing Byzantine-tolerant replicated Hadoop master node has been implemented. Experiments show that fully distributed recovery of cloud infrastructures against Byzantine faults can be made practical when relying on self-stabilization in local nodes. Thus automated cloud protection against a wide variety of faults and attacks is possible.
AB - Replicated state machine is a fundamental concept used for obtaining fault tolerant distributed computation. Legacy distributed computational architectures (such as Hadoop or Zookeeper) are designed to tolerate crashes of individual machines. Later, Byzantine fault-tolerant Paxos as well as self-stabilizing Paxos were introduced. Here we present for the first time the self-stabilizing Byzantine fault-tolerant version of a distributed replicated machine. It can cope with any adversarial takeover on less than one third of the participating replicas. It also ensures automatic recovery following any transient violation of the system state, in particular after periods in which more than one third of the participants are Byzantine. A prototype of self-stabilizing Byzantine-tolerant replicated Hadoop master node has been implemented. Experiments show that fully distributed recovery of cloud infrastructures against Byzantine faults can be made practical when relying on self-stabilization in local nodes. Thus automated cloud protection against a wide variety of faults and attacks is possible.
UR - http://www.scopus.com/inward/record.url?scp=84995632337&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-49259-9_4
DO - 10.1007/978-3-319-49259-9_4
M3 - Conference contribution
AN - SCOPUS:84995632337
SN - 9783319492582
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 36
EP - 53
BT - Stabilization, Safety, and Security of Distributed Systems - 18th International Symposium, SSS 2016, Proceedings
A2 - Petit, Franck
A2 - Bonakdarpour, Borzoo
PB - Springer Verlag
T2 - 18th International Symposium on Stabilization, Safety, and Security of Distributed Systems, SSS 2016
Y2 - 7 November 2016 through 10 November 2016
ER -