Self-stabilizing byzantine-tolerant distributed replicated state machine

Alexander Binun, Thierry Coupaye, Shlomi Dolev, Mohammed Kassi-Lahlou, Marc Lacoste, Alex Palesandro, Reuven Yagel, Leonid Yankulin

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Replicated state machine is a fundamental concept used for obtaining fault tolerant distributed computation. Legacy distributed computational architectures (such as Hadoop or Zookeeper) are designed to tolerate crashes of individual machines. Later, Byzantine fault-tolerant Paxos as well as self-stabilizing Paxos were introduced. Here we present for the first time the self-stabilizing Byzantine fault-tolerant version of a distributed replicated machine. It can cope with any adversarial takeover on less than one third of the participating replicas. It also ensures automatic recovery following any transient violation of the system state, in particular after periods in which more than one third of the participants are Byzantine. A prototype of self-stabilizing Byzantine-tolerant replicated Hadoop master node has been implemented. Experiments show that fully distributed recovery of cloud infrastructures against Byzantine faults can be made practical when relying on self-stabilization in local nodes. Thus automated cloud protection against a wide variety of faults and attacks is possible.

Original languageEnglish
Title of host publicationStabilization, Safety, and Security of Distributed Systems - 18th International Symposium, SSS 2016, Proceedings
EditorsFranck Petit, Borzoo Bonakdarpour
PublisherSpringer Verlag
Pages36-53
Number of pages18
ISBN (Print)9783319492582
DOIs
StatePublished - 1 Jan 2016
Event18th International Symposium on Stabilization, Safety, and Security of Distributed Systems, SSS 2016 - Lyon, France
Duration: 7 Nov 201610 Nov 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10083 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference18th International Symposium on Stabilization, Safety, and Security of Distributed Systems, SSS 2016
Country/TerritoryFrance
CityLyon
Period7/11/1610/11/16

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)

Fingerprint

Dive into the research topics of 'Self-stabilizing byzantine-tolerant distributed replicated state machine'. Together they form a unique fingerprint.

Cite this