Self-stabilizing byzantine-tolerant distributed replicated state machine

  • Alexander Binun
  • , Thierry Coupaye
  • , Shlomi Dolev
  • , Mohammed Kassi-Lahlou
  • , Marc Lacoste
  • , Alex Palesandro
  • , Reuven Yagel
  • , Leonid Yankulin

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    10 Scopus citations

    Abstract

    Replicated state machine is a fundamental concept used for obtaining fault tolerant distributed computation. Legacy distributed computational architectures (such as Hadoop or Zookeeper) are designed to tolerate crashes of individual machines. Later, Byzantine fault-tolerant Paxos as well as self-stabilizing Paxos were introduced. Here we present for the first time the self-stabilizing Byzantine fault-tolerant version of a distributed replicated machine. It can cope with any adversarial takeover on less than one third of the participating replicas. It also ensures automatic recovery following any transient violation of the system state, in particular after periods in which more than one third of the participants are Byzantine. A prototype of self-stabilizing Byzantine-tolerant replicated Hadoop master node has been implemented. Experiments show that fully distributed recovery of cloud infrastructures against Byzantine faults can be made practical when relying on self-stabilization in local nodes. Thus automated cloud protection against a wide variety of faults and attacks is possible.

    Original languageEnglish
    Title of host publicationStabilization, Safety, and Security of Distributed Systems - 18th International Symposium, SSS 2016, Proceedings
    EditorsFranck Petit, Borzoo Bonakdarpour
    PublisherSpringer Verlag
    Pages36-53
    Number of pages18
    ISBN (Print)9783319492582
    DOIs
    StatePublished - 1 Jan 2016
    Event18th International Symposium on Stabilization, Safety, and Security of Distributed Systems, SSS 2016 - Lyon, France
    Duration: 7 Nov 201610 Nov 2016

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume10083 LNCS
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference18th International Symposium on Stabilization, Safety, and Security of Distributed Systems, SSS 2016
    Country/TerritoryFrance
    CityLyon
    Period7/11/1610/11/16

    ASJC Scopus subject areas

    • Theoretical Computer Science
    • General Computer Science

    Fingerprint

    Dive into the research topics of 'Self-stabilizing byzantine-tolerant distributed replicated state machine'. Together they form a unique fingerprint.

    Cite this