TY - GEN
T1 - Self-stabilizing Multivalued Consensus in Asynchronous Crash-prone Systems
AU - Lundstrom, Oskar
AU - Raynal, Michel
AU - Schiller, Elad M.
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/1/1
Y1 - 2021/1/1
N2 - The problem of multivalued consensus is fundamental in the area of fault-tolerant distributed computing since it abstracts a very broad set of agreement problems in which processes have to uniformly decide on a specific value v\in V, where V ≥ 2. Existing solutions (that tolerate process failures) reduce the multivalued consensus problem to the one of binary consensus, e.g., Mostéfaoui-Raynal-Tronel and Zhang-Chen. Our study aims at the design of an even more reliable solution. We do so through the lenses of self-stabilization-a very strong notion of fault-tolerance. In addition to node and communication failures, self-stabilizing algorithms can recover after the occurrence of (a finite number of) arbitrary transient-faults; these faults represent any violation of the assumptions according to which the system was designed to operate (as long as the algorithm code stays intact). This work proposes the first (to the best of our knowledge) self-stabilizing algorithm for multivalued consensus for asynchronous message-passing systems prone to process failures and arbitrary transient-faults. Our solution is also the first (to the best of our knowledge) to support wait-freedom. Moreover, using piggybacking techniques, our solution can invoke n binary consensus objects concurrently. Thus, the proposed self-stabilizing wait-free solution can terminate using fewer binary consensus objects than earlier non-self-stabilizing solutions by Mostéfaoui, Raynal, and Tronel, which uses an unbounded number of binary consensus objects, or Zhang and Chen, which is not wait-free.
AB - The problem of multivalued consensus is fundamental in the area of fault-tolerant distributed computing since it abstracts a very broad set of agreement problems in which processes have to uniformly decide on a specific value v\in V, where V ≥ 2. Existing solutions (that tolerate process failures) reduce the multivalued consensus problem to the one of binary consensus, e.g., Mostéfaoui-Raynal-Tronel and Zhang-Chen. Our study aims at the design of an even more reliable solution. We do so through the lenses of self-stabilization-a very strong notion of fault-tolerance. In addition to node and communication failures, self-stabilizing algorithms can recover after the occurrence of (a finite number of) arbitrary transient-faults; these faults represent any violation of the assumptions according to which the system was designed to operate (as long as the algorithm code stays intact). This work proposes the first (to the best of our knowledge) self-stabilizing algorithm for multivalued consensus for asynchronous message-passing systems prone to process failures and arbitrary transient-faults. Our solution is also the first (to the best of our knowledge) to support wait-freedom. Moreover, using piggybacking techniques, our solution can invoke n binary consensus objects concurrently. Thus, the proposed self-stabilizing wait-free solution can terminate using fewer binary consensus objects than earlier non-self-stabilizing solutions by Mostéfaoui, Raynal, and Tronel, which uses an unbounded number of binary consensus objects, or Zhang and Chen, which is not wait-free.
KW - Consensus
KW - Fault-tolerance
KW - Stabilization
UR - http://www.scopus.com/inward/record.url?scp=85123465977&partnerID=8YFLogxK
U2 - 10.1109/EDCC53658.2021.00023
DO - 10.1109/EDCC53658.2021.00023
M3 - Conference contribution
AN - SCOPUS:85123465977
T3 - Proceedings - 2021 17th European Dependable Computing Conference, EDCC 2021
SP - 111
EP - 118
BT - Proceedings - 2021 17th European Dependable Computing Conference, EDCC 2021
PB - Institute of Electrical and Electronics Engineers
T2 - 17th European Dependable Computing Conference, EDCC 2021
Y2 - 13 September 2021 through 16 September 2021
ER -