Self-stabilizing Byzantine-Tolerant Distributed Replicated State Machine
Autor: | Leonid Yankulin, Mohammed Kassi-Lahlou, Marc Lacoste, Thierry Coupaye, Alexander Binun, Reuven Yagel, Shlomi Dolev, Alex Palesandro |
---|---|
Rok vydání: | 2016 |
Předmět: |
State machine replication
business.industry Computer science Distributed computing Node (networking) 020207 software engineering Fault tolerance Cloud computing 0102 computer and information sciences 02 engineering and technology computer.software_genre 01 natural sciences 010201 computation theory & mathematics Paxos Virtual machine 0202 electrical engineering electronic engineering information engineering Transient (computer programming) State (computer science) business computer |
Zdroj: | Lecture Notes in Computer Science ISBN: 9783319492582 SSS |
DOI: | 10.1007/978-3-319-49259-9_4 |
Popis: | Replicated state machine is a fundamental concept used for obtaining fault tolerant distributed computation. Legacy distributed computational architectures (such as Hadoop or Zookeeper) are designed to tolerate crashes of individual machines. Later, Byzantine fault-tolerant Paxos as well as self-stabilizing Paxos were introduced. Here we present for the first time the self-stabilizing Byzantine fault-tolerant version of a distributed replicated machine. It can cope with any adversarial takeover on less than one third of the participating replicas. It also ensures automatic recovery following any transient violation of the system state, in particular after periods in which more than one third of the participants are Byzantine. A prototype of self-stabilizing Byzantine-tolerant replicated Hadoop master node has been implemented. Experiments show that fully distributed recovery of cloud infrastructures against Byzantine faults can be made practical when relying on self-stabilization in local nodes. Thus automated cloud protection against a wide variety of faults and attacks is possible. |
Databáze: | OpenAIRE |
Externí odkaz: |