Abstract
We suggest modeling software package flaws (bugs) by assuming eventual Byzantine behavior of the package. We assume that if a program is started in a predefined initial state, it will exhibit legal behavior for a period of time but will eventually become Byzantine. We assume that this behavior pattern can be attributed to the fact that the manufacturer had performed sufficient package tests for limited time scenarios. Restarts are useful for recovering such systems. We suggest a general, yet practical, framework and paradigm for the monitoring and restarting of systems where the framework and paradigm are based on a theoretical foundation. An autonomic recoverer that monitors and initiates system recovery is proposed. It is designed to handle a task, given specific task requirements in the form of predicates and actions. A directed acyclic graph subsystem hierarchical structure is used by a consistency monitoring procedure for achieving a gracious recovery. The existence and correct functionality of the autonomic recovery is guaranteed by the use of a self-stabilizing kernel resident (anchor) process. The autonomic recoverer uses a new scheme for liveness assurance via on-line monitoring that complements known schemes for on-line safety assurance.
Original language | English |
---|---|
Pages (from-to) | 2315-2327 |
Number of pages | 13 |
Journal | Journal of Systems and Software |
Volume | 81 |
Issue number | 12 |
DOIs | |
State | Published - 1 Dec 2008 |
Keywords
- Automatic recovery
- Liveness
- Monitor
- Safety
- Self-stabilization
ASJC Scopus subject areas
- Software
- Information Systems
- Hardware and Architecture