|
Abstract:
In this paper, we describe a method of execution retry for bypassing software errors based on checkpointing, rollback, message reordering and replaying. We demonstrate how rollback techniques, previously developed for transient hardware failure recovery, can also be used to recover from software faults by exploiting message reordering to bypass software errors. Our approach intentionally increases the degree of nondeterminism and the scope of rollback when a previous retry fails. Examples from our experience with telecommunications software systems illustrate the benefits of the scheme.... Distributed systems, Software boundary errors, Checkpointing and message logging, Message reordering, Progressive retry.
| Limitations: |
APPROVED FOR PUBLIC RELEASE |
| Pages: |
19 |
| Report Date: |
93 |
| Contract Number: |
N00014-91-J-1283 |
| Report Number: |
A570062 |
|
|
|
|
|