Storming Media: Pentagon Reports and Documents

Progressive Retry for Software Error Recovery in Distributed Systems

Authors: Yi-Min Wang; Yennun Huang; W. K. Fuchs; ILLINOIS UNIV AT URBANA COORDINATED SCIENCE LAB
Abstract:
In this paper, we describe a method of execution retry for bypassing software errors based on checkpointing, rollback, message reordering and replaying. We demonstrate how rollback techniques, previously developed for transient hardware failure recovery, can also be used to recover from software faults by exploiting message reordering to bypass software errors. Our approach intentionally increases the degree of nondeterminism and the scope of rollback when a previous retry fails. Examples from our experience with telecommunications software systems illustrate the benefits of the scheme....

Keywords: Distributed systems, Software boundary errors, Checkpointing and message logging, Message reordering, Progressive retry.
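
The escalation described in the abstract (retry from a checkpoint, then add nondeterminism by reordering messages, then widen the rollback scope) can be illustrated with a toy sketch. This is not the authors' algorithm: the `Process` class, the bounded-buffer "boundary error", the `try_order` and `progressive_retry` helpers, and the assumption that every logged message may be legally reordered with every other are all hypothetical choices made for illustration.

```python
import copy
from itertools import permutations


class BoundaryError(Exception):
    """Toy software boundary error: the bounded buffer overflowed."""


class Process:
    """Hypothetical process whose whole state is a bounded buffer."""

    def __init__(self, capacity=2):
        self.capacity = capacity
        self.buffer = []

    def handle(self, msg):
        kind, value = msg
        if kind == "put":
            if len(self.buffer) >= self.capacity:
                raise BoundaryError(f"buffer full, cannot store {value!r}")
            self.buffer.append(value)
        elif kind == "get" and self.buffer:
            self.buffer.pop(0)  # a "get" on an empty buffer is a no-op


def try_order(checkpoint, order):
    """Roll back to a copy of `checkpoint` and replay messages in `order`.

    Returns the recovered process, or None if the error recurs.
    """
    process = copy.deepcopy(checkpoint)
    try:
        for msg in order:
            process.handle(msg)
    except BoundaryError:
        return None
    return process


def progressive_retry(scopes):
    """Try each rollback scope in turn, smallest first.

    `scopes` is a list of (name, checkpoint, logged_messages). Within a
    scope, the logged order is replayed first; only if that fails are
    reorderings tried (assumed legal here, e.g. independent senders).
    """
    for name, checkpoint, log in scopes:
        recovered = try_order(checkpoint, log)
        if recovered is not None:
            return name, "replay", recovered
        for order in permutations(log):
            recovered = try_order(checkpoint, order)
            if recovered is not None:
                return name, "reorder", recovered
    return None  # every retry failed; a real system would escalate further


early_log = [("get", None), ("put", "a"), ("put", "b")]
late_log = [("put", "c"), ("get", None), ("put", "d")]

older = Process()                    # checkpoint taken before any message
newer = try_order(older, early_log)  # checkpoint taken after early_log

result = progressive_retry([
    ("latest checkpoint", newer, late_log),
    ("earlier checkpoint", older, early_log + late_log),
])
```

In this toy run, every ordering replayed from the latest checkpoint overflows the buffer, so the sketch falls back to the earlier checkpoint, where a reordering of the longer log completes without error. Exhaustive permutation is used only because the logs are tiny; a practical scheme would need a far more selective way to choose which messages to reorder.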

Limitations: APPROVED FOR PUBLIC RELEASE
Pages: 19
Report Date: 93
Contract Number: N00014-91-J-1283
Report Number: A570062
Keywords relating to this report:
*COMPUTER PROGRAMS
*DISTRIBUTED DATA PROCESSING
*ERRORS
*RECOVERY
FAILURE
FAULT TOLERANT COMPUTING
MESSAGE PROCESSING
TELECOMMUNICATIONS