
An Online Algorithm for Checkpointing Placement

Ziv, Avi and Bruck, Jehoshua (1995) An Online Algorithm for Checkpointing Placement. California Institute of Technology. (Unpublished)






Checkpointing is a common technique for reducing the time to recover from faults in computer systems. By saving intermediate states of programs in reliable storage, checkpointing reduces the processing time lost to faults. The length of the intervals between checkpoints affects the execution time of programs: long intervals lead to long reprocessing time, while too-frequent checkpointing leads to high checkpointing overhead. In this paper we present an on-line algorithm for placement of checkpoints. The algorithm uses on-line knowledge of the current cost of a checkpoint when it decides whether or not to place a checkpoint. We show how the execution time of a program using this algorithm can be analyzed. The total overhead of the execution time when the proposed algorithm is used is smaller than the overhead when fixed intervals are used. Although the proposed algorithm uses only on-line knowledge about the cost of checkpointing, its behavior is close to that of the off-line optimal algorithm, which uses complete knowledge of the checkpointing cost.
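The trade-off described in the abstract can be illustrated with a small simulation. The sketch below is not the algorithm from the report, whose decision rule is not given on this page. It assumes a hypothetical threshold rule (`cost_aware`: place a checkpoint when the current cost is at most `cheap`, or when `max_gap` steps have passed since the last one) and compares it with fixed intervals. The model is also an assumption: work is split into unit-time steps, a fault can occur only during a work step, and a fault rolls execution back to the last checkpoint.

```python
import random
from typing import Callable, List

# A policy sees the number of steps since the last checkpoint and the
# current checkpoint cost, and returns True to place a checkpoint now.
Policy = Callable[[int, float], bool]


def fixed_interval(k: int) -> Policy:
    """Baseline: checkpoint every k steps, ignoring the current cost."""
    return lambda since, cost: since >= k


def cost_aware(cheap: float, max_gap: int) -> Policy:
    """Hypothetical on-line rule (an assumption, not the report's rule):
    checkpoint when it is cheap, or when the gap has grown too long."""
    return lambda since, cost: cost <= cheap or since >= max_gap


def simulate(costs: List[float], policy: Policy,
             fault_prob: float, seed: int = 0) -> float:
    """Return the total time to complete len(costs) unit-time steps.

    costs[i] is the cost of a checkpoint taken right after step i + 1.
    Each executed step fails with probability fault_prob, in which case
    progress rolls back to the last checkpoint.
    """
    rng = random.Random(seed)
    n = len(costs)
    elapsed = 0.0
    done = 0    # steps completed so far
    saved = 0   # steps covered by the last checkpoint
    while done < n:
        elapsed += 1.0                      # time spent on this step
        if rng.random() < fault_prob:       # fault: lose unsaved work
            done = saved
            continue
        done += 1
        cost = costs[done - 1]
        # No checkpoint is needed once the program has finished.
        if done < n and policy(done - saved, cost):
            elapsed += cost
            saved = done
    return elapsed
```

With alternating costs `[1, 5, 1, 5, 1, 5, 1, 5]` and no faults, `simulate(costs, fixed_interval(2), 0.0)` returns 23.0 because every fixed-interval checkpoint lands on an expensive point, while `simulate(costs, cost_aware(1.0, 3), 0.0)` returns 12.0 because the cost-aware rule only uses the cheap points. The gap depends entirely on the assumed cost pattern and thresholds.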

Item Type:Report or Paper (Technical Report)
ORCID:Bruck, Jehoshua 0000-0001-8474-0812
Group:Parallel and Distributed Systems Group
Record Number:CaltechPARADISE:1995.ETR006
Persistent URL:
Usage Policy:You are granted permission for individual, educational, research and non-commercial reproduction, distribution, display and performance of this work in any format.
ID Code:26068
Deposited By: Imported from CaltechPARADISE
Deposited On:04 Sep 2002
Last Modified:22 Nov 2019 09:58
