Process checkpointing is a procedure which periodically saves the process s
tates into stable storage. Most checkpointing facilities select hard disks
for archiving. However, the disk seek time is limited by the speed of the r
ead-write heads, thus checkpointing process into a local disk requires exte
nsive disk bandwidth. In this paper, we propose an approach that exploits t
he memory on idle workstations as a faster storage for checkpointing. In ou
r scheme, autonomous machines which submit jobs to the computation server o
ffer their physical memory to the server for job checkpointing. Eight appli
cations are used to measure the remote memory performance in four checkpoin
ting policies. Experimental results show that remote memory reduces at leas
t 34.5 per cent of the overhead for sequential checkpointing and 32.1 per c
ent for incremental checkpointing. Additionally, to checkpoint a running pr
ocess into a remote memory requires only 60 per cent of the local disk chec
kpoint latency time. Copyright (C) 1999 John Wiley & Sons, Ltd.