AN ALGORITHM FOR FINDING THE CAUSAL DISTRIBUTED BREAKPOINT

Citation
T. Masuzawa et N. Tokura, AN ALGORITHM FOR FINDING THE CAUSAL DISTRIBUTED BREAKPOINT, Journal of parallel and distributed computing, 42(1), 1997, pp. 60-66
Citations number
8
Categorie Soggetti
Computer Sciences","Computer Science Theory & Methods
ISSN journal
07437315
Volume
42
Issue
1
Year of publication
1997
Pages
60 - 66
Database
ISI
SICI code
0743-7315(1997)42:1<60:AAFFTC>2.0.ZU;2-Q
Abstract
A causal distributed breakpoint is one of the fundamental mechanisms f or debugging distributed programs. It is initiated by a sequential bre akpoint in one process of a distributed computation, and restores each process to the earliest state that reflects all events that happened causally before the sequential breakpoint. This paper presents an algo rithm for finding the causal distributed breakpoint when a sequential breakpoint occurs, To find the causal distributed breakpoint efficient ly, some information about dependency of events is piggybacked in ever y message and is logged at each process. The algorithm requires O(1) i nformation in each message and finds the causal distributed breakpoint in O(n log n + m) time, where n denotes the number of processes and r n denotes the number of distinct pairs of processes directly communica ting with each other, (C) 1997 Academic Press.