Probability models and statistical methods are a popular technique for
evaluating the reliability of computer software. This paper reviews t
he literature concerning these methods, with an emphasis on the histor
ical perspective. The use of stochastic techniques is justified, and t
he various probability models that have been proposed, along with any
associated statistical estimation and inference procedures, are descri
bed. Examples of the models applied to real software failure data are
given. A classic software development problem-how long software should
be tested before it is released into the marketplace-is analyzed from
a decision theoretic standpoint, Finally, the direction of future res
earch is contemplated.