J. Cagan et K. Kotovsky, SIMULATED ANNEALING AND THE GENERATION OF THE OBJECTIVE FUNCTION - A MODEL OF LEARNING DURING PROBLEM-SOLVING, Computational intelligence, 13(4), 1997, pp. 534-581
A computational model of problem solving based on significant aspects
of human problem solving is introduced. It is observed that during pro
blem solving humans often start searching more or less randomly, becom
ing more deterministic over time as they learn more about the problem.
This two-phase aspect of problem-solving behavior and its relation to
learning is one of the important features this model accounts for. Th
e model uses an accelerated simulated annealing technique as a search
mechanism within a real-time dynamic programming-like framework upon a
connected graph of neighboring problem states. The objective value of
each node is adjusted as the model moves between nodes, learning more
accurate values for the nodes and also compensating for misleading he
uristic information as it does so. In this manner the model is shown t
o learn to more effectively solve isomorphs of the Balls and Boxes and
Tower of Hanoi problems. The major issues investigated with the model
are (a) whether such a simulated annealing-based model exhibits the k
ind of random-to-directed transition in behavior exhibited by people,
and (b) whether the progressive discovery of the objective function, e
ven when given very little or poor initial information, is a plausible
method for representing the learning that occurs during problem solvi
ng and the knowledge that results from that learning.