Goal directed adaptive behavior in second-order neural networks: The MAXSON family of architectures

Citation
Fl. Crabbe et Mg. Dyer, Goal directed adaptive behavior in second-order neural networks: The MAXSON family of architectures, ADAPT BEHAV, 8(2), 2001, pp. 149-172
Citations number
37
Categorie Soggetti
Psycology
Journal title
ADAPTIVE BEHAVIOR
ISSN journal
10597123 → ACNP
Volume
8
Issue
2
Year of publication
2001
Pages
149 - 172
Database
ISI
SICI code
1059-7123(200121)8:2<149:GDABIS>2.0.ZU;2-W
Abstract
The paper presents a neural network architecture (MAXSON) based on second-o rder connections that can learn a multiple goal approach/avoid task using r einforcement from the environment. It also enables an agent to learn vicari ously, from the successes and failures of other agents. The paper shows tha t MAXSON can learn certain spatial navigation tasks much faster than tradit ional Q-learning, as well as learn goal directed behavior, increasing the a gent's chances of long-term survival. The paper shows that an extension of MAXSON (V-MAXSON) enables agents to learn vicariously, and this improves th e overall survivability of the agent population.