ADDING TEMPORARY MEMORY TO ZCS

Authors
Citation
D. Cliff et S. Ross, ADDING TEMPORARY MEMORY TO ZCS, Adaptive behavior, 3(2), 1994, pp. 101-150
Citations number
33
Categorie Soggetti
Social, Sciences, Interdisciplinary",Psychology
Journal title
ISSN journal
10597123
Volume
3
Issue
2
Year of publication
1994
Pages
101 - 150
Database
ISI
SICI code
1059-7123(1994)3:2<101:ATMTZ>2.0.ZU;2-#
Abstract
In a recent article, Wilson (1994) described a ''zeroth-level'' classi fier system (ZCS). ZCS employs a reinforcement learning technique comp arble to Q-learning (Watkins, 1989). This article presents results fro m the first reconstruction of ZCS. Having replicated Wilson's results, we extend ZCS in a manner suggested by Wilson: The original formulati on of ZCS has no memory mechanisms, but Wilson (1994b) suggested how i nternal ''temporary memory'' registers could be added. We show results from adding one-bit and two-bit memory registers to ZCS. Our results demonstrate that ZCS can exploit memory facilities efficiently in non- Markov environments. We also show that the memoryless ZCS can converge on near-optimal stochastic solutions in non-Markov environments. We t hen present results from trials using ZCS in Markov environments that require increasingly long chains of actions before reward is received. Our results indicate that inaccurate overgeneral classifiers can inte ract with the classifier-generation mechanisms to cause catastrophic b reakdowns in overall system performance. Basing classifier fitness on accuracy may alleviate this problem. We conclude that the memory mecha nism in its current form is unlikely to scale well for situations requ iring large amounts of temporary memory. Nevertheless, the abiliity to find stochastic solutions when there is insufficient memory might off set this problem somewhat.