Marriott, S. and Harrison, R.F. (1995) A Self-Organising State Space Decoder for Reinforcement Learning. Research Report. ACSE Research Report 582 . Department of Automatic Control and Systems Engineering
Abstract
A novel self-organising architecture, loosely based upon a particular implementation of adaptive resonance theory is proposed here as an alternative to the fixed state space decoder in the seminal implementation of reinforcement learning of Barto, Sutton and Anderson. A well known non-linear control problem is considered and the results are compared to those of the original study. The objective is to illustrate the possibility of neurocontrollers that adaptively partition state space through experience without the need for a priori knowledge. Input/output pattern pairs, desired state space regions and the network size/topology are not known in advance. Results show that, although learning is not smooth, the novel reinforcement learning implementation introduced here is successful and develops an effective control mapping. The self-organising properties of the new decoder allow the neurocontroller to retain previously learned information and adapt to newly encountered states throughout its operation, on-line. The new decoder increases its information capacity as necessary. The adaptive search element and the adaptive critic element of the original study are retained.
Metadata
Item Type: | Monograph |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | The Department of Automatic Control and Systems Engineering research reports offer a forum for the research output of the academic staff and research students of the Department at the University of Sheffield. Papers are reviewed for quality and presentation by a departmental editor. However, the contents and opinions expressed remain the responsibility of the authors. Some papers in the series may have been subsequently published elsewhere and you are advised to cite the later published version in these instances. |
Keywords: | Adaptive-resonance theory; Fuzzy ARTMAP,; Reinforcement learning; Incremental clustering; Euclidean clustering; Neurocontroller; Self organising systems. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Automatic Control and Systems Engineering (Sheffield) > ACSE Research Reports |
Depositing User: | MRS ALISON THERESA BARNETT |
Date Deposited: | 11 Aug 2014 10:41 |
Last Modified: | 24 Oct 2016 19:28 |
Status: | Published |
Publisher: | Department of Automatic Control and Systems Engineering |
Series Name: | ACSE Research Report 582 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:80095 |