Marriott, S. and Harrison, R.F. (1995) A Self-Organising State Space Decoder for Reinforcement Learning. Research Report. ACSE Research Report 569 . Department of Automatic Control and Systems Engineering
Abstract
A self-organising architecture, loosely based upon a particular implementation of adaptive resonance theory ( ART) is used here as an alternative to the fixed decoder in the seminal implementation of reinforcement learning of Barto, Sutton and Anderson (BSA). The cart-pole problem is considered and the results are compared to those of the original study. The objective is to illustrate the possibility of controllers that partition state space through experience without the need for a priori knowledge. Input/output pattern pairs, desired state space regions and the network size/topology are not known in advance. Results show that, although learning is not smooth, the reinforcement learning implementation considered here is successful and learns a meaningful control mapping. The adaptive search element and the adaptive critic element of the original (BSA) study are retained.
Metadata
Item Type: | Monograph |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | The Department of Automatic Control and Systems Engineering research reports offer a forum for the research output of the academic staff and research students of the Department at the University of Sheffield. Papers are reviewed for quality and presentation by a departmental editor. However, the contents and opinions expressed remain the responsibility of the authors. Some papers in the series may have been subsequently published elsewhere and you are advised to cite the later published version in these instances. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Automatic Control and Systems Engineering (Sheffield) > ACSE Research Reports |
Depositing User: | MRS ALISON THERESA BARNETT |
Date Deposited: | 04 Aug 2014 08:21 |
Last Modified: | 03 Nov 2016 02:49 |
Status: | Published |
Publisher: | Department of Automatic Control and Systems Engineering |
Series Name: | ACSE Research Report 569 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:80000 |