White Rose University Consortium logo
University of Leeds logo University of Sheffield logo York University logo

The application of adaptive resonance theory and reinforcement learning to mapping and control.

Marriot, Shaun (1996) The application of adaptive resonance theory and reinforcement learning to mapping and control. PhD thesis, University of Sheffield.

[img] Text (242276.pdf)

Download (23Mb)


In this thesis, the ideas of Adaptive Resonance Theory (ART) and Reinforcement Learning (RL) are applied to the problems of mapping and control. A neural architecture, fuzzy ARTMAP is considered as an alternative to standard feedforward networks for noisy mapping tasks. It is one of a series of architectures based upon ART. Fuzzy ARTMAP has advantages over feedforward networks--such as increased autonomy- and is especially suited to classification-type problems. Here it is used to estimate a continuous mapping from noisy data. Results show that properties useful for classification problems are not necessarily advantageous for noisy mapping problems. One particular feature is found to cause specialisation to the data. A modified variant is proposed which stores probability information in a sub-unit of the architecture. The proposed fuzzy ARTMAP variant is found to outperform fuzzy ARTMAP in a mapping task. Another novel self-organising architecture, loosely based upon a particular implementation of ART, is proposed here as an alternative to the fixed state-space decoder in a seminal implementation of reinforcement learning. A well-known non-linear control problem is considered. Input / output pattern pairs, desired state-space regions and the network size / topology are not known in advance. Results show that, although learning is not smooth, the novel ART-based RL implementation is successful and develops a meaningful control mapping. The new decoder increases its information capacity as necessary and indicates that such a self-organising approach to control is viable. The self-organising properties of the new decoder allow the neurocontroller to retain previously learned information and to adapt to newly encountered states throughout its operation, on-line. A fuzzy version of the original RL implementation is implemented to investigate the possibility of distributing control information across more than one state-space region. The fuzzy version is found to outperform the original RL implementation in a control task.

Item Type: Thesis (PhD)
Keywords: Control systems & control theory
Academic Units: The University of Sheffield > Faculty of Engineering (Sheffield) > Automatic Control and Systems Engineering (Sheffield)
Identification Number/EthosID: uk.bl.ethos.242276
Depositing User: EThOS Import Sheffield
Date Deposited: 30 Jun 2014 13:28
Last Modified: 30 Jun 2014 13:28
URI: http://etheses.whiterose.ac.uk/id/eprint/5974

You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.

Actions (repository staff only: login required)