Rudd-Orthner, R. and Mihaylova, L. (2019) Numerical discrimination of the generalisation model from learnt weights in neural networks. In: Proceedings of 2019 International Conference on Computing, Electronics & Communications Engineering (iCCECE). International Conference on Computing, Electronics & Communications Engineering (iCCECE), 22-23 Aug 2019, London, UK. IEEE , pp. 235-240. ISBN 9781728121390
Abstract
This research demonstrates a method of discriminating the numerical relationships of neural network inputs to the outputs established from the learnt weights and biases of a neural network's generalisation model. It is demonstrated with a mathematical form of a neural network rather than an image, speech or textual translation application as this provides clarity in the understanding gained from the generalisation model. It is also reliant on the input format but that format is not unlike a image pixel input format and as such the research is applicable to other complex applications too. The research results will show that weight and biases can be used to discriminate the mathematical relationships between inputs and make discriminations of what mathematical operators are used between then in the learnt generalisation model. This may be a step towards gaining definitions and understanding for intractable problems that a Neural Network has generalised in a solution, for validating them or as a mechanism for creating a model used as an alternative to traditional approaches and derived from a neural network approach as a development tool for solving those problems. The demonstrated method was optimised using learning rate and the number of nodes and in this example achieves a low loss at 7.6e-6, a low Mean Absolute Error at 1e-3 with a high accuracy score of 1.0. But during the experiments a sensitivity to the number of epochs and the use of the random shuffle was discovered, and a comparison with an alternative shuffle using a non-random reordering demonstrated a lower but comparable performance, and is a subject for further research but demonstrated in this “decomposition” class architecture.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2019 IEEE. |
Keywords: | Weight capture; Information Assurance; Safety-Critical AI; Decomposition Rule-Extraction |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Automatic Control and Systems Engineering (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 06 Aug 2019 14:54 |
Last Modified: | 12 Feb 2020 16:53 |
Status: | Published |
Publisher: | IEEE |
Refereed: | Yes |
Identification Number: | 10.1109/iCCECE46942.2019.8941988 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:149294 |