Rudd-Orthner, R. and Mihaylova, L. (2019) Non-random weight initialisation in deep learning networks for repeatable determinism. In: 2019 IEEE 10th International Conference on Dependable Systems, Services and Technologies (DESSERT). 10th International Conference Dependable Systems, Services and Technologies, 05-07 Jun 2019, Leeds, United Kingdom. IEEE ISBN 9781728117348
Abstract
This research examines the change in weight values of deep learning networks after learning. The experiments require measurements and comparisons against a stable, known set of weights and biases before and after learning, so that comparisons after learning are repeatable and the experiment is controlled. The currently accepted schemes of random weight initialisation may therefore need to be deterministic rather than stochastic, minimising run-to-run variation so that the weight initialisations are not a varying contributor. This paper examines the viability of non-random weight initialisation schemes, used in place of the random weight initialisations of an established, well-understood test case. Viable non-random weight initialisation schemes may make a network more deterministic across learning sessions, which is a desirable property in mission- and safety-critical systems. The paper uses a variety of schemes over number ranges and gradients, achieving a 97.97% accuracy figure, just 0.18% less than the original random-number scheme at 98.05%. The results suggest that in this case it may be the number range, and not the gradient, that most dominantly affects the achieved accuracy, although the number range may be coupled with the activation functions used. Unexpectedly, an effect of numerical instability is discovered from run to run when running on a multi-core CPU. The paper also shows how consistent deterministic results can be enforced on a multi-core CPU by defining atomic critical code regions, aiding repeatable Information Assurance (IA) in model fitting (or learning sessions).
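The abstract does not reproduce the paper's specific initialisation schemes, but the idea of a non-random, repeatable initialiser can be sketched minimally. In this illustrative example (the function name, range, and linear spacing are assumptions, not the paper's scheme), random draws are replaced by linearly spaced values over a fixed number range, so every run starts from identical weights:

```python
import numpy as np

def linear_init(shape, low=-0.5, high=0.5):
    """Deterministic weight initialiser: linearly spaced values over
    [low, high], reshaped to the requested weight-matrix shape.
    Unlike a random scheme, repeated calls yield identical weights."""
    n = int(np.prod(shape))
    return np.linspace(low, high, n).reshape(shape)

# Two independent "runs" produce bit-identical starting weights,
# so any post-learning weight change is attributable to learning alone.
w_run1 = linear_init((4, 3))
w_run2 = linear_init((4, 3))
```

Under such a scheme the number range (`low`, `high`) is an explicit parameter, which is the quantity the paper reports as most dominantly affecting accuracy.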
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: | Rudd-Orthner, R.; Mihaylova, L. |
| Copyright, Publisher and Additional Information: | © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works. Reproduced in accordance with the publisher's self-archiving policy. |
| Keywords: | Deep learning; Safety; Mission critical systems; Task analysis; Fitting |
| Dates: | |
| Institution: | The University of Sheffield |
| Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Automatic Control and Systems Engineering (Sheffield) |
| Depositing User: | Symplectic Sheffield |
| Date Deposited: | 28 May 2019 11:48 |
| Last Modified: | 25 Jul 2020 00:38 |
| Status: | Published |
| Publisher: | IEEE |
| Refereed: | Yes |
| Identification Number: | 10.1109/DESSERT.2019.8770007 |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:146436 |