Ruiz-Garcia, A, Elshaw, M, Altahhan, A orcid.org/0000-0003-1133-7744 et al. (1 more author) (2017) Stacked deep convolutional auto-encoders for emotion recognition from facial expressions. In: 2017 International Joint Conference on Neural Networks (IJCNN). International Joint Conference on Neural Networks (IJCNN), 14-19 May 2017, Anchorage, Alaska, USA. IEEE, pp. 1586-1593. ISBN 978-1-5090-6183-9
Abstract
Emotion recognition is critical for everyday living and is essential for meaningful interaction. If we are to progress towards human-machine interaction that engages the human user, the machine should be able to recognize the emotional state of the user. Deep Convolutional Neural Networks (CNNs) have proven to be efficient in emotion recognition problems. The good performance achieved by these classifiers can be attributed to their ability to self-learn a down-sampled feature vector that retains spatial information through filter kernels in convolutional layers. Given the view that random initialization of weights can lead to convergence to non-optimal local minima, in this paper we explore the impact of training the initial weights in an unsupervised manner. We study the effect of pre-training a Deep CNN as a Stacked Convolutional Auto-Encoder (SCAE) in a greedy layer-wise unsupervised fashion for emotion recognition using facial expression images. When trained with randomly initialized weights, our CNN emotion recognition model achieves a performance rate of 91.16% on the Karolinska Directed Emotional Faces (KDEF) dataset. In contrast, when each layer of the model, including the hidden layer, is pre-trained as an Auto-Encoder, the performance increases to 92.52%. Pre-training our CNN as an SCAE also reduces training time marginally. The emotion recognition model developed in this work will form the basis of a real-time empathic robot system.
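The greedy layer-wise pre-training mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' model: for brevity it swaps the convolutional auto-encoders for small fully connected, tied-weight auto-encoders written in NumPy, uses random toy data in place of KDEF images, and omits the supervised fine-tuning stage. The names `pretrain_autoencoder`, `greedy_pretrain`, and `encode`, as well as the layer sizes, learning rate, and epoch count, are assumptions made for illustration only.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def pretrain_autoencoder(x, n_hidden, rng, epochs=300, lr=0.5):
    """Fit one tied-weight auto-encoder to x by gradient descent on the MSE.

    Returns the encoder parameters (W, b) and the per-epoch reconstruction loss.
    """
    n_in = x.shape[1]
    W = rng.normal(0.0, 0.1, size=(n_in, n_hidden))
    b = np.zeros(n_hidden)  # encoder bias
    c = np.zeros(n_in)      # decoder bias
    losses = []
    for _ in range(epochs):
        h = sigmoid(x @ W + b)    # encode
        r = sigmoid(h @ W.T + c)  # decode, reusing W (tied weights)
        err = r - x
        losses.append(float(np.mean(err ** 2)))
        # Gradients of the mean squared error w.r.t. the two pre-activations.
        d_r = (2.0 / x.size) * err * r * (1.0 - r)
        d_h = (d_r @ W) * h * (1.0 - h)
        # W receives one contribution from the encoder and one from the decoder.
        W -= lr * (x.T @ d_h + d_r.T @ h)
        b -= lr * d_h.sum(axis=0)
        c -= lr * d_r.sum(axis=0)
    return (W, b), losses

def greedy_pretrain(x, layer_sizes, seed=0):
    """Pre-train one auto-encoder per layer, each on the codes of the previous one."""
    rng = np.random.default_rng(seed)
    params, histories = [], []
    inputs = x
    for n_hidden in layer_sizes:
        (W, b), losses = pretrain_autoencoder(inputs, n_hidden, rng)
        params.append((W, b))
        histories.append(losses)
        inputs = sigmoid(inputs @ W + b)  # frozen codes feed the next layer
    return params, histories

def encode(x, params):
    """Forward pass through the stacked, pre-trained encoders."""
    for W, b in params:
        x = sigmoid(x @ W + b)
    return x

# Toy data standing in for flattened image patches (assumption, not KDEF).
data = np.random.default_rng(1).random((64, 16)) ** 2
params, histories = greedy_pretrain(data, layer_sizes=[8, 4])
codes = encode(data, params)
print(codes.shape)
for i, losses in enumerate(histories):
    print(f"layer {i}: reconstruction loss {losses[0]:.4f} -> {losses[-1]:.4f}")
```

In a full pipeline, the pre-trained `(W, b)` pairs would initialize the corresponding layers of the classifier, which would then be fine-tuned with labels; a convolutional version would replace the matrix products with convolution and up-sampling steps.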
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: | |
Copyright, Publisher and Additional Information: | © 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
Keywords: | Emotion recognition, Training, Feature extraction, Convolution, Robots, Kernel, Convergence |
Dates: | |
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 24 Nov 2020 11:39 |
Last Modified: | 25 Nov 2020 19:31 |
Status: | Published |
Publisher: | IEEE |
Identification Number: | 10.1109/IJCNN.2017.7966040 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:168215 |