Malinin, A., Ragni, A. orcid.org/0000-0003-0634-4456, Knill, K. et al. (1 more author) (2017) Incorporating uncertainty into deep learning for spoken language assessment. In: Barzilay, R. and Kan, M.-Y., (eds.) Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2 : Short Papers). 55th Annual Meeting of the Association for Computational Linguistics, 30 Jul - 04 Aug 2017, Vancouver, Canada. Association for Computational Linguistics ISBN 9781945626760
Abstract
There is a growing demand for automatic assessment of spoken English proficiency. These systems need to handle large variations in input data owing to the wide range of candidate skill levels and L1s, and errors from ASR. Some candidates will be a poor match to the training data set, undermining the validity of the predicted grade. For high stakes tests it is essential for such systems not only to grade well, but also to provide a measure of their uncertainty in their predictions, enabling rejection to human graders. Previous work examined Gaussian Process (GP) graders which, though successful, do not scale well with large data sets. Deep Neural Network (DNN) may also be used to provide uncertainty using Monte-Carlo Dropout (MCD). This paper proposes a novel method to yield uncertainty and compares it to GPs and DNNs with MCD. The proposed approach explicitly teaches a DNN to have low uncertainty on training data and high uncertainty on generated artificial data. On experiments conducted on data from the Business Language Testing Service (BULATS), the proposed approach is found to outperform GPs and DNNs with MCD in uncertainty-based rejection whilst achieving comparable grading performance.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | © 2017 Association for Computational Linguistics. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 08 Nov 2019 14:41 |
Last Modified: | 08 Nov 2019 14:41 |
Status: | Published |
Publisher: | Association for Computational Linguistics |
Refereed: | Yes |
Identification Number: | 10.18653/v1/p17-2008 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:152824 |