Knill, K.M., Gales, M.J.F., Kyriakopoulos, K. et al. (2 more authors) (2017) Use of graphemic lexicons for spoken language assessment. In: Proceedings of Interspeech 2017. Interspeech 2017, 20-24 Aug 2017, Stockholm, Sweden. International Speech Communication Association (ISCA) , pp. 2774-2778.
Abstract
Automatic systems for practice and exams are essential to support the growing worldwide demand for learning English as an additional language. Assessment of spontaneous spoken English is, however, currently limited in scope due to the difficulty of achieving sufficient automatic speech recognition (ASR) accuracy. “Off-the-shelf” English ASR systems cannot model the exceptionally wide variety of accents, pronunciations and recording conditions found in non-native learner data. Limited training data for different first languages (L1s), across all proficiency levels, often with (at most) crowd-sourced transcriptions, limits the performance of ASR systems trained on non-native English learner speech. This paper investigates whether the effect of one source of error in the system, lexical modelling, can be mitigated by using graphemic lexicons in place of phonetic lexicons based on native speaker pronunciations. Graphemic-based English ASR is typically worse than phonetic-based due to the irregularity of English spelling-to-pronunciation but here lower word error rates are consistently observed with the graphemic ASR. The effect of using graphemes on automatic assessment is assessed on different grader feature sets: audio and fluency derived features, including some phonetic level features; and phone/grapheme distance features which capture a measure of pronunciation ability.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2017 ISCA. Reproduced in accordance with the publisher's self-archiving policy. |
Keywords: | graphemic speech recognition; spoken language assessment; automatic grading; non-native speakers |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 21 Nov 2019 12:27 |
Last Modified: | 21 Nov 2019 12:27 |
Status: | Published |
Publisher: | International Speech Communication Association (ISCA) |
Refereed: | Yes |
Identification Number: | 10.21437/interspeech.2017-978 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:152765 |