Nicolao, M. orcid.org/0000-0002-4680-2549, Sanders, M. and Hain, T. orcid.org/0000-0003-0939-3464 (2018) Improved acoustic modelling for automatic literacy assessment of children. In: Proceedings of Interspeech 2018. Interspeech 2018, 02-06 Sep 2018, Hyderabad, India. ISCA , pp. 1666-1670.
Abstract
Automatic literacy assessment of children is a complex task that normally requires carefully annotated data. This paper focuses on a system for the assessment of reading skills, aiming to detection of a range of fluency and pronunciation errors. Naturally, reading is a prompted task, and thereby the acquisition of training data for acoustic modelling should be straightforward. However, given the prominence of errors in the training set and the importance of labelling them in the transcription, a lightly supervised approach to acoustic modelling has better chances of success. A method based on weighted finite state transducers is proposed, to model specific prompt corrections, such as repetitions, substitutions, and deletions, as observed in real recordings. Iterative cycles of lightly-supervised training are performed in which decoding improves the transcriptions and the derived models. Improvements are due to increasing accuracy in phone-to-sound alignment and in the training data selection. The effectiveness of the proposed methods for rela-belling and acoustic modelling is assessed through experiemnts on the CHOREC corpus, in terms of sequence error rate and alignment accuracy. Improvements over the baseline of up to 60% and 23.3% respectively are observed.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2018 ISCA. Reproduced in accordance with the publisher's self-archiving policy. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Funding Information: | Funder Grant number ITSLANGUAGE BV none |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 29 Oct 2018 14:16 |
Last Modified: | 29 Oct 2018 14:17 |
Published Version: | https://doi.org/10.21437/Interspeech.2018-2118 |
Status: | Published |
Publisher: | ISCA |
Refereed: | Yes |
Identification Number: | 10.21437/Interspeech.2018-2118 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:137868 |