Sehgal, S. and Cunningham, S. orcid.org/0000-0001-9418-8726 (2015) Model adaptation and adaptive training for the recognition of dysarthric speech. In: Alexandersson, J., Altinsoy, E., Christensen, H., Ljunglöf , P., Portet , F. and Rudzicz , F., (eds.) Proceedings of SLPAT 2015: 6th Workshop on Speech and Language Processing for Assistive Technologies. 6th Workshop on Speech and Language Processing for Assistive Technologies, 11/09/2015, Dresden, Germany. Association for Computational Linguistics , pp. 65-71.
Abstract
Dysarthria is a neurological speech disorder, which exhibits multi-fold disturbances in the speech production system of an individual and can have a detrimental effect on the speech output. In addition to the data sparseness problems, dysarthric speech is characterised by inconsistencies in the acoustic space making it extremely challenging to model. This paper investigates a variety of baseline speaker independent (SI) systems and its suitability for adaptation. The study also explores the usefulness of speaker adaptive training (SAT) for implicitly annihilating inter-speaker variations in a dysarthric corpus. The paper implements a hybrid MLLR-MAP based approach to adapt the SI and SAT systems. ALL the results reported uses UASPEECH dysarthric data. Our best adapted systems gave a significant absolute gain of 11.05% (20.42% relative) over the last published best result in the literature. A statistical analysis performed across various systems and its specific implementation in modelling different dysarthric severity sub-groups, showed that, SAT-adapted systems were more applicable to handle disfluencies of more severe speech and SI systems prepared from typical speech were more apt for modelling speech with low level of severity.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | © 2015 The Association for Computational Linguistics. Article licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License (https://creativecommons.org/licenses/by-nc-sa/3.0/). Permission is granted to make copies for the purposes of teaching and research. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Medicine, Dentistry and Health (Sheffield) > Department of Human Communication Sciences (Sheffield) The University of Sheffield > Faculty of Medicine, Dentistry and Health (Sheffield) > School of Health and Related Research (Sheffield) > ScHARR - Sheffield Centre for Health and Related Research |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 27 Jul 2017 09:40 |
Last Modified: | 27 Jul 2017 09:40 |
Published Version: | https://doi.org/10.18653/v1/W15-5112 |
Status: | Published |
Publisher: | Association for Computational Linguistics |
Refereed: | Yes |
Identification Number: | 10.18653/v1/W15-5112 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:119401 |