Roupakia, Z., Ragni, A. (orcid.org/0000-0003-0634-4456) and Gales, M.J.F. (2012) Rapid nonlinear speaker adaptation for large-vocabulary continuous speech recognition. In: INTERSPEECH 2012: 13th Annual Conference of the International Speech Communication Association, 09-13 Sep 2012, Portland, OR, USA. International Speech Communication Association (ISCA), pp. 1784-1787.
Abstract
Recently, kernel eigenvoices were revisited using kernel representations of distributions for rapid nonlinear speaker adaptation. These representations ensure the validity of the adapted distribution functions and enable expectation-maximisation training. Though gains in word error rate have been shown for rapid speaker adaptation, this approach increases the decoding cost because the number of likelihood evaluations grows. The present paper addresses this issue by providing a coherent framework for systematic probabilistic approaches aimed at reducing the recognition cost while still yielding equally powerful adapted models. The common denominator of such approaches is the use of probabilistic criteria, such as the Kullback-Leibler divergence. However, in the general case, the resulting adapted models have full covariance matrices. To overcome this issue, the use of predictive semi-tied transforms to yield diagonal covariances for decoding is investigated. Experimental results are presented on a large-vocabulary conversational telephone task.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: | Roupakia, Z.; Ragni, A. (orcid.org/0000-0003-0634-4456); Gales, M.J.F. |
Copyright, Publisher and Additional Information: | © 2012 International Speech Communication Association (ISCA). |
Keywords: | kernel eigenvoices; compact nonlinear adaptation; Kullback Leibler divergence |
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 15 Nov 2019 11:23 |
Last Modified: | 15 Nov 2019 11:23 |
Published Version: | https://www.isca-speech.org/archive/interspeech_20... |
Status: | Published |
Publisher: | International Speech Communication Association (ISCA) |
Refereed: | Yes |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:152849 |