Loweimi, E., Barker, J.P. and Hain, T. orcid.org/0000-0003-0939-3464 (2018) Exploring the use of group delay for generalised VTS based noise compensation. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 15-20 Apr 2018, Calgary, Alberta, Canada. IEEE ISBN 978-1-5386-4658-8
Abstract
In earlier work we studied the effect of statistical normalisation for phase-based features and observed it leads to a significant robustness improvement. This paper explores the extension of the generalised Vector Taylor Series (gVTS) noise compensation approach to the group delay (GD) domain. We discuss the problems it presents, propose some solutions and derive the corresponding formulae. Furthermore, the effects of additive and channel noise in the GD domain were studied. It was observed that the GD of the noisy observation is a convex combination of the GDs of the clean signal and the additive noise and also in the expected sense, channel GD tends to zero. Experiments on Aurora-4 showed that, despite training only on the clean speech, the proposed features provide average WER reductions of 0.8% absolute and 4.1% relative compared to an MFCC-based system trained on the multi-style data. Combining the gVTS with a bottleneck DNN-based system led to average absolute (relative) WER improvements of 6.0% (23.5%) when training on clean data and 2.5% (13.8%) when using multi-style training with additive noise.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works. Reproduced in accordance with the publisher's self-archiving policy. |
Keywords: | Robust ASR; generalised VTS; phase spectrum; group delay; product spectrum |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 11 Jun 2018 09:15 |
Last Modified: | 16 Oct 2018 11:31 |
Published Version: | https://doi.org/10.1109/ICASSP.2018.8462595 |
Status: | Published |
Publisher: | IEEE |
Refereed: | Yes |
Identification Number: | 10.1109/ICASSP.2018.8462595 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:131627 |