Liu, Y., Zhang, P. and Hain, T. (2014) Using neural network front-ends on far field multiple microphones based speech recognition. In: Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on. 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 04-09 May 2014, Florence, Italy. IEEE , pp. 5542-5546.
Abstract
This paper presents an investigation of far field speech recognition using beamforming and channel concatenation in the context of Deep Neural Network (DNN) based feature extraction. While speech enhancement with beamforming is attractive, the algorithms are typically signal-based with no information about the special properties of speech. A simple alternative to beamforming is concatenating multiple channel features. Results presented in this paper indicate that channel concatenation gives similar or better results. On average the DNN front-end yields a 25% relative reduction in Word Error Rate (WER). Further experiments aim at including relevant information in training adapted DNN features. Augmenting the standard DNN input with the bottleneck feature from a Speaker Aware Deep Neural Network (SADNN) shows a general advantage over the standard DNN based recognition system, and yields additional improvements for far field speech recognition.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2014 IEEE. This is an author produced version of a paper subsequently published in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on. Uploaded in accordance with the publisher's self-archiving policy. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 19 Jan 2015 10:13 |
Last Modified: | 19 Dec 2022 13:26 |
Published Version: | http://dx.doi.org/10.1109/ICASSP.2014.6854663 |
Status: | Published |
Publisher: | IEEE |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:78166 |