Liu, Y, Fox, C, Hasan, M et al. (1 more author) (2016) The Sheffield Wargame Corpus - day two and day three. In: Proceedings. Interspeech 2016, 08-12 Sep 2016, San Francisco, CA, USA. ISCA , pp. 3833-3837.
Abstract
Improving the performance of distant speech recognition is of considerable current interest, driven by a desire to bring speech recognition into people’s homes. Standard approaches to this task aim to enhance the signal prior to recognition, typically using beamforming techniques on multiple channels. Only few real-world recordings are available that allow experimentation with such techniques. This has become even more pertinent with recent works with deep neural networks aiming to learn beamforming from data. Such approaches require large multi-channel training sets, ideally with location annotation for moving speakers, which is scarce in existing corpora. This paper presents a freely available and new extended corpus of English speech recordings in a natural setting, with moving speakers. The data is recorded with diverse microphone arrays, and uniquely, with ground truth location tracking. It extends the 8.0 hour Sheffield Wargames Corpus released in Interspeech 2013, with a further 16.6 hours of fully annotated data, including 6.1 hours of female speech to improve gender bias. Additional blog-based language model data is provided alongside, as well as a Kaldi baseline system. Results are reported with a standard Kaldi configuration, and a baseline meeting recognition system.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2016 ISCA. Reproduced in accordance with the publisher's self-archiving policy. |
Keywords: | distant speech recognition, multi-channel speech recognition, natural speech corpora, deep neural network. |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Environment (Leeds) > Institute for Transport Studies (Leeds) > ITS: Safety and Technology (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 12 Jul 2016 10:06 |
Last Modified: | 03 Nov 2018 12:39 |
Published Version: | https://doi.org/10.21437/Interspeech.2016 |
Status: | Published |
Publisher: | ISCA |
Identification Number: | 10.21437/Interspeech.2016 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:102045 |