Fox, C and Hain, T (2014) Extending Limabeam with discrimination and coarse gradients. In: 15th Annual Conference of the International Speech Communication Association. Interspeech-2014, 14-18 Sep 2014, Singapore. ISCA , pp. 2240-2444.
Abstract
Limabeam is an approach to multi-microphone array processing for ASR which makes minimal assumptions about system geometry, instead searching for filters to maximise output likelihoods under a speech model. The first results of Limabeam on the AMI meeting corpus are given, then two extensions of the algorithm for this corpus. First, it is shown that the original local gradient following sticks in local minima, and a coarser gradient is used. Second, a new discriminative objective function is provided to handle mismatched silence models. The extensions are based on examination of 2D receptive fields and 2D likelihood maps which are novel near-field analogs of radial beamformer response patterns, but do not show radial symmetry and have many local minima. The extended Limabeam improves WER on TDOA baselines on the AMI corpus, by 1% rel. when both are adapted with decodes and by 19% rel. when both adapted with ground truth.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2014 ISCA. Reproduced in accordance with the publisher's self-archiving policy. |
Keywords: | ASR, beamforming, discriminative |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Environment (Leeds) > Institute for Transport Studies (Leeds) > ITS: Safety and Technology (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 27 Jun 2016 14:36 |
Last Modified: | 03 Jan 2017 13:10 |
Published Version: | http://www.isca-speech.org/archive/interspeech_201... |
Status: | Published |
Publisher: | ISCA |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:87632 |