Sanderson, M. and Shou, X.M. (2007) Search of spoken documents retrieves well recognized transcripts. In: Amati, G., Carpineto, C. and Romano, G., (eds.) Advances in Information Retrieval. 29th European Conference on IR Research, ECIR 2007, Rome, Italy, April 2-5, 2007, Proceedings. Lecture Notes in Computer Science (4425). Springer , pp. 505-516. ISBN 978-3-540-71494-1
Abstract
This paper presents a series of analyses and experiments on spoken document retrieval systems: search engines that retrieve transcripts produced by speech recognizers. Results show that transcripts that match queries well tend to be recognized more accurately than transcripts that match a query less well. This result was described in past literature, however, no study or explanation of the effect has been provided until now. This paper provides such an analysis showing a relationship between word error rate and query length. The paper expands on past research by increasing the number of recognitions systems that are tested as well as showing the effect in an operational speech retrieval system. Potential future lines of enquiry are also described.
Metadata
Item Type: | Book Section |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | © Springer-Verlag Berlin Heidelberg 2007. This is an author produced version of a paper published in ECIR 2007, LNCS 4425. Uploaded in accordance with the publisher's self archving policy. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield) |
Depositing User: | Repository Officer |
Date Deposited: | 08 Sep 2008 17:26 |
Last Modified: | 08 Feb 2013 16:56 |
Published Version: | http://dx.doi.org/10.1007/978-3-540-71496-5 |
Status: | Published |
Publisher: | Springer |
Series Name: | Lecture Notes in Computer Science |
Identification Number: | 10.1007/978-3-540-71496-5 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:4588 |