Mixing and merging for spoken document retrieval

Sanderson, M. and Crestani, F. (1998) Mixing and merging for spoken document retrieval. In: Nikolaou, C. and Stephanidis, C., (eds.) Research and Advanced Technology for Digital Libraries. Second European Conference, ECDL'98, Heraklion, Crete, Greece, September 21-23, 1998, Proceedings. Lecture Notes in Computer Science (1513). Springer , p. 518. ISBN 978-3-540-65101-7

Abstract

This paper describes a number of experiments that explo- red the issues surrounding the retrieval of spoken documents. Two such issues were examined. First, attempting to find the best use of speech recogniser output to produce the highest retrieval effectiveness. Second, investigating the potential problems of retrieving from a so-called "mi- xed collection", i.e. one that contains documents from both a speech recognition system (producing many errors) and from hand transcription (producing presumably near perfect documents). The result of the first part of the work found that merging the transcripts of multiple recognisers showed most promise. The investigation in the second part showed how the term weighting scheme used in a retrieval system was important in determining whether the system was affected detrimentally when retrieving from a mixed collection.

Metadata

Authors/Creators:	Sanderson, M. Crestani, F.
Copyright, Publisher and Additional Information:	© 1998 Springer. This is an author produced version of a published paper. Uploaded in accordance with the publisher's self-archiving policy.
Dates:	Published: January 1998
Institution:	The University of Sheffield
Academic Units:	The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield)
Depositing User:	Repository Officer
Date Deposited:	08 Sep 2008 17:24
Last Modified:	19 Jun 2014 00:33
Published Version:	http://dx.doi.org/10.1007/3-540-49653-X_24
Status:	Published
Publisher:	Springer
Series Name:	Lecture Notes in Computer Science
Identification Number:	https://doi.org/10.1007/3-540-49653-X_24

CORE (COnnecting REpositories)

Mixing and merging for spoken document retrieval

Abstract

Metadata

Download

Euro_DL_2nd

Export

Statistics