White Rose University Consortium logo
University of Leeds logo University of Sheffield logo York University logo

Experiments on data fusion using headline information

Shou, X.M. and Sanderson, M. (2002) Experiments on data fusion using headline information. In: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval. Annual ACM Conference on Research and Development in Information Retrieval, August 11 - 15, 2002, Tampere, Finland. ACM , New York , pp. 413-414. ISBN 1-58113-561-0

Full text available as:

Abstract

This poster describes initial work exploring a relatively unexamined area of data fusion: fusing the results of retrieval systems whose collections have no overlap between them. Many of the effective meta-search/data fusion strategies gain much of their success from exploiting document overlap across the source systems being merged. When the intersection of the collections is the empty set, the strategies generally degrade to a simpler form. In order to address such situations, two strategies were examined: re-ranking of merged results using a locally run search on the text fragments returned by the source search engines; and re-ranking based on cross document similarity, again using text fragments presented in the retrieved list. Results, from experiments, which go beyond previous work, indicate that both strategies improve fusion effectiveness.

Item Type: Proceedings Paper
Copyright, Publisher and Additional Information: © 2002 The Authors. This is an author produced version of a paper published in "Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval". Uploaded in accordance with the publisher's self-archiving policy.
Academic Units: The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield)
Depositing User: Repository Officer
Date Deposited: 03 Sep 2008 11:52
Last Modified: 08 Feb 2013 16:56
Published Version: http://dx.doi.org/10.1145/564376.564470
Status: Published
Publisher: ACM
Identification Number: 10.1145/564376.564470
URI: http://eprints.whiterose.ac.uk/id/eprint/4596

Actions (login required)

View Item View Item