
Word sense disambiguation and information retrieval

Sanderson, M. (1994) Word sense disambiguation and information retrieval. In: Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval. SIGIR '94, July 03 - 06, 1994, Dublin, Ireland. Springer, New York, pp. 142-151. ISBN 0-387-19889-X

Full text available as: SIGIR94.pdf (90Kb)

Abstract

It has often been thought that word sense ambiguity is a cause of poor performance in Information Retrieval (IR) systems. The belief is that if ambiguous words can be correctly disambiguated, IR performance will increase. However, recent research into the application of a word sense disambiguator to an IR system failed to show any performance increase. From these results it has become clear that more basic research is needed to investigate the relationship between sense ambiguity, disambiguation, and IR.

Using a technique that introduces additional sense ambiguity into a collection, this paper presents research that goes beyond previous work in this field to reveal the influence that ambiguity and disambiguation have on a probabilistic IR system. We conclude that word sense ambiguity is only problematic to an IR system when it is retrieving from very short queries. In addition we argue that if a word sense disambiguator is to be of any use to an IR system, the disambiguator must be able to resolve word senses to a high degree of accuracy.
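
The abstract does not say how the additional ambiguity is introduced. As a rough illustration only, the sketch below assumes a pseudo-word style approach, in which groups of unrelated words are replaced by one shared artificial token so that each token carries several "senses". The function names, the "/" joining convention, and the example word groups are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List


def build_pseudoword_map(groups: List[List[str]]) -> Dict[str, str]:
    """Map every word in a group to one shared pseudo-word token.

    Assumption: the pseudo-word is the group's words joined by "/".
    """
    mapping: Dict[str, str] = {}
    for group in groups:
        pseudo = "/".join(group)
        for word in group:
            mapping[word] = pseudo
    return mapping


def add_ambiguity(tokens: List[str], mapping: Dict[str, str]) -> List[str]:
    """Replace each mapped token with its pseudo-word; leave others unchanged."""
    return [mapping.get(tok, tok) for tok in tokens]


# Hypothetical word groups chosen only for illustration.
groups = [["bank", "spring"], ["banana", "door"]]
mapping = build_pseudoword_map(groups)

doc = "the bank by the spring sold a banana".split()
ambiguous_doc = add_ambiguity(doc, mapping)
print(ambiguous_doc)
```

Applying the same mapping to both documents and queries would give a collection with a controlled amount of extra ambiguity. Because the original word behind each pseudo-word is known, retrieval runs on the original and the modified collection could then be compared.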

Item Type: Proceedings Paper
Institution: The University of Sheffield
Academic Units: The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield)
Depositing User: Repository Officer
Date Deposited: 18 Nov 2008 13:11
Last Modified: 05 Jun 2014 08:57
Status: Published
Publisher: Springer
URI: http://eprints.whiterose.ac.uk/id/eprint/4922
