White Rose University Consortium logo
University of Leeds logo University of Sheffield logo York University logo

Comparing knowledge sources for nominal anaphora resolution

Markert, K. and Nissim, M. (2005) Comparing knowledge sources for nominal anaphora resolution. Computational Linguistics, 31 (3). pp. 367-402. ISSN 0891-2017

Full text available as:
[img]
Preview
Text
KatjaCL2005.pdf
Available under License : See the attached licence file.

Download (274Kb)

Abstract

We compare two ways of obtaining lexical knowledge for antecedent selection in other-anaphora and definite noun phrase coreference. Specifically, we compare an algorithm that relies on links encoded in the manually created lexical hierarchy WordNet and an algorithm that mines corpora by means of shallow lexico-semantic patterns. As corpora we use the British National Corpus (BNC), as well as the Web, which has not been previously used for this task. Our results show that (a) the knowledge encoded in WordNet is often insufficient, especially for anaphor-antecedent relations that exploit subjective or context-dependent knowledge; (b) for other-anaphora, the Web-based method outperforms the WordNet-based method; (c) for definite NP coreference, the Web-based method yields results comparable to those obtained using WordNet over the whole dataset and outperforms the WordNet-based method on subsets of the dataset; (d) in both case studies, the BNC-based method is worse than the other methods because of data sparseness. Thus, in our studies, the Web-based method alleviated the lexical knowledge gap often encountered in anaphora resolution, and handled examples with context-dependent relations between anaphor and antecedent. Because it is inexpensive and needs no hand-modelling of lexical knowledge, it is a promising knowledge source to integrate in anaphora resolution systems.

Item Type: Article
Copyright, Publisher and Additional Information: © 2005 Association for Computational Linguistics. This is an author produced version of a paper published in Computational Linguistics. Uploaded in accordance with the publisher's self-archiving policy.
Academic Units: The University of Leeds > Faculty of Engineering (Leeds) > School of Computing (Leeds)
Depositing User: Mrs Irene Rudling
Date Deposited: 20 Jan 2009 10:37
Last Modified: 08 Feb 2013 17:05
Published Version: http://dx.doi.org/10.1162/089120105774321064
Status: Published
Publisher: MIT Press
Refereed: Yes
Identification Number: 10.1162/089120105774321064
URI: http://eprints.whiterose.ac.uk/id/eprint/5389

Actions (login required)

View Item View Item