Yu, J., Moosavi, N.S. orcid.org/0000-0002-8332-307X, Paun, S. et al. (1 more author)
(2020)
Free the plural: unrestricted split-antecedent anaphora resolution.
In: Scott, D., Bel, N. and Zong, C., (eds.)
Proceedings of the 28th International Conference on Computational Linguistics.
COLING'2020: 28th International Conference on Computational Linguistics, 08-13 Dec 2020, Barcelona, Spain (Online).
International Committee on Computational Linguistics
, pp. 6113-6125.
ISBN 9781952148279
Abstract
Now that the performance of coreference resolvers on the simpler forms of anaphoric reference has greatly improved, more attention is devoted to more complex aspects of anaphora. One limitation of virtually all coreference resolution models is the focus on single-antecedent anaphors. Plural anaphors with multiple antecedents-so-called split-antecedent anaphors (as in John met Mary. They went to the movies) have not been widely studied, because they are not annotated in ONTONOTES and are relatively infrequent in other corpora. In this paper, we introduce the first model for unrestricted resolution of split-antecedent anaphors. We start with a strong baseline enhanced by BERT embeddings, and show that we can substantially improve its performance by addressing the sparsity issue. To do this, we experiment with auxiliary corpora where split-antecedent anaphors were annotated by the crowd, and with transfer learning models using element-of bridging references and single-antecedent coreference as auxiliary tasks. Evaluation on the gold annotated ARRAU corpus shows that the out best model uses a combination of three auxiliary corpora achieved F1 scores of 70% and 43.6% when evaluated in a lenient and strict setting, respectively, i.e., 11 and 21 percentage points gain when compared with our baseline.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | © 2020 The Author(s). Available under a Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/). |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 07 Sep 2022 14:51 |
Last Modified: | 07 Sep 2022 14:51 |
Status: | Published |
Publisher: | International Committee on Computational Linguistics |
Refereed: | Yes |
Identification Number: | 10.18653/v1/2020.coling-main.538 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:190603 |