Roller, R.A. and Stevenson, M. (2015) Held-out versus Gold Standard: Comparison of Evaluation Strategies for Distantly Supervised Relation Extraction from Medline abstracts. In: Proceedings of the Sixth International Workshop on Health Text Mining and Information Analysis. , 17 Sep 2015, Lisbon, Portugal. Association for Computational Linguistics , pp. 97-102.
Abstract
Distant supervision is a useful technique for creating relation classifiers in the absence of labelled data. The approaches are often evaluated using a held-out portion of the distantly labelled data, thereby avoiding the need for lablelled data entirely. However, held-out evaluation means that systems are tested against noisy data, making it difficult to determine their true accuracy. This paper examines the effectiveness of using held-out data to evaluate relation extraction systems by comparing the results that are produced with those generated using manually labelled versions of the same data. We train classifiers to detect two UMLS Metathesaurus relations (may-treat and may-prevent) in Medline abstracts. A new evaluation data set for these relations is made available. We show that evaluation against a distantly labelled gold standard tends to overestimate performance and that no direct connection can be found between improved performance against distantly and manually labelled gold standards.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2015 Association for Computational Linguistics. Licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License (https://creativecommons.org/licenses/by-nc-sa/3.0/). Permission is granted to make copies for the purposes of teaching and research. ACL Anthology: http://www.aclweb.org/anthology/index.html |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 29 Jan 2016 15:12 |
Last Modified: | 29 Jan 2016 15:12 |
Published Version: | http://aclweb.org/anthology/W15-2612 |
Status: | Published |
Publisher: | Association for Computational Linguistics |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:90280 |