Ramisch, C., Cordeiro, S. and Villavicencio, A. orcid.org/0000-0002-3731-9168 (2016) Filtering and measuring the intrinsic quality of human compositionality judgments. In: Kordoni, V., Cholakov, K., Egg, M., Markantonat, S. and Nakov, P., (eds.) Proceedings of the 12th Workshop on Multiword Expressions. 12th Workshop on Multiword Expressions (MWE’2016), 11 Aug 2016, Berlin, Germany. Association for Computational Linguistics ISBN 9781945626067
Abstract
This paper analyzes datasets with numerical scores that quantify the semantic compositionality of MWEs. We present the results of our analysis of crowdsourced compositionality judgments for noun compounds in three languages. Our goals are to look at the characteristics of the annotations in different languages; to examine intrinsic quality measures for such data; and to measure the impact of filters proposed in the literature on these measures. The cross-lingual results suggest that greater agreement is found for the extremes in the compositionality scale, and that outlier annotation removal is more effective than outlier annotator removal.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | © 2016 Association for Computational Linguistics |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 21 Nov 2019 12:25 |
Last Modified: | 21 Nov 2019 12:25 |
Status: | Published |
Publisher: | Association for Computational Linguistics |
Refereed: | Yes |
Identification Number: | 10.18653/v1/w16-1804 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:153567 |