Villavicencio, A. orcid.org/0000-0002-3731-9168 and Idiart, M. (2019) Discovering multiword expressions. Natural Language Engineering, 25 (6). pp. 715-733. ISSN 1351-3249
Abstract
In this paper, we provide an overview of research on multiword expressions (MWEs), from a natural lan- guage processing perspective. We examine methods developed for modelling MWEs that capture some of their linguistic properties, discussing their use for MWE discovery and for idiomaticity detection. We con- centrate on their collocational and contextual preferences, along with their fixedness in terms of canonical forms and their lack of word-for-word translatatibility. We also discuss a sample of the MWE resources that have been used in intrinsic evaluation setups for these methods.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2019 Cambridge University Press. This is an author-produced version of a paper subsequently published in Natural Language Engineering. Article available under the terms of the CC-BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
Keywords: | Multiword expressions; Association measures; Compositionality; Idiomaticity |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 18 Nov 2019 10:22 |
Last Modified: | 01 Jul 2020 10:56 |
Status: | Published |
Publisher: | Cambridge University Press (CUP) |
Refereed: | Yes |
Identification Number: | 10.1017/S1351324919000494 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:153553 |