Paetzold, G.H. and Specia, L. orcid.org/0000-0002-5495-3128 (2017) A survey on lexical simplification. Journal of Artificial Intelligence Research, 60. pp. 549-593. ISSN 1076-9757
Abstract
Lexical Simplification is the process of replacing complex words in a given sentence with simpler alternatives of equivalent meaning. This task has wide applicability both as an assistive technology for readers with cognitive impairments or disabilities, such as Dyslexia and Aphasia, and as a pre-processing tool for other Natural Language Processing tasks, such as machine translation and summarisation. The problem is commonly framed as a pipeline of four steps: the identification of complex words, the generation of substitution candidates, the selection of those candidates that fit the context, and the ranking of the selected substitutes according to their simplicity. In this survey we review the literature for each step in this typical Lexical Simplification pipeline and provide a benchmarking of existing approaches for these steps on publicly available datasets. We also provide pointers for datasets and resources available for the task.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2017 AI Access Foundation, Inc. This is an author produced version of a paper subsequently published in Journal of Artificial Intelligence Research. Uploaded in accordance with the publisher's self-archiving policy. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Funding Information: | Funder Grant number EUROPEAN COMMISSION - HORIZON 2020 SIMPATICO - 692819 |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 22 Jun 2018 09:34 |
Last Modified: | 02 Jul 2018 15:34 |
Published Version: | https://jair.org/index.php/jair/article/view/11091 |
Status: | Published |
Publisher: | AI Access Foundation |
Refereed: | Yes |
Identification Number: | 10.1613/jair.5526 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:132086 |