Alghamdi, A and Atwell, E orcid.org/0000-0001-9395-3764 (2019) Constructing a corpus-informed list of Arabic formulaic sequences (ArFSs) for language pedagogy and technology. International Journal of Corpus Linguistics, 24 (2). pp. 202-228. ISSN 1384-6655
Abstract
This study aims to construct a corpus-informed list of Arabic Formulaic Sequences (ArFSs) for use in language pedagogy (LP) and Natural Language Processing (NLP) applications. A hybrid mixed methods model was adopted for extracting ArFSs from a corpus, that combined automatic and manual extracting methods, based on well-established quantitative and qualitative criteria that are relevant from the perspective of LP and NLP. The pedagogical implications of this list are examined to facilitate the inclusion of ArFSs in the process of learning and teaching Arabic, particularly for non-native speakers. The computational implications of the ArFSs list are related to the key role of the ArFSs as a novel language resource in the improvement of various Arabic NLP tasks.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2019 John Benjamins Publishing Company. This is an author produced version of a paper published in International Journal of Corpus Linguistics. Please contact the publisher (John Benjamins) for permission to re-use or reprint this material in any form. Uploaded in accordance with the publisher's self-archiving policy. |
Keywords: | Arabic formulaic sequences; language pedagogy; lexical resources; mixed methods and multi-word expressions |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 04 Apr 2019 16:02 |
Last Modified: | 14 Aug 2019 05:22 |
Status: | Published |
Publisher: | John Benjamins Publishing |
Identification Number: | 10.1075/ijcl.16088.alg |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:144498 |