Al-Sulaiti, L and Atwell, ES (2006) The design of a corpus of contemporary Arabic. International Journal of Corpus Linguistics, 11 (2). 135 - 171. ISSN 1384-6655
Abstract
Corpora are an important resource for both teaching and research. Arabic lacks sufficient resources in this field, so a research project has been designed to compile a corpus, which represents the state of the Arabic language at the present time and the needs of end-users. This report presents the result of a survey of the needs of teachers of Arabic as a foreign language (TAFL) and language engineers. The survey shows that a wide range of text types should be included in the corpus. Overall, our survey confirms our view that existing corpora are too narrowly limited in source-type and genre, and that there is a need for a freely-accessible corpus of contemporary Arabic covering a broad range of text-types. We have collected and published an initial version of the Corpus of Contemporary Arabic (CCA) to meet these design issues. The CCA is freely downloadable via WWW from http://www.comp.leeds.ac.uk/arabic.
Metadata
Authors/Creators: |
|
---|---|
Keywords: | Corpus; contemporary; Arabic; Arabic; design; language variation; teaching Arabic as a foreign language (TAFL); Language Engineering |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence & Biological Systems (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 01 Dec 2014 11:30 |
Last Modified: | 04 Nov 2016 06:45 |
Published Version: | http://dx.doi.org/10.1075/ijcl.11.2.02als |
Status: | Published |
Publisher: | John Benjamins Publishing Company |
Identification Number: | https://doi.org/10.1075/ijcl.11.2.02als |