Alosaimy, AMS and Atwell, E orcid.org/0000-0001-9395-3764 (Accepted: 2016) Ensemble Morphosyntactic Analyser for Classical Arabic. In: Proceedings. 2nd International Conference on Arabic Computational Linguistics, 03-09 Apr 2016, Konya, Turkey. (Unpublished)
Abstract
At least seven morphological analysers (MAs) are available for Modern Standard Arabic (MSA) text, and several part-of-speech (POS) taggers use them to improve accuracy. However, choosing between these analysers is challenging, and none is designed for Classical Arabic. Several morphological analysers have been studied and combined so that they can be evaluated on common ground. The goal of our language resource is to build a freely accessible multi-component toolkit (named SAWAREF) for part-of-speech taggers and morphological analysers that can provide a comparative evaluation, standardise the outputs of each component, combine different solutions, and analyse and vote for the best candidates. We illustrate the use of SAWAREF in tagging adjectives and show that the accuracy of adjective tagging is still very low. This paper describes the research method and design, and discusses the key issues and obstacles.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: | Alosaimy, AMS; Atwell, E (orcid.org/0000-0001-9395-3764) |
Keywords: | ensemble; Morphosyntactic Analyser; POS tagger; arabic; combine; classical arabic |
Dates: | Accepted: 2016 |
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 12 Aug 2016 10:55 |
Last Modified: | 28 Feb 2024 13:21 |
Status: | Unpublished |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:99602 |