Alfaifi, A and Atwell, ES orcid.org/0000-0001-9395-3764 (2016) Comparative evaluation of tools for Arabic corpora search and analysis. International Journal of Speech Technology, 19 (2). pp. 347-357. ISSN 1381-2416
Abstract
As the number of Arabic corpora continues to grow, so does the need for concordancing software for corpus search and analysis that supports as many features of the Arabic language as possible and offers users a wider range of functions. This paper evaluates six existing corpus search and analysis tools against eight criteria that appear most essential for searching and analysing Arabic corpora, such as displaying Arabic text in its right-to-left direction, normalising diacritics and Hamza, and providing an Arabic user interface. The evaluation shows that three tools (Khawas, Sketch Engine, and aConCorde) meet most of the criteria and achieve the highest benchmark scores. The paper concludes that the developers’ deliberate consideration of the linguistic features of Arabic when designing these three tools was the most significant factor behind their superiority.
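To illustrate one of the criteria named in the abstract, normalising diacritics and Hamza, the following is a minimal Python sketch of how a search tool might fold spelling variants before matching. It is not taken from the paper or from any of the evaluated tools. The function names `normalise` and `find_matches` are hypothetical, and the choice to fold only alef with hamza above or below onto bare alef is an assumption; real tools may normalise more or fewer forms.

```python
import re
import unicodedata

# Tanwin, short vowels, shadda and sukun (U+064B to U+0652), plus superscript alef (U+0670).
DIACRITICS = re.compile("[\u064B-\u0652\u0670]")

# Assumption for this sketch: fold alef with hamza above (U+0623) and
# alef with hamza below (U+0625) onto bare alef (U+0627).
HAMZA_FOLD = str.maketrans({"\u0623": "\u0627", "\u0625": "\u0627"})


def normalise(text):
    """Return text with diacritics stripped and alef-hamza forms folded to bare alef."""
    composed = unicodedata.normalize("NFC", text)
    return DIACRITICS.sub("", composed).translate(HAMZA_FOLD)


def find_matches(query, tokens):
    """Return the original tokens that equal the query once both sides are normalised."""
    target = normalise(query)
    return [tok for tok in tokens if normalise(tok) == target]
```

With this kind of folding, a query typed without diacritics or Hamza can still retrieve a fully vocalised form in the corpus, which is the behaviour the criterion is concerned with.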
Metadata
Item Type: | Article |
---|---|
Authors/Creators: | Alfaifi, A; Atwell, ES (orcid.org/0000-0001-9395-3764) |
Copyright, Publisher and Additional Information: | © 2016, Springer. This is an author produced version of a paper published in International Journal of Speech Technology. Uploaded in accordance with the publisher's self-archiving policy. The final publication is available at Springer via http://dx.doi.org/10.1007/s10772-015-9285-5 |
Keywords: | Arabic, Tool, Corpus, Evaluation, Analysis |
Dates: |
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence & Biological Systems (Leeds) |
Funding Information: | Funder: EPSRC; Grant number: EP/K015206/1 |
Depositing User: | Symplectic Publications |
Date Deposited: | 21 Jun 2016 11:45 |
Last Modified: | 12 Apr 2017 16:03 |
Published Version: | http://dx.doi.org/10.1007/s10772-015-9285-5 |
Status: | Published |
Publisher: | Springer Verlag |
Identification Number: | 10.1007/s10772-015-9285-5 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:101162 |