Brierley, C, Sawalha, M and Atwell, E orcid.org/0000-0001-9395-3764 (2014) Tools for Arabic Natural Language Processing: a case study in qalqalah prosody. In: 9th International Conference on Language Resources and Evaluation. LREC 2014, 26-31 May 2014, Reykjavik, Iceland. European Language Resources Association , pp. 283-287. ISBN 978-2-9517408-8-4
Abstract
In this paper, we focus on the prosodic effect of qalqalah or "vibration" applied to a subset of Arabic consonants under certain constraints during correct Qur'anic recitation or taǧwīd, using our Boundary-Annotated Qur’an dataset of 77430 words (Brierley et al 2012; Sawalha et al 2014). These qalqalah events are rule-governed and are signified orthographically in the Arabic script. Hence they can be given abstract definition in the form of regular expressions and thus located and collected automatically. High frequency qalqalah content words are also found to be statistically significant discriminators or keywords when comparing Meccan and Medinan chapters in the Qur'an using a state-of-the-art Visual Analytics toolkit: Semantic Pathways. Thus we hypothesise that qalqalah prosody is one way of highlighting salient items in the text. Finally, we implement Arabic transcription technology (Brierley et al under review; Sawalha et al forthcoming) to create a qalqalah pronunciation guide where each word is transcribed phonetically in IPA and mapped to its chapter-verse ID. This is funded research under the EPSRC "Working Together" theme.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © European Language Resources Association. The LREC 2014 Proceedings are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. |
Keywords: | Qur'anic recitation; qalqalah prosody; regular expressions |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence & Biological Systems (Leeds) |
Funding Information: | Funder Grant number EPSRC EP/K015206/1 |
Depositing User: | Symplectic Publications |
Date Deposited: | 01 Aug 2016 13:33 |
Last Modified: | 01 Aug 2016 13:33 |
Published Version: | http://www.lrec-conf.org/proceedings/lrec2014/inde... |
Status: | Published |
Publisher: | European Language Resources Association |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:100843 |