Dukes, K and Atwell, E (2012) LAMP: a multimodal web platform for collaborative linguistic analysis. In: Chair, NCC, Choukri, K, Declerck, T, an, MUUD, Maegaard, B, Mariani, J, Odijk, J and Piperidis, S, (eds.) Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC’12). LREC 2012, Eighth International Conference on Language Resources and Evaluation, May 21-27, 2012, Istanbul Lüfti Kirdar Convention & Exhibition Centre Istanbul, Turkey. European Language Resources Association (ELRA) , 3268 - 3275. ISBN 978-2-9517408-7-7
Abstract
his paper describes the underlying software platform used to develop and publish annotations for the Quranic Arabic Corpus (QAC). The QAC (Dukes, Atwell and Habash, 2011) is a multimodal language resource that integrates deep tagging, interlinear translation, multiple speech recordings, visualization and collaborative analysis for the Classical Arabic language of the Quran. Available online at http://corpus.quran.com, the website is a popular study guide for Quranic Arabic, used by over 1.2 million visitors over the past year. We provide a description of the underlying software system that has been used to develop the corpus annotations. The multimodal data is made available online through an accessible cross-referenced web interface. Although our Linguistic Analysis Multimodal Platform (LAMP), has been applied to the Classical Arabic language of the Quran, we argue that our annotation model and software architecture may be of interest to other related corpus linguistics projects. Work related to LAMP includes recent efforts for annotating other Classical languages, such as Ancient Greek and Latin (Bamman, Mambrini and Crane, 2009), as well as commercial systems (e.g. Logos Bible study) that provide access to syntactic tagging for the Hebrew Bible and Greek New Testament (Brannan, 2011).
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | (c) 2012, European Language Resources Association (ELRA). Reproduced with permission from the publisher. |
Keywords: | Arabic Corpus; treebank; Quran; collaborative annotation |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence & Biological Systems (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 26 Nov 2014 12:27 |
Last Modified: | 17 Jan 2018 08:00 |
Published Version: | http://www.lrec-conf.org/proceedings/lrec2012/inde... |
Status: | Published |
Publisher: | European Language Resources Association (ELRA) |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:81365 |