Alshammeri, M orcid.org/0000-0002-4645-3991, Atwell, E orcid.org/0000-0001-9395-3764 and Alsalka, MA orcid.org/0000-0003-3335-1918 (2021) Classifying Verses of the Quran using Doc2vec. In: ACL Anthology. The 18th International Conference on Natural Language Processing (ICON2021), 16-19 Dec 2021, Online. NLP Association of India (NLPAI) , National Institute of Technology Silchar, Silchar, India , pp. 284-288.
Abstract
The Quran, as a significant religious text, bears important spiritual and linguistic values. Understanding the text and inferring the underlying meanings entails semantic similarity analysis. We classified the verses of the Quran into 15 pre-defined categories or concepts, based on the Qurany corpus, using Doc2Vec and Logistic Regression. Our classifier scored 70% accuracy, and 60% F1-score using the distributed bag-of-words architecture. We then measured how similar the documents within the same category are to each other semantically and use this information to evaluate our model. We calculated the mean difference and average similarity values for each category to indicate how well our model describes that category.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | ACL materials are Copyright © 1963–2024 ACL; other materials are copyrighted by their respective copyright holders. Materials prior to 2016 here are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License. Permission is granted to make copies for the purposes of teaching and research. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License. |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 04 Jan 2022 14:29 |
Last Modified: | 14 Feb 2024 15:13 |
Published Version: | https://aclanthology.org/2021.icon-main.34/ |
Status: | Published |
Publisher: | NLP Association of India (NLPAI) |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:181764 |
Download
Filename: Classifying Verses of the Quran using Doc2vec.pdf
Licence: CC-BY-NC-SA 3.0