Tarmom, T orcid.org/0000-0002-2834-461X, Atwell, E and Alsalka, M (2022) Deep Learning vs Compression-Based vs Traditional Machine Learning Classifiers to Detect Hadith Authenticity. In: Lossio-Ventura, JA, Valverde-Rebaza, J, Díaz, E, Muñante, D, Gavidia-Calderon, C, Valejo, ADB and Alatrista-Salas, H, (eds.) Information Management and Big Data. 8th International Conference on Information Management and Big Data, SIMBig 2021, 01-03 Dec 2021, Online. Communications in Computer and Information Science, 1577 . Springer , pp. 206-222. ISBN 978-3-031-04446-5
Abstract
Due to the increasing numbers of Hadith forgeries, it has become necessary to use artificial intelligence to assist those looking for authentic Hadiths. This paper presents detailed research on ways to automatically detect Hadith authenticity in Arabic Hadith texts. It examines the utilization of deep learning-based and prediction by partial matching (PPM) compression-based classifiers, which have not been previously used in detecting Hadith authenticity. The proposed methods were compared with the most recent method used which is machine learning. In addition, there is a detailed description of the new Arabic Hadith corpus (non-authentic Hadith corpus) created for this study and the authors’ experiments, which also used the Leeds University and King Saud University (LK) Hadith corpus. The experiments demonstrate that the authentication based on Isnad obtained accuracy ranging from 84% to 93%. The authentication based on Matan obtained an accuracy range of 55% to 93%, while the accuracy range for this experiment was from 55% to 85%, which means that Isnad is the most effective part of Hadith for automatically detecting authenticity. Moreover, the experiment proved that Matan can be used to judge Hadith authenticity with an accuracy of 85%. The study also showed that PPM and deep learning classifiers are effective means of automatically detecting authentic Hadith.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | © 2022 The Author(s). This is an author produced version of a conference paper published in Information Management and Big Data. Uploaded in accordance with the publisher's self-archiving policy. |
Keywords: | Hadith authenticity; Hadith corpus; Deep learning; Arabic natural language processing |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 11 Apr 2022 12:21 |
Last Modified: | 14 Nov 2023 20:38 |
Status: | Published |
Publisher: | Springer |
Series Name: | Communications in Computer and Information Science |
Identification Number: | 10.1007/978-3-031-04447-2_14 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:185583 |