Alahmari, S. orcid.org/0009-0002-6490-3295, Atwell, E., Saadany, H. et al. (1 more author) (2025) Arabic-Centric Large Language Models for Dialectal Arabic Sentiment Analysis Task. In: Alharbi, M. I., Chafik, S., Ezzini, S., Mitkov, R., Ranasinghe, T. and Hettiarachchi, H., (eds.) Proceedings of the Shared Task on Sentiment Analysis for Arabic Dialects. Sentiment Analysis on Arabic Dialects in the Hospitality Domain: A Multi-Dialect Benchmark Shared Task, 11-13 Sep 2025, Varna, Bulgaria. ACL Anthology, pp. 69-75. ISBN: 978-954-452-109-7.
Abstract
This paper presents a study on sentiment analysis of Dialectal Arabic (DA), with a particular focus on Saudi and Moroccan (Darija) dialects within the hospitality domain. We introduce a novel dataset comprising 698 Saudi Arabian proverbs annotated with sentiment polarity labels—Positive, Negative, and Neutral—collected from five major Saudi dialect regions: Najdi, Hijazi, Shamali, Janoubi, and Sharqawi. In addition to this, we used customer reviews for fine-tuning the CAMeLBERT-DASA model, which achieved a 75% F1 score in sentiment classification. To further evaluate the robustness of Arabic-centric models, we assessed the performance of three open-source large language models—Allam, ACeGPT, and Jais—in a zero-shot setting using the Ahasis shared task test set. Our results highlight the effectiveness of domain-specific fine-tuning in improving sentiment analysis performance and demonstrate the potential of Arabic-centric LLMs in zero-shot scenarios. This work contributes new linguistic resources and empirical insights to support ongoing research in sentiment analysis for Arabic dialects.
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: |
|
| Editors: |
|
| Dates: |
|
| Institution: | The University of Leeds |
| Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
| Date Deposited: | 25 Nov 2025 13:48 |
| Last Modified: | 25 Nov 2025 13:55 |
| Published Version: | https://acl-bg.org/proceedings/2025/AHaSIS-ST%2020... |
| Status: | Published |
| Publisher: | ACL Anthology |
| Identification Number: | 10.26615/978-954-452-109-7-011 |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:234850 |

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)