Mu, Y., Song, X. orcid.org/0000-0002-4188-6974, Bontcheva, K. orcid.org/0000-0001-6152-9600 et al. (1 more author) (2024) Examining the limitations of computational rumor detection models trained on static datasets. In: Calzolari, N., Kan, M-Y., Hoste, V., Lenci, A., Sakti, S. and Xue, N., (eds.) 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 20-25 May 2024, Torino, Italy. ELRA and ICCL , pp. 6739-6751. ISBN 978-2-493814-10-4
Abstract
A crucial aspect of a rumor detection model is its ability to generalize, particularly its ability to detect emerging, previously unknown rumors. Past research has indicated that content-based (i.e., using solely source post as input) rumor detection models tend to perform less effectively on unseen rumors. At the same time, the potential of context-based models remains largely untapped. The main contribution of this paper is in the in-depth evaluation of the performance gap between content and context-based models specifically on detecting new, unseen rumors. Our empirical findings demonstrate that context-based models are still overly dependent on the information derived from the rumors' source post and tend to overlook the significant role that contextual information can play. We also study the effect of data split strategies on classifier performance. Based on our experimental results, the paper also offers practical suggestions on how to minimize the effects of temporal concept drift in static datasets during the training of rumor detection methods.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | © 2024 ELRA Language Resource Association. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-commercial Licence (https://creativecommons.org/licenses/by-nc/4.0/). |
Keywords: | Rumor Detection; Computational Social Science; Computational Misinformation Analysis |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 13 Feb 2025 15:25 |
Last Modified: | 14 Feb 2025 09:42 |
Published Version: | https://aclanthology.org/2024.lrec-main.595/ |
Status: | Published |
Publisher: | ELRA and ICCL |
Refereed: | Yes |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:223239 |