Leite, J.A., Razuvayevskaya, O., Bontcheva, K. orcid.org/0000-0001-6152-9600 et al. (1 more author) (Submitted: 2023) Detecting misinformation with LLM-predicted credibility signals and weak supervision. [Preprint - arXiv] (Submitted)
Abstract
Credibility signals represent a wide range of heuristics that are typically used by journalists and fact-checkers to assess the veracity of online content. Automating the task of credibility signal extraction, however, is very challenging as it requires high-accuracy signal-specific extractors to be trained, while there are currently no sufficiently large datasets annotated with all credibility signals. This paper investigates whether large language models (LLMs) can be prompted effectively with a set of 18 credibility signals to produce weak labels for each signal. We then aggregate these potentially noisy labels using weak supervision in order to predict content veracity. We demonstrate that our approach, which combines zero-shot LLM credibility signal labeling and weak supervision, outperforms state-of-the-art classifiers on two misinformation datasets without using any ground-truth labels for training. We also analyse the contribution of the individual credibility signals towards predicting content veracity, which provides new valuable insights into their role in misinformation detection.
Metadata
Item Type: | Preprint |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2023 The Author(s). For reuse permissions, please contact the Author(s). |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 14 Feb 2025 11:59 |
Last Modified: | 14 Feb 2025 11:59 |
Published Version: | https://arxiv.org/abs/2309.07601v1 |
Status: | Submitted |
Identification Number: | 10.48550/arXiv.2309.07601 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:223248 |