Detecting misinformation with LLM-predicted credibility signals and weak supervision.

This is a preprint and may not have undergone formal peer review

Abstract

Credibility signals represent a wide range of heuristics that are typically used by journalists and fact-checkers to assess the veracity of online content. Automating the task of credibility signal extraction, however, is very challenging as it requires high-accuracy signal-specific extractors to be trained, while there are currently no sufficiently large datasets annotated with all credibility signals. This paper investigates whether large language models (LLMs) can be prompted effectively with a set of 18 credibility signals to produce weak labels for each signal. We then aggregate these potentially noisy labels using weak supervision in order to predict content veracity. We demonstrate that our approach, which combines zero-shot LLM credibility signal labeling and weak supervision, outperforms state-of-the-art classifiers on two misinformation datasets without using any ground-truth labels for training. We also analyse the contribution of the individual credibility signals towards predicting content veracity, which provides new valuable insights into their role in misinformation detection.

Metadata

Item Type:	Preprint
Authors/Creators:	Leite, J.A. Razuvayevskaya, O. Bontcheva, K. https://orcid.org/0000-0001-6152-9600 Scarton, C. https://orcid.org/0000-0002-0103-4072
Copyright, Publisher and Additional Information:	© 2023 The Author(s). For reuse permissions, please contact the Author(s).
Dates:	Submitted: 14 September 2023
Institution:	The University of Sheffield
Academic Units:	The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Depositing User:	Symplectic Sheffield
Date Deposited:	14 Feb 2025 11:59
Last Modified:	14 Feb 2025 11:59
Published Version:	https://arxiv.org/abs/2309.07601v1
Status:	Submitted
Identification Number:	10.48550/arXiv.2309.07601
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:223248

CORE (COnnecting REpositories)

Detecting misinformation with LLM-predicted credibility signals and weak supervision.

Abstract

Metadata

Download

Preprint

Export

Statistics