Navigating prompt complexity for zero-shot classification: a study of large language models in computational social science

Mu, Y., Wu, B.P., Thorne, W. orcid.org/0000-0002-8947-6261 et al. (5 more authors) (2024) Navigating prompt complexity for zero-shot classification: a study of large language models in computational social science. In: Calzolari, N., Kan, M-Y., Hoste, V., Lenci, A., Sakti, S. and Xue, N., (eds.) Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 20-25 May 2024, Torino, Italy. ELRA and ICCL , pp. 12074-12086. ISBN 978-2-493814-10-4

Abstract

Instruction-tuned Large Language Models (LLMs) have exhibited impressive language understanding and the capacity to generate responses that follow specific prompts. However, due to the computational demands associated with training these models, their applications often adopt a zero-shot setting. In this paper, we evaluate the zero-shot performance of two publicly accessible LLMs, ChatGPT and OpenAssistant, in the context of six Computational Social Science classification tasks, while also investigating the effects of various prompting strategies. Our experiments investigate the impact of prompt complexity, including the effect of incorporating label definitions into the prompt; use of synonyms for label names; and the influence of integrating past memories during foundation model training. The findings indicate that in a zero-shot setting, current LLMs are unable to match the performance of smaller, fine-tuned baseline transformer models (such as BERT-large). Additionally, we find that different prompting strategies can significantly affect classification accuracy, with variations in accuracy and F1 scores exceeding 10%.

Metadata

Item Type:	Proceedings Paper
Authors/Creators:	Mu, Y. Wu, B.P. Thorne, W. https://orcid.org/0000-0002-8947-6261 Robinson, A. Aletras, N. https://orcid.org/0000-0003-4285-1965 Scarton, C. https://orcid.org/0000-0002-0103-4072 Bontcheva, K. https://orcid.org/0000-0001-6152-9600 Song, X. https://orcid.org/0000-0002-4188-6974
Editors:	Calzolari, N. Kan, M-Y. Hoste, V. Lenci, A. Sakti, S. Xue, N.
Copyright, Publisher and Additional Information:	© 2024 ELRA Language Resource Association. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-commercial Licence (https://creativecommons.org/licenses/by-nc/4.0/).
Keywords:	Large Language Model; Computational Social Science; Prompt Complexity
Dates:	Published (online): May 2024 Published: May 2024
Institution:	The University of Sheffield
Academic Units:	The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Depositing User:	Symplectic Sheffield
Date Deposited:	13 Feb 2025 14:42
Last Modified:	14 Feb 2025 09:46
Published Version:	https://aclanthology.org/2024.lrec-main.1055/
Status:	Published
Publisher:	ELRA and ICCL
Refereed:	Yes
Related URLs:	Conference
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:223236

Download

Published Version

Filename: 2024.lrec-main.1055.pdf

Licence: CC-BY-NC 4.0

CLICK TO DOWNLOAD

CORE (COnnecting REpositories)

Navigating prompt complexity for zero-shot classification: a study of large language models in computational social science

Abstract

Metadata

Download

Published Version

Export

Statistics