Huang, X.A., La Malfa, E., Marro, S. et al. (3 more authors) (2024) A Notion of Complexity for Theory of Mind via Discrete World Models. In: Al-Onaizan, Y., Bansal, M. and Chen, Y.-N., (eds.) Findings of the Association for Computational Linguistics: EMNLP 2024. EMNLP 2024: Conference on Empirical Methods in Natural Language Processing, 12-16 Nov 2024, Miami, USA. Association for Computational Linguistics (ACL), pp. 2964-2983. ISBN: 979-8-89176-168-1.
Abstract
Theory of Mind (ToM) can be used to assess the capabilities of Large Language Models (LLMs) in complex scenarios where social reasoning is required. While the research community has proposed many ToM benchmarks, their hardness varies greatly, and their complexity is not well defined. This work proposes a framework inspired by cognitive load theory to measure the complexity of ToM tasks. We quantify a problem's complexity as the number of states necessary to solve it correctly. Our complexity measure also accounts for spurious states of a ToM problem designed to make it apparently harder. We use our method to assess the complexity of five widely adopted ToM benchmarks. On top of this framework, we design a prompting technique that augments the information available to a model with a description of how the environment changes with the agents' interactions. We name this technique Discrete World Models (DWM) and show how it elicits superior performance on ToM tasks.
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: |
|
| Editors: |
|
| Copyright, Publisher and Additional Information: | © 1963–2026 ACL. This is an open access conference paper under the terms of the Creative Commons Attribution License (CC-BY-NC-SA 3.0), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. |
| Dates: |
|
| Institution: | The University of Leeds |
| Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
| Date Deposited: | 16 Feb 2026 16:00 |
| Last Modified: | 16 Feb 2026 16:00 |
| Status: | Published |
| Publisher: | Association for Computational Linguistics (ACL) |
| Identification Number: | 10.18653/v1/2024.findings-emnlp.167 |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:237884 |

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)