Madusanka, T., Valentino, M., Zahid, I. et al. (2 more authors) (2025) Unravelling the logic: Investigating the generalisation of Transformers in numerical satisfiability problems. In: Che, W., Nabende, J., Shutova, E. and Pilehvar, M.T., (eds.) Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 27 Jul - 01 Aug 2025, Vienna, Austria. Association for Computational Linguistics, pp. 25155-25168. ISBN: 9798891762510.
Abstract
Transformer models have achieved remarkable performance in many formal reasoning tasks. Nonetheless, the extent of their comprehension pertaining to logical semantics and rules of inference remains somewhat uncertain. Evaluating such understanding necessitates a rigorous examination of these models’ generalisation capacity to out-of-distribution data. In this study, we probe the generalisation prowess of Transformer models with respect to the hitherto unexplored domain of numerical satisfiability problems. Our investigation reveals that Transformers exhibit minimal scale and noise invariance, alongside limited vocabulary and number invariance. However, even when Transformer models experience a notable decline in performance on out-of-distribution test sets, they often still surpass the random baseline by a considerable margin.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | © 2025 Association for Computational Linguistics. Licensed on a Creative Commons Attribution 4.0 International License. (https://creativecommons.org/licenses/by/4.0/) |
Keywords: | Philosophy; Information and Computing Sciences; Artificial Intelligence; Philosophy and Religious Studies |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 05 Sep 2025 11:16 |
Last Modified: | 05 Sep 2025 11:16 |
Status: | Published |
Publisher: | Association for Computational Linguistics |
Refereed: | Yes |
Identification Number: | 10.18653/v1/2025.acl-long.1223 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:231202 |