Advancing Spatial Reasoning in Large Language Models: An In-Depth Evaluation and Enhancement Using the StepGame Benchmark.

This is the latest version of this eprint.

Li, F. orcid.org/0000-0002-1109-6285, Hogg, D.C. orcid.org/0000-0002-6125-9564 and Cohn, A.G. orcid.org/0000-0002-7652-8907 (2024) Advancing Spatial Reasoning in Large Language Models: An In-Depth Evaluation and Enhancement Using the StepGame Benchmark. In: Wooldridge, M.J., Dy, J.G. and Natarajan, S. (eds.) Proceedings of the AAAI Conference on Artificial Intelligence. Thirty-Eighth AAAI Conference on Artificial Intelligence, 20-27 Feb 2024, Vancouver, Canada. AAAI, pp. 18500-18507. ISBN 978-1-57735-887-9

Metadata

Item Type: Proceedings Paper
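Authors/Creators:
  • Li, F. (ORCID: 0000-0002-1109-6285)
  • Hogg, D.C. (ORCID: 0000-0002-6125-9564)
  • Cohn, A.G. (ORCID: 0000-0002-7652-8907)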
Editors:
  • Wooldridge, M.J.
  • Dy, J.G.
  • Natarajan, S.
Keywords: NLP: Interpretability, Analysis, and Evaluation of NLP Models; DMKM: Mining of Spatial, Temporal or Spatio-Temporal Data; NLP: (Large) Language Models; NLP: Other; PRS: Model-Based Reasoning; PRS: Optimization of Spatio-temporal Systems
Dates:
  • Published: 24 March 2024
  • Published (online): 24 March 2024
Institution: The University of Leeds
Academic Units: The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) > Artificial Intelligence
Funding Information:
  • Alan Turing Institute (grant number: Not Known)
  • Foreign Commonwealth and Development Office (grant number: Not Known)
Depositing User: Symplectic Publications
Date Deposited: 17 Apr 2024 10:34
Last Modified: 24 Jan 2025 11:25
Status: Published
Publisher: AAAI
Identification Number: 10.1609/aaai.v38i17.29811

A full text copy of this item is not currently available from White Rose Research Online