A rapid evidence review of evaluation techniques for large language models in legal use cases: trends, gaps, and recommendations for future research

Kelsall, J., Tan, X., Bergin, A. et al. (7 more authors) (2025) A rapid evidence review of evaluation techniques for large language models in legal use cases: trends, gaps, and recommendations for future research. AI & Society.

Abstract

Metadata

Item Type: Article
Authors/Creators:
Copyright, Publisher and Additional Information:

© 2025 The Authors. This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Keywords: AI and Law; Legal AI; AI benchmarking; AI Review; AI Metrics; Evaluation
Dates:
  • Accepted: 10 November 2025
  • Published (online): 21 November 2025
  • Published: 21 November 2025
Institution: The University of Sheffield
Academic Units: The University of Sheffield > Faculty of Arts and Humanities (Sheffield) > School of Law
Funding Information:
Funder
Grant number
RESPONSIBLE AI UK
EP/Y009800/1
RESPONSIBLE AI UK / RAI UK
UNSPECIFIED
Date Deposited: 24 Nov 2025 12:05
Last Modified: 24 Nov 2025 12:05
Published Version: https://doi.org/10.1007/s00146-025-02741-9
Status: Published online
Publisher: Springer Verlag
Refereed: Yes
Identification Number: 10.1007/s00146-025-02741-9
Related URLs:
Open Archives Initiative ID (OAI ID):

Export

Statistics