Gao, R., Wu, X., Kuribayashi, T. et al. (6 more authors) (2025) Can LLMs simulate L2-English dialogue? An information-theoretic analysis of L1-dependent biases. In: Che, W., Nabende, J., Shutova, E. and Pilehvar, M.T., (eds.) Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics. 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), 27 Jul - 01 Aug 2025, Vienna, Austria. Association for Computational Linguistics, pp. 4355-4379. ISSN: 0736-587X. EISSN: UNSPECIFIED.
Abstract
This study evaluates Large Language Models' (LLMs) ability to simulate non-native English use as observed in human second language (L2) learners interfered with by their native first language (L1). In dialogue-based interviews, we prompt LLMs to mimic L2 English learners with specific L1s (e.g., Japanese, Thai, Urdu) across seven languages, comparing their outputs to real L2 learner data. Our analysis examines L1-driven linguistic biases, such as reference word usage and avoidance behaviors, using information-theoretic and distributional density measures. Results show that modern LLMs (e.g., Qwen2.5, LLAMA3, DeepseekV3, GPT-4o) replicate L1-dependent patterns observed in human L2 data, with distinct influences from various languages (e.g., Japanese, Korean, and Mandarin significantly affect tense agreement, while Urdu influences noun-verb collocations). Our results reveal LLMs' potential for L2 dialogue simulation and evaluation for future educational applications.
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: |
|
| Editors: |
|
| Copyright, Publisher and Additional Information: | © ACL 2025. This is an Open Access paper distributed under the terms of the Creative Commons Attribution Licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
| Dates: |
|
| Institution: | The University of Sheffield |
| Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
| Date Deposited: | 19 Nov 2025 09:49 |
| Last Modified: | 20 Nov 2025 09:20 |
| Status: | Published |
| Publisher: | Association for Computational Linguistics |
| Refereed: | Yes |
| Identification Number: | 10.18653/v1/2025.acl-long.219 |
| Related URLs: | |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:234662 |
Download
Filename: 2025.acl-long.219.pdf
Licence: CC-BY 4.0

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)