This is the latest version of this eprint.
Yamaguchi, A. orcid.org/0000-0001-8327-7598, Morishita, T., Villavicencio, A. et al. (1 more author) (Accepted: 2026) Mitigating catastrophic forgetting in target language adaptation of LLMs via Source-Shielded Updates. In: Proceedings of 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026). 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), 02-07 Jul 2026, San Diego, California. . Association for Computational Linguistics (ACL). (In Press)
Abstract
Expanding the linguistic diversity of instruct large language models (LLMs) is crucial for global accessibility but is often hindered by the reliance on costly specialized target language labeled data and catastrophic forgetting during adaptation. We tackle this challenge under a realistic, low-resource constraint: adapting instruct LLMs using only unlabeled target language data. We introduce Source-Shielded Updates (SSU), a selective parameter update strategy that proactively preserves source knowledge. Using a small set of source data and a parameter importance scoring method, SSU identifies parameters critical to maintaining source abilities. It then applies a column-wise freezing strategy to protect these parameters before adaptation. Experiments across five typologically diverse languages and 7B and 13B models demonstrate that SSU successfully mitigates catastrophic forgetting. It reduces performance degradation on monolingual source tasks to just 3.4% (7B) and 2.8% (13B) on average, a stark contrast to the 20.3% and 22.3% from full fine-tuning. SSU also achieves target-language performance highly competitive with full fine-tuning, outperforming it on all benchmarks for 7B models and the majority for 13B models.
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: |
|
| Copyright, Publisher and Additional Information: | © 2026 Association for Computational Linguistics. |
| Dates: |
|
| Institution: | The University of Sheffield |
| Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
| Funding Information: | Funder Grant number Engineering and Physical Sciences Research Council 2894795 |
| Date Deposited: | 06 May 2026 13:53 |
| Last Modified: | 06 May 2026 13:53 |
| Status: | In Press |
| Publisher: | Association for Computational Linguistics (ACL) |
| Refereed: | Yes |
| Related URLs: | |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:240803 |
Available Versions of this Item
-
Mitigating catastrophic forgetting in target language adaptation of LLMs via Source-Shielded Updates. (deposited 06 May 2026 13:43)
- Mitigating catastrophic forgetting in target language adaptation of LLMs via Source-Shielded Updates. (deposited 06 May 2026 13:53) [Currently Displayed]
Download
Filename: ssu.pdf

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)