Peng, X. orcid.org/0000-0001-5787-9982, Lin, C. orcid.org/0000-0003-3454-2468, Stevenson, M. orcid.org/0000-0002-9483-6006 et al. (1 more author) (Submitted: 2020) Revisiting the linearity in cross-lingual embedding mappings : from a perspective of word analogies. arXiv. (Submitted)
Abstract
Most cross-lingual embedding mapping algorithms assume the optimised transformation functions to be linear. Recent studies showed that on some occasions, learning a linear mapping does not work, indicating that the commonly-used assumption may fail. However, it still remains unclear under which conditions the linearity of cross-lingual embedding mappings holds. In this paper, we rigorously explain that the linearity assumption relies on the consistency of analogical relations encoded by multilingual embeddings. We did extensive experiments to validate this claim. Empirical results based on the analogy completion benchmark and the BLI task demonstrate a strong correlation between whether mappings capture analogical information and are linear.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2020 The Author(s). For reuse permissions, please contact the Author(s). |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 12 Aug 2021 13:43 |
Last Modified: | 12 Aug 2021 16:31 |
Published Version: | https://arxiv.org/abs/2004.01079v1 |
Status: | Submitted |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:177033 |