Fomicheva, M., Specia, L. orcid.org/0000-0002-5495-3128 and Guzmán, F. (2020) Multi-hypothesis machine translation evaluation. In: Jurafsky, D., Chai, J., Schluter, N. and Tetreault, J., (eds.) Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 58th Annual Meeting of the Association for Computational Linguistics, 05-10 Jul 2020, Seattle, Washington. Association for Computational Linguistics, pp. 1218-1232. ISBN 9781952148255
Abstract
Reliably evaluating Machine Translation (MT) through automated metrics is a long-standing problem. One of the main challenges is that multiple outputs can be equally valid. Attempts to minimise this issue include metrics that relax the matching of MT output and reference strings, and the use of multiple references. The latter has been shown to significantly improve the performance of evaluation metrics. However, collecting multiple references is expensive, and in practice a single reference is generally used. In this paper, we propose an alternative approach: instead of modelling linguistic variation in human references, we exploit the MT model's uncertainty to generate multiple diverse translations and use these (i) as surrogates for reference translations; (ii) to quantify translation variability, which can complement existing metric scores; or (iii) to replace references altogether. We show that, for a number of popular evaluation metrics, our variability estimates lead to substantial improvements in correlation with human judgements of quality, by up to 15%.
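The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the unigram F1 function below is a toy stand-in for a real evaluation metric, the function names are hypothetical, and the hypotheses are hand-written placeholders. How the alternative translations are actually generated from the model's uncertainty is outside the scope of this sketch.

```python
from collections import Counter
from itertools import combinations


def token_f1(candidate: str, reference: str) -> float:
    """Unigram F1 overlap; a toy stand-in for a real MT metric (assumption)."""
    cand, ref = candidate.split(), reference.split()
    if not cand or not ref:
        return 0.0
    overlap = sum((Counter(cand) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision, recall = overlap / len(cand), overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def surrogate_reference_score(mt_output: str, hypotheses: list[str]) -> float:
    """Average similarity of the MT output to each alternative hypothesis,
    treating the hypotheses as stand-ins for human references."""
    if not hypotheses:
        raise ValueError("need at least one hypothesis")
    return sum(token_f1(mt_output, h) for h in hypotheses) / len(hypotheses)


def hypothesis_agreement(hypotheses: list[str]) -> float:
    """Mean pairwise similarity among hypotheses.
    Lower agreement means higher translation variability."""
    pairs = list(combinations(hypotheses, 2))
    if not pairs:
        raise ValueError("need at least two hypotheses")
    return sum(token_f1(a, b) for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    # Placeholder strings; in practice these would come from the MT system.
    mt_output = "the cat sat on the mat"
    hypotheses = [
        "the cat sat on the mat",
        "a cat sat on the mat",
        "the cat is sitting on the mat",
    ]
    print("surrogate-reference score:", surrogate_reference_score(mt_output, hypotheses))
    print("hypothesis agreement:", hypothesis_agreement(hypotheses))
```

Neither score needs a human reference, which corresponds to use (iii) in the abstract. Averaging a variability score with a conventional reference-based metric would be one simple way to approximate use (ii).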
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: | Fomicheva, M., Specia, L. and Guzmán, F. |
Editors: | Jurafsky, D., Chai, J., Schluter, N. and Tetreault, J. |
Copyright, Publisher and Additional Information: | © 2020 Association for Computational Linguistics. Published under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/). |
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 02 Nov 2020 09:34 |
Last Modified: | 02 Nov 2020 09:34 |
Published Version: | https://www.aclweb.org/anthology/2020.acl-main.113... |
Status: | Published |
Publisher: | Association for Computational Linguistics |
Refereed: | Yes |
Identification Number: | 10.18653/v1/2020.acl-main.113 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:167472 |