Shah, K., Ng, R.W.M., Bougares, F. et al. (1 more author) (2015) Investigating continuous space language models for machine translation quality estimation. In: Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing. EMNLP 2015, September 17–21 2015, Lisbon, Portugal. ACL , pp. 1073-1078. ISBN 9781941643327
Abstract
We present novel features designed with a deep neural network for Machine Translation (MT) Quality Estimation (QE). The features are learned with a Continuous Space Language Model to estimate the probabilities of the source and target segments. These new features, along with standard MT system-independent features, are benchmarked on a series of datasets with various quality labels, including postediting effort, human translation edit rate, post-editing time and METEOR. Results show significant improvements in prediction over the baseline, as well as over systems trained on state of the art feature sets for all datasets. More notably, the addition of the newly proposed features improves over the best QE systems in WMT12 and WMT14 by a significant margin.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2015 Association for Computational Linguistics. This article is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License (http://creativecommons.org/licenses/by-nc-sa/3.0/) |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 23 May 2016 13:54 |
Last Modified: | 23 May 2016 13:54 |
Status: | Published |
Publisher: | ACL |
Refereed: | Yes |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:98287 |