Conference Proceedings
Word embeddings and discourse information for Machine Translation Quality Estimation
C Scarton, D Beck, K Shah, KS Smith, L Specia
Proceedings of the Annual Meeting of the Association for Computational Linguistics | Published : 2016
DOI: 10.18653/v1/w16-2391
Abstract
In this paper we present the results of the University of Sheffield (SHEF) submissions for the WMT16 shared task on document-level Quality Estimation (Task 3). Our submission explore discourse and document-aware information and word embeddings as features, with Support Vector Regression and Gaussian Process used to train the Quality Estimation models. The use of word embeddings (combined with baseline features) and a Gaussian Process model with two kernels led to the winning submission in the shared task.
Grants
Awarded by Horizon 2020 Framework Programme