Word embeddings and discourse information for Machine Translation Quality Estimation

C Scarton; D Beck; K Shah; KS Smith; L Specia

Conference Proceedings

Word embeddings and discourse information for Machine Translation Quality Estimation

C Scarton, D Beck, K Shah, KS Smith, L Specia

Proceedings of the Annual Meeting of the Association for Computational Linguistics | Published : 2016

DOI: 10.18653/v1/w16-2391

Abstract

In this paper we present the results of the University of Sheffield (SHEF) submissions for the WMT16 shared task on document-level Quality Estimation (Task 3). Our submission explore discourse and document-aware information and word embeddings as features, with Support Vector Regression and Gaussian Process used to train the Quality Estimation models. The use of word embeddings (combined with baseline features) and a Gaussian Process model with two kernels led to the winning submission in the shared task.

University of Melbourne Researchers

Daniel Beck Author

Grants

Awarded by Horizon 2020 Framework Programme

Citation metrics

12Scopus