An oblivious approach to machine translation quality estimation

Itamar Elmakias, Dan Vilenchik

Research output: Contribution to journal, Article, peer-review

1 Scopus citation

Abstract

Machine translation (MT) is used by millions of people daily, so evaluating the quality of MT systems is an important task. Human expert evaluation of MT output remains the most accurate method, but it does not scale. Automatic procedures for Machine Translation Quality Estimation (MT-QE) are typically trained on a large corpus of source–target sentence pairs labeled with human judgment scores, and the test set is typically drawn from the same distribution as the training set. Recently, however, interest in low-resource and unsupervised MT-QE has gained momentum. In this paper, we define and study a further restriction of the unsupervised MT-QE setting, which we call oblivious MT-QE: besides having no access to human judgment scores, the algorithm has no access to the distribution of the test text. We propose an oblivious MT-QE system based on a new notion of sentence cohesiveness that we introduce. We tested our system on standard competition datasets for various language pairs. In all cases, its performance was comparable to that of the non-oblivious baseline system provided by the competition organizers. Our results suggest that reasonable MT-QE can be carried out even in the restrictive oblivious setting.

Original language: English
Article number: 2090
Journal: Mathematics
Volume: 9
Issue number: 17
State: Published - 1 Sep 2021

Keywords

  • Machine translation
  • Quality estimation
  • Unsupervised learning

ASJC Scopus subject areas

  • Mathematics (all)
