Recommending Machine Translation Output to Translators by Estimating Translation Effort: A Case Study

Authors: Prashant Mathur, Nick Ruiz, and Marcello Federico

Polibits, Vol. 47, pp. 47-53, 2013.

Abstract: In this paper we use the statistics provided by a field experiment to explore the utility of supplying machine translation suggestions in a computer-assisted translation (CAT) environment. Regression models are trained for each user in order to estimate the time to edit (TTE) for the current translation segment. We use a combination of features from the current segment and aggregated features from formerly translated segments selected with content-based filtering approaches commonly used in recommendation systems. We present and evaluate decision function heuristics to determine if machine translation output will be useful for the translator in the given segment. We find that our regression models do a reasonable job for some users in predicting TTE given only a small number of training examples; although noise in the actual TTE for seemingly similar segments yields large error margins. We propose to include the estimation of TTE in CAT recommendation systems as a well-correlated metric for translation quality.

Keywords: Machine translation, computer-assisted translation, quality estimation, recommender systems

PDF: Recommending Machine Translation Output to Translators by Estimating Translation Effort: A Case Study
PDF: Recommending Machine Translation Output to Translators by Estimating Translation Effort: A Case Study