CUED Publications database

N-gram posterior probability confidence measures for statistical machine translation: An empirical study

De Gispert, A and Blackwood, G and Iglesias, G and Byrne, W (2013) N-gram posterior probability confidence measures for statistical machine translation: An empirical study. Machine Translation, 27. pp. 85-114. ISSN 0922-6567

Full text not available from this repository.

Abstract

We report an empirical study of n-gram posterior probability confidence measures for statistical machine translation (SMT). We first describe an efficient and practical algorithm for rapidly computing n-gram posterior probabilities from large translation word lattices. These probabilities are shown to be a good predictor of whether or not the n-gram is found in human reference translations, motivating their use as a confidence measure for SMT. Comprehensive n-gram precision and word coverage measurements are presented for a variety of different language pairs, domains and conditions. We analyze the effect on reference precision of using single or multiple references, and compare the precision of posteriors computed from k-best lists to those computed over the full evidence space of the lattice. We also demonstrate improved confidence by combining multiple lattices in a multi-source translation framework. © 2012 The Author(s).
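
As a rough illustration of the quantity the abstract refers to, the sketch below estimates n-gram posterior probabilities from a k-best list, which the abstract names as the baseline that the lattice-based computation is compared against. It is not the paper's lattice algorithm. The function names (`ngrams`, `ngram_posteriors`), the input layout (a list of token sequences paired with log scores), and the normalisation of scores over the list are all assumptions made for this example.

```python
import math
from collections import defaultdict


def ngrams(tokens, n):
    """Return the set of distinct n-grams (as tuples) in a token sequence."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def ngram_posteriors(kbest, n):
    """Estimate n-gram posteriors from a k-best list.

    kbest: list of (tokens, log_score) pairs (hypothetical input format).
    The posterior of an n-gram is taken to be the total normalised
    probability of the hypotheses that contain it at least once.
    """
    # Normalise hypothesis scores over the list; subtracting the maximum
    # log score first keeps the exponentials numerically stable.
    max_log = max(score for _, score in kbest)
    weights = [math.exp(score - max_log) for _, score in kbest]
    total = sum(weights)

    posteriors = defaultdict(float)
    for (tokens, _), weight in zip(kbest, weights):
        # Each hypothesis contributes once per distinct n-gram it contains.
        for gram in ngrams(tokens, n):
            posteriors[gram] += weight / total
    return dict(posteriors)


if __name__ == "__main__":
    # Toy 3-best list with made-up scores, for illustration only.
    example = [
        ("the cat sat".split(), math.log(0.5)),
        ("the cat sits".split(), math.log(0.3)),
        ("a cat sat".split(), math.log(0.2)),
    ]
    for gram, p in sorted(ngram_posteriors(example, 2).items()):
        print(" ".join(gram), round(p, 3))
```

Under this definition an n-gram shared by many high-scoring hypotheses receives a posterior near 1, which is the intuition behind using it as a confidence measure. A k-best list covers only a small part of the hypothesis space that a lattice encodes.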

Item Type: Article
Subjects: UNSPECIFIED
Divisions: Div F > Machine Intelligence
Depositing User: Cron Job
Date Deposited: 17 Jul 2017 19:14
Last Modified: 10 Aug 2017 01:36
DOI: