Improving Word Alignment with Language Model Based Confidence Scores

Abstract

This paper describes the statistical machine translation systems submitted to the ACL-WMT 2008 shared translation task. Systems were submitted for two translation directions: English-Spanish and Spanish-English. Using sentence pair confidence scores estimated with source and target language models, improvements are observed on the News-Commentary test sets. Genre-dependent sentence pair confidence score and integration of sentence pair confidence score into phrase table are also investigated.

Publication
Proceedings of the Third Workshop on Statistical Machine Translation