2012.iwslt-evaluation.6@ACL

Total: 1

#1 FBK’s machine translation systems for IWSLT 2012’s TED lectures [PDF] [Copy] [Kimi1]

Authors: N. Ruiz ; A. Bisazza ; R. Cattoni ; M. Federico

This paper reports on FBK’s Machine Translation (MT) submissions at the IWSLT 2012 Evaluation on the TED talk translation tasks. We participated in the English-French and the Arabic-, Dutch-, German-, and Turkish-English translation tasks. Several improvements are reported over our last year baselines. In addition to using fill-up combinations of phrase-tables for domain adaptation, we explore the use of corpora filtering based on cross-entropy to produce concise and accurate translation and language models. We describe challenges encountered in under-resourced languages (Turkish) and language-specific preprocessing needs.