2008.iwslt-evaluation.17@ACL

Total: 1

#1 The TALP&I2R SMT systems for IWSLT 2008. [PDF] [Copy] [Kimi1]

Authors: Maxim Khalilov ; Maria R. Costa-jussà ; Carlos A. Henríquez Q. ; José A. R. Fonollosa ; Adolfo Hernández H. ; José B. Mariño ; Rafael E. Banchs ; Chen Boxing ; Min Zhang ; Aiti Aw ; Haizhou Li

This paper gives a description of the statistical machine translation (SMT) systems developed at the TALP Research Center of the UPC (Universitat Polite`cnica de Catalunya) for our participation in the IWSLT’08 evaluation campaign. We present Ngram-based (TALPtuples) and phrase-based (TALPphrases) SMT systems. The paper explains the 2008 systems’ architecture and outlines translation schemes we have used, mainly focusing on the new techniques that are challenged to improve speech-to-speech translation quality. The novelties we have introduced are: improved reordering method, linear combination of translation and reordering models and new technique dealing with punctuation marks insertion for a phrase-based SMT system. This year we focus on the Arabic-English, Chinese-Spanish and pivot Chinese-(English)-Spanish translation tasks.