2010.iwslt-evaluation.5@ACL

Total: 1

#1 FBK @ IWSLT 2010 [PDF] [Copy] [Kimi1]

Authors: Arianna Bisazza ; Ioannis Klasinas ; Mauro Cettolo ; Marcello Federico

This year FBK took part in the BTEC translation task, with source languages Arabic and Turkish and target language English, and in the new TALK task, source English and target French. We worked in the framework of phrase-based statistical machine translation aiming to improve coverage of models in presence of rich morphology, on one side, and to make better use of available resources through data selection techniques. New morphological segmentation rules were developed for Turkish-English. The combination of several Turkish segmentation schemes into a lattice input led to an improvement wrt to last year. The use of additional training data was explored for Arabic-English, while on the English to French task improvement was achieved over a strong baseline by automatically selecting relevant and high quality data from the available training corpora.