2010.iwslt-evaluation.15@ACL

Total: 1

#1 The MIRACL Arabic-English statistical machine translation system for IWSLT 2010 [PDF] [Copy] [Kimi1]

Authors: Ines Turki Khemakhem ; Salma Jamoussi ; Abdelmajid Ben Hamadou

This paper describes the MIRACL statistical Machine Translation system and the improvements that were developed during the IWSLT 2010 evaluation campaign. We participated to the Arabic to English BTEC tasks using a phrase-based statistical machine translation approach. In this paper, we first discuss some challenges in translating from Arabic to English and we explore various techniques to improve performances on a such task. Next, we present our solution for disambiguating the output of an Arabic morphological analyzer. In fact, The Arabic morphological analyzer used produces all possible morphological structures for each word, with an unique correct proposition. In this work we exploit the Arabic-English alignment to choose the correct segmented form and the correct morpho-syntactic features produced by our morphological analyzer.