2014.iwslt-papers.12@ACL

Total: 1

#1 Offline extraction of overlapping phrases for hierarchical phrase-based translation [PDF] [Copy] [Kimi1]

Authors: Sariya Karimova ; Patrick Simianer ; Stefan Riezler

Standard SMT decoders operate by translating disjoint spans of input words, thus discarding information in form of overlapping phrases that is present at phrase extraction time. The use of overlapping phrases in translation may enhance fluency in positions that would otherwise be phrase boundaries, they may provide additional statistical support for long and rare phrases, and they may generate new phrases that have never been seen in the training data. We show how to extract overlapping phrases offline for hierarchical phrasebased SMT, and how to extract features and tune weights for the new phrases. We find gains of 0.3 − 0.6 BLEU points over discriminatively trained hierarchical phrase-based SMT systems on two datasets for German-to-English translation.