2016.iwslt-1.12@ACL

Microsoft Speech Language Translation (MSLT) Corpus: The IWSLT 2016 release for English, French and German

Authors: Christian Federmann ; William D. Lewis

We describe the Microsoft Speech Language Translation (MSLT) corpus, which was created to evaluate end-to-end conversational speech translation quality. It was built from actual conversations over Skype, and we provide details on the recording setup and the different layers of associated text data. The corpus release includes Test and Dev sets with reference transcripts for speech recognition. Additionally, cleaned-up transcripts and reference translations are available for evaluating machine translation quality. The IWSLT 2016 release described here includes the source audio, raw transcripts, cleaned-up transcripts, and translations to or from English for both French and German.