Improved speech-to-text translation with the Fisher and Callhome Spanish-English speech translation corpus

#1 Improved speech-to-text translation with the Fisher and Callhome Spanish-English speech translation corpus [PDF] [Copy] [Kimi¹] [REL]

Authors: Matt Post, Gaurav Kumar, Adam Lopez, Damianos Karakos, Chris Callison-Burch, Sanjeev Khudanpur

Research into the translation of the output of automatic speech recognition (ASR) systems is hindered by the dearth of datasets developed for that explicit purpose. For SpanishEnglish translation, in particular, most parallel data available exists only in vastly different domains and registers. In order to support research on cross-lingual speech applications, we introduce the Fisher and Callhome Spanish-English Speech Translation Corpus, supplementing existing LDC audio and transcripts with (a) ASR 1-best, lattice, and oracle output produced by the Kaldi recognition system and (b) English translations obtained on Amazon’s Mechanical Turk. The result is a four-way parallel dataset of Spanish audio, transcriptions, ASR lattices, and English translations of approximately 38 hours of speech, with defined training, development, and held-out test sets. We conduct baseline machine translation experiments using models trained on the provided training data, and validate the dataset by corroborating a number of known results in the field, including the utility of in-domain (information, conversational) training data, increased performance translating lattices (instead of recognizer 1-best output), and the relationship between word error rate and BLEU score.

Subject: IWSLT.2013 - Papers

2013.iwslt-papers.14@ACL

#1 Improved speech-to-text translation with the Fisher and Callhome Spanish-English speech translation corpus [PDF] [Copy] [Kimi1] [REL]

#1 Improved speech-to-text translation with the Fisher and Callhome Spanish-English speech translation corpus [PDF] [Copy] [Kimi¹] [REL]