2012.iwslt-evaluation.2@ACL

#1 The NICT ASR system for IWSLT2012

Authors: Hitoshi Yamamoto ; Youzheng Wu ; Chien-Lin Huang ; Xugang Lu ; Paul R. Dixon ; Shigeki Matsuda ; Chiori Hori ; Hideki Kashioka

This paper describes our automatic speech recognition (ASR) system for the IWSLT 2012 evaluation campaign. The target data of the campaign is selected from TED talks, a collection of public speeches in English on a variety of topics. Our ASR system is based on weighted finite-state transducers and combines acoustic models for spontaneous speech, language models based on n-grams and a factored recurrent neural network trained on effectively selected corpora, and an unsupervised topic adaptation framework that uses ASR results. The system achieved word error rates of 10.6% and 12.0% on the tst2011 and tst2012 evaluation sets, respectively.
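
For readers unfamiliar with the metric reported above, word error rate is commonly defined as the word-level edit distance (substitutions, deletions, and insertions) between the recognizer output and the reference transcript, divided by the number of reference words. The following is a minimal sketch of that standard definition, not the scoring tool used in the campaign; the function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration, and real evaluations normally apply text normalization before scoring.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return the word-level edit distance divided by the reference length.

    Hypothetical helper for illustration: tokens are split on whitespace
    and compared exactly, with no case folding or text normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution and one deletion against a four-word reference.
    print(word_error_rate("a b c d", "a x c"))
```

Under this definition, a 10.6% word error rate corresponds to roughly one word error for every nine to ten reference words.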