2013.iwslt-evaluation.6@ACL

Total: 1

#1 The NICT ASR system for IWSLT 2013 [PDF] [Copy] [Kimi1]

Authors: Chien-Lin Huang ; Paul R. Dixon ; Shigeki Matsuda ; Youzheng Wu ; Xugang Lu ; Masahiro Saiko ; Chiori Hori

This study presents the NICT automatic speech recognition (ASR) system submitted for the IWSLT 2013 ASR evaluation. We apply two types of acoustic features and three types of acoustic models to the NICT ASR system. Our system is comprised of six subsystems with different acoustic features and models. This study reports the individual results and fusion of systems and highlights the improvements made by our proposed methods that include the automatic segmentation of audio data, language model adaptation, speaker adaptive training of deep neural network models, and the NICT SprinTra decoder. Our experimental results indicated that our proposed methods offer good performance improvements on lecture speech recognition tasks. Our results denoted a 13.5% word error rate on the IWSLT 2013 ASR English test data set.