seide11@interspeech_2011@ISCA

Total: 1

#1 Conversational speech transcription using context-dependent deep neural networks [PDF] [Copy] [Kimi1]

Authors: Frank Seide ; Gang Li ; Dong Yu

We apply the recently proposed Context-Dependent Deep-Neural-Network HMMs, CD-DNN-HMMs, to speech-to-text transcription. For single-pass speaker-independent recognition on the RT03S Fisher portion of phone-call transcription benchmark (Switchboard), the word-error rate is reduced from 27.4%, obtained by discriminatively trained Gaussian-mixture HMMs, to 18.5%.a 33% relative improvement.