ng12@interspeech_2012@ISCA

#1 An alignment matching method to explore pseudosyllable properties across different corpora

Authors: Raymond W. M. Ng ; Thomas Hain ; Keikichi Hirose

A pseudosyllable unit was derived for English read speech recognition. It remains an open question whether the pseudosyllable unit can be extracted robustly, and how this unit could help the speech recognition process by giving some indication of the error pattern. In this study, an evaluation method which maps every hypothesis phoneme to every reference is proposed. Pseudosyllables extracted from two different sets of speech data are analysed. Mutual information is used to examine the relationship between different aspects of the pseudosyllable and the error pattern of the hypothesis phoneme. The results show that the pseudosyllable extraction algorithm is robust and gives units of a consistent nature. Pseudosyllables which have a phone triplet structure tend to have fewer insertion errors. Pseudosyllables which overlap with their neighbours are places where more insertion errors may occur.
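
The mutual information analysis mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `mutual_information`, the label values and the toy data are all hypothetical. It assumes each hypothesis phoneme carries one discrete pseudosyllable attribute (here, whether the unit has a phone triplet structure) and one discrete error label, and it estimates the mutual information from raw co-occurrence counts.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Estimate I(X;Y) in bits from paired discrete observations.

    Plain plug-in (maximum likelihood) estimate from counts; the abstract
    does not state which estimator the authors used.
    """
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    n = len(xs)
    if n == 0:
        return 0.0

    joint = Counter(zip(xs, ys))  # counts of (x, y) pairs
    px = Counter(xs)              # marginal counts of x
    py = Counter(ys)              # marginal counts of y

    mi = 0.0
    for (x, y), c in joint.items():
        # p(x,y) * log2(p(x,y) / (p(x) * p(y))), written in terms of counts
        mi += (c / n) * log2(c * n / (px[x] * py[y]))
    return mi


# Hypothetical toy data: one entry per hypothesis phoneme.
structure = ["triplet", "triplet", "other", "other", "triplet", "other"]
error = ["correct", "correct", "insertion", "insertion", "correct", "correct"]

mi_bits = mutual_information(structure, error)
print(f"I(structure; error) = {mi_bits:.3f} bits")
```

A value near zero would suggest that the attribute carries little information about the error label, while a larger value would suggest a stronger dependency. With small samples the plug-in estimate is biased upwards, so real use would need far more data than this toy example.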