soufifar13@interspeech_2013@ISCA

Total: 1

#1 Regularized subspace n-gram model for phonotactic ivector extraction [PDF] [Copy] [Kimi1]

Authors: Mehdi Soufifar ; Lukáš Burget ; Oldřich Plchot ; Sandro Cumani ; Jan Černocký

Phonotactic language identification (LID) by means of n-gram statistics and discriminative classifiers is a popular approach for the LID problem. Low-dimensional representation of the n-gram statistics leads to the use of more diverse and efficient machine learning techniques in the LID. Recently, we proposed phototactic iVector as a low-dimensional representation of the n-gram statistics. In this work, an enhanced modeling of the n-gram probabilities along with regularized parameter estimation is proposed. The proposed model consistently improves the LID system performance over all conditions up to 15% relative to the previous state of the art system. The new model also alleviates memory requirement of the iVector extraction and helps to speed up subspace training. Results are presented in terms of Cavg over NIST LRE2009 evaluation set.