Bayesian mixture of probabilistic linear regressions for voice conversion

The objective of voice conversion is to transform the voice of one speaker so that it sounds like that of another. The GMM-based statistical mapping technique has proven to be an efficient method for converting voices. We generalized this technique to a Mixture of Probabilistic Linear Regressions (MPLR) by using a general mixture model of source vectors. In this paper, we improve MPLR by placing a prior on the transformation parameters of the linear regressions, which leads to the Bayesian Mixture of Probabilistic Linear Regressions (BMPLR). BMPLR inherits the effectiveness and robustness of Bayesian inference. In particular, when the amount of training data is limited and the number of mixture components is large, BMPLR can substantially reduce overfitting. This paper presents two formulations of BMPLR, which differ in how noise is modeled in the probabilistic regression function. In addition, we derive equations for MAP estimation of the transformation parameters. We evaluate the proposed method on voice conversion of Japanese utterances. The experimental results show that BMPLR achieves better performance than MPLR.
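The abstract does not give the MAP equations, so the following is only a minimal sketch of the general idea, not the paper's formulation. It assumes scalar source and target features, a single mixture component with known responsibilities, a known noise variance, and a zero-mean isotropic Gaussian prior on the regression parameters. The function name `map_linear_regression` and the parameters `gammas`, `noise_var`, and `prior_precision` are hypothetical names introduced for illustration.

```python
def map_linear_regression(xs, ys, gammas, noise_var=1.0, prior_precision=0.0):
    """MAP estimate of (slope, bias) for y = slope * x + bias + noise.

    Assumptions (not taken from the paper): scalar features, Gaussian noise
    with variance noise_var, and a zero-mean Gaussian prior with precision
    prior_precision on both slope and bias. gammas are the responsibilities
    of one mixture component for each training pair.
    """
    # Responsibility-weighted sufficient statistics.
    sxx = sum(g * x * x for g, x in zip(gammas, xs))
    sx = sum(g * x for g, x in zip(gammas, xs))
    s1 = sum(gammas)
    sxy = sum(g * x * y for g, x, y in zip(gammas, xs, ys))
    sy = sum(g * y for g, y in zip(gammas, ys))

    # The prior adds noise_var * prior_precision to the diagonal of the
    # normal equations; with prior_precision = 0 this is weighted least squares.
    reg = noise_var * prior_precision
    a11, a12, a22 = sxx + reg, sx, s1 + reg

    # Solve the 2x2 system [[a11, a12], [a12, a22]] @ [slope, bias] = [sxy, sy].
    det = a11 * a22 - a12 * a12
    slope = (sxy * a22 - a12 * sy) / det
    bias = (a11 * sy - a12 * sxy) / det
    return slope, bias


# Toy data lying exactly on y = 2x + 1, fully assigned to the component.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]
gammas = [1.0, 1.0, 1.0, 1.0]

ml_slope, ml_bias = map_linear_regression(xs, ys, gammas)
map_slope, map_bias = map_linear_regression(xs, ys, gammas, prior_precision=10.0)
```

With `prior_precision=0.0` the estimate reduces to weighted least squares, the non-Bayesian counterpart. A positive precision shrinks the parameters toward zero, which is the mechanism by which a prior can limit overfitting when a component receives few training vectors.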

li12@interspeech_2012@ISCA
