Eigenvoice conversion based on Gaussian mixture model

toda06@interspeech_2006@ISCA

Total: 1

#1 Eigenvoice conversion based on Gaussian mixture model [PDF] [Copy] [Kimi] [REL]

Authors: Tomoki Toda, Yamato Ohtani, Kiyohiro Shikano

This paper describes a novel framework of voice conversion (VC). We call it eigenvoice conversion (EVC). We apply EVC to the conversion from a source speakers voice to arbitrary target speakers voices. Using multiple parallel data sets consisting of utterance-pairs of the source and multiple pre-stored target speakers, a canonical eigenvoice GMM (EV-GMM) is trained in advance. That conversion model enables us to flexibly control the speaker individuality of the converted speech by manually setting weight parameters. In addition, the optimum weight set for a specific target speaker is estimated using only speech data of the target speaker without any linguistic restrictions. We evaluate the performance of EVC by a spectral distortion measure. Experimental results demonstrate that EVC works very well even if we use only a few utterances of the target speaker for the weight estimation.

Subject: INTERSPEECH.2006 - Speech Synthesis