Discontinuous observation HMM for prosodic-event-based F0 generation

koriyama12@interspeech_2012@ISCA

Total: 1

#1 Discontinuous observation HMM for prosodic-event-based F0 generation [PDF] [Copy] [Kimi¹] [REL]

Authors: Tomoki Koriyama, Takashi Nose, Takao Kobayashi

This paper examines F0 modeling and generation techniques for spontaneous speech synthesis. In the previous study, we proposed a prosodic-unit HMM where the synthesis unit is defined as a segment between two prosodic events represented by a ToBI label framework. To take the advantage of the prosodic-unit HMM, continuous F0 sequences must be modeled from discontinuous F0 data including unvoiced regions. The conventional F0 models such as the MSD-HMM and the continuous F0 HMM are not always appropriate for such demand. To overcome this problem, we propose an alternative F0 model named discontinuous observation HMM (DO-HMM) where the unvoiced frames are regarded as missing data. We objectively evaluate the performance of the DO-HMM by comparing it with the conventional F0 modeling techniques and discuss the results.

Subject: INTERSPEECH.2012 - Speech Synthesis

koriyama12@interspeech_2012@ISCA

#1 Discontinuous observation HMM for prosodic-event-based F0 generation [PDF] [Copy] [Kimi1] [REL]

#1 Discontinuous observation HMM for prosodic-event-based F0 generation [PDF] [Copy] [Kimi¹] [REL]