aylett09@interspeech_2009@ISCA

Total: 1

#1 Speech synthesis without a phone inventory [PDF] [Copy] [Kimi¹] [REL]

Authors: Matthew P. Aylett, Simon King, Junichi Yamagishi

In speech synthesis the unit inventory is decided using phonological and phonetic expertise. This process is resource intensive and potentially sub-optimal. In this paper we investigate how acoustic clustering, together with lexicon constraints, can be used to build a self-organised inventory. Six English speech synthesis systems were built using two frameworks, unit selection and parametric HTS for three inventory conditions: 1) a traditional phone set, 2) a system using orthographic units, and 3) a self-organised inventory. A listening test showed a strong preference for the classic system, and for the orthographic system over the self-organised system. Results also varied by letter to sound complexity and database coverage. This suggests the self-organised approach failed to generalise pronunciation as well as introducing noise above and beyond that caused by orthographic sound mismatch.

Subject: INTERSPEECH.2009 - Speech Synthesis

aylett09@interspeech_2009@ISCA

#1 Speech synthesis without a phone inventory [PDF] [Copy] [Kimi1] [REL]

#1 Speech synthesis without a phone inventory [PDF] [Copy] [Kimi¹] [REL]