2508.08715

Total: 1

#1 MultiGen: Child-Friendly Multilingual Speech Generator with LLMs [PDF2] [Copy] [Kimi3] [REL]

Authors: Xiaoxue Gao, Huayun Zhang, Nancy F. Chen

Generative speech models have demonstrated significant potential in improving human-machine interactions, offering valuable real-world applications such as language learning for children. However, achieving high-quality, child-friendly speech generation remains challenging, particularly for low-resource languages across diverse languages and cultural contexts. In this paper, we propose MultiGen, a multilingual speech generation model with child-friendly interaction, leveraging LLM architecture for speech generation tailored for low-resource languages. We propose to integrate age-appropriate multilingual speech generation using LLM architectures, which can be used to facilitate young children's communication with AI systems through culturally relevant context in three low-resource languages: Singaporean accent Mandarin, Malay, and Tamil. Experimental results from both objective metrics and subjective evaluations demonstrate the superior performance of the proposed MultiGen compared to baseline methods.

Subjects: Audio and Speech Processing , Artificial Intelligence , Computation and Language , Signal Processing

Publish: 2025-08-12 07:58:48 UTC