choi20@interspeech_2020@ISCA

Total: 1

#1 VCTUBE : A Library for Automatic Speech Data Annotation [PDF] [Copy] [Kimi1] [REL]

Authors: Seong Choi, Seunghoon Jeong, Jeewoo Yoon, Migyeong Yang, Minsam Ko, Eunil Park, Jinyoung Han, Munyoung Lee, Seonghee Lee

We introduce an open-source Python library, VCTUBE, which can automatically generate <audio, text> pair of speech data from a given Youtube URL. We believe VCTUBE is useful for collecting, processing, and annotating speech data easily toward developing speech synthesis systems.