choi20@interspeech_2020@ISCA

VCTUBE: A Library for Automatic Speech Data Annotation

Authors: Seong Choi; Seunghoon Jeong; Jeewoo Yoon; Migyeong Yang; Minsam Ko; Eunil Park; Jinyoung Han; Munyoung Lee; Seonghee Lee

We introduce VCTUBE, an open-source Python library that automatically generates <audio, text> pairs of speech data from a given YouTube URL. We believe VCTUBE is useful for easily collecting, processing, and annotating speech data for the development of speech synthesis systems.
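
To make the idea of an <audio, text> pair concrete, the following is a minimal sketch of how timed captions could be used to cut a decoded audio track into annotated segments. It is not VCTUBE's API: the names `Caption`, `Pair`, and `make_pairs` are hypothetical, and the sketch assumes the audio has already been downloaded and decoded into a sequence of samples, and that each caption carries a start time and a duration in seconds.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Caption:
    """Hypothetical caption entry: timing in seconds plus its text."""
    start: float
    duration: float
    text: str


@dataclass
class Pair:
    """One <audio, text> pair: the audio samples and their transcript."""
    audio: List[float]
    text: str


def make_pairs(samples: Sequence[float], sample_rate: int,
               captions: Sequence[Caption]) -> List[Pair]:
    """Slice `samples` at each caption's time span (illustrative only)."""
    pairs: List[Pair] = []
    for cap in captions:
        # Collapse runs of whitespace; skip captions with no text.
        text = " ".join(cap.text.split())
        if not text:
            continue
        # Convert seconds to sample indices, clamped to the track length.
        begin = max(0, int(round(cap.start * sample_rate)))
        end = min(len(samples),
                  int(round((cap.start + cap.duration) * sample_rate)))
        # Skip captions that fall entirely outside the audio.
        if end <= begin:
            continue
        pairs.append(Pair(audio=list(samples[begin:end]), text=text))
    return pairs
```

In this sketch, captions with empty text or with a time span outside the audio are dropped rather than padded, so every returned pair has both a non-empty transcript and at least one audio sample.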